Stochastic equation and exponential ergodicity in Wasserstein distances for affine processes

Martin Friesen; Peng Jin; Barbara Rüdiger

arXiv:1901.05815·math.PR·March 17, 2022

TL;DR

This paper investigates the long-term behavior of affine processes, proving exponential ergodicity in Wasserstein distances under certain moment conditions, and establishes their representation as solutions to stochastic equations.

Contribution

It demonstrates that affine processes can be uniquely represented as solutions to stochastic equations and proves their exponential ergodicity in Wasserstein distances.

Findings

01

Affine processes are solutions to stochastic equations driven by Brownian motions and Poisson measures.

02

Subcritical affine processes exhibit exponential ergodicity in Wasserstein distances.

03

Moment conditions are crucial for establishing ergodicity.

Abstract

This work is devoted to the study of conservative affine processes on the canonical state space $D=\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$, where $m+n>0$. We show that each affine process can be obtained as the pathwise unique strong solution to a stochastic equation driven by Brownian motions and Poisson random measures. Then we study the long-time behavior of affine processes, i.e., we show that under first moment condition on the state-dependent and log-moment conditions on the state-independent jump measures, respectively, each subcritical affine process is exponentially ergodic in a suitably chosen Wasserstein distance. Moments of affine processes are studied as well.

Equations

$$\mathbb{E}_{x}\left(e^{i\langle u,X_{t}\rangle}\right)=\exp\left(\phi(t,iu)+\langle x,\psi(t,iu)\rangle\right),$$

$$I=\{1,\dots,m\},\qquad J=\{m+1,\dots,d\}.$$

$$A=\begin{pmatrix}A_{II}&A_{IJ}\\ A_{JI}&A_{JJ}\end{pmatrix},$$

$$\int_{D}\Big(1\wedge|\xi|^{2}+\sum_{i\in I}(1\wedge\xi_{i})\Big)\nu(d\xi)<\infty.$$

$$\mu_{i}(\{0\})=0,\qquad\int_{D}\Big(|\xi|\wedge|\xi|^{2}+\sum_{k\in I\backslash\{i\}}\xi_{k}\Big)\mu_{i}(d\xi)<\infty,\qquad i\in I.$$

$$\mathcal{U}=\mathbb{C}_{\leq 0}^{m}\times i\mathbb{R}^{n}=\left\{u=(u_{1},u_{2})\in\mathbb{C}^{m}\times\mathbb{C}^{n}\ |\ \mathrm{Re}(u_{1})\leq 0,\ \mathrm{Re}(u_{2})=0\right\}.$$

$$(Lf)(x)=\dots+\int_{D}\left(f(x+\xi)-f(x)-\langle\xi_{J},\nabla_{J}f(x)\rangle\mathbbm{1}_{\{|\xi|\leq 1\}}\right)\nu(d\xi)+\sum_{i=1}^{m}x_{i}\int_{D}\left(f(x+\xi)-f(x)-\langle\xi,\nabla f(x)\rangle\right)\mu_{i}(d\xi),$$

$$\int_{D}e^{\langle u,x^{\prime}\rangle}P_{t}(x,dx^{\prime})=\exp\left(\phi(t,u)+\langle x,\psi(t,u)\rangle\right),\qquad u\in\mathcal{U},\tag{1.1}$$

$\partial_{t}\phi(t,u)$

$\partial_{t}\psi_{I}(t,u)$

$\psi_{J}(t,u)$

$F(u)$

$R_{i}(u)$

$$(P_{t}\rho)(dx)=\int_{D}P_{t}(\widetilde{x},dx)\rho(d\widetilde{x}),\qquad t\geq 0,\ \rho\in\mathcal{P}(D).\tag{1.3}$$

$$\int_{D\times D}\left(f(x)+g(\widetilde{x})\right)H(dx,d\widetilde{x})=\int_{D}f(x)\rho(dx)+\int_{D}g(x)\widetilde{\rho}(dx).$$

$$\mathcal{P}_{d_{\kappa}}(D)=\left\{\rho\in\mathcal{P}(D)\ \Big|\ \int_{D}|x|^{\kappa}\rho(dx)<\infty\right\}.$$

$$\mathcal{P}_{d_{\log}}(D)=\left\{\rho\in\mathcal{P}(D)\ \Big|\ \int_{D}\log(1+|x|)\rho(dx)<\infty\right\}.$$

$$W_{d}(\rho,\widetilde{\rho})=\inf\left\{\int_{D\times D}d(x,\widetilde{x})H(dx,d\widetilde{x})\ \Big|\ H\in\mathcal{H}(\rho,\widetilde{\rho})\right\}.\tag{1.4}$$

$$\int_{D}f(x)\rho_{n}(dx)\longrightarrow\int_{D}f(x)\rho(dx),\qquad n\to\infty.$$

$$\int_{D}d(x,0)\rho_{n}(dx)\longrightarrow\int_{D}d(x,0)\rho(dx),\qquad n\to\infty.$$

$$\lim_{R\to\infty}\limsup_{n\to\infty}\int_{D}d(x,0)\mathbbm{1}_{\{d(x,0)\geq R\}}\rho_{n}(dx)=0.$$

$$\int_{|\xi|>1}\log(|\xi|)\nu(d\xi)<\infty.\tag{1.5}$$

$$W_{\log}(P_{t}\rho,\pi)\leq K\min\left\{e^{-\delta t},W_{\log}(\rho,\pi)\right\}+Ke^{-\delta t}W_{\log}(\rho,\pi),\qquad t\geq 0.\tag{1.6}$$

$$\int_{|\xi|>1}|\xi|^{\kappa}\nu(d\xi)<\infty,\tag{1.7}$$

$$W_{\kappa}(P_{t}\rho,\pi)\leq K^{\prime}W_{\kappa}(\rho,\pi)e^{-\delta^{\prime}t},\qquad t\geq 0.\tag{1.8}$$

$$W_{d}(P_{t}(x,\cdot),\pi)\leq Ke^{-\delta t}\left(1+W_{d}(\delta_{x},\pi)\right),\qquad t\geq 0,\ x\in D,\tag{1.9}$$

$$\frac{1}{t}\int_{0}^{t}f(X_{s})ds\longrightarrow\int_{D}f(x)\pi(dx),\qquad t\to\infty\tag{1.10}$$

$$\lim_{t\to\infty}\|P_{t}(x,\cdot)-\pi\|_{\mathrm{TV}}=0,\qquad x\in D,\tag{1.11}$$

$$\int_{D}e^{\langle u,x\rangle}\pi(dx)=\exp\left(\int_{0}^{\infty}F(\psi(t,u))dt\right),\qquad u\in\mathcal{U},$$

$$\mathbb{E}\left(|X_{t}(x)-X_{t}(\widetilde{x})|\right)\leq Ke^{-\delta t}\left(\mathbbm{1}_{\{n>0\}}|y-\widetilde{y}|^{1/2}+|x-\widetilde{x}|\right),\qquad t\geq 0,\tag{1.13}$$

$$\mathbb{E}\left(|Y_{t}(y)-Y_{t}(\widetilde{y})|\right)\leq d|y-\widetilde{y}|e^{-\delta^{\prime}t},\tag{1.14}$$

Full text

Martin Friesen (Fakultät für Mathematik und Naturwissenschaften, Bergische Universität Wuppertal, Gaußstraße 20, 42119 Wuppertal, Germany, [email protected])

Peng Jin (Department of Mathematics, Shantou University, Shantou, Guangdong 515063, China, [email protected])

Barbara Rüdiger (Fakultät für Mathematik und Naturwissenschaften, Bergische Universität Wuppertal, Gaußstraße 20, 42119 Wuppertal, Germany, [email protected])

Abstract: This work is devoted to the study of conservative affine processes on the canonical state space $D=\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$ , where $m+n>0$ . We show that each affine process can be obtained as the pathwise unique strong solution to a stochastic equation driven by Brownian motions and Poisson random measures. Then we study the long-time behavior of affine processes, i.e., we show that under first moment condition on the state-dependent and $\log$ -moment conditions on the state-independent jump measures, respectively, each subcritical affine process is exponentially ergodic in a suitably chosen Wasserstein distance. Moments of affine processes are studied as well.

AMS Subject Classification: 37A25; 60H10; 60J25

Keywords: affine process; ergodicity; Wasserstein distance; coupling; stochastic differential equation

1 Introduction and statement of the result

1.1 General introduction

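An affine process is a time-homogeneous Markov process $(X_{t})_{t\geq 0}$ whose characteristic function satisfies

$$\mathbb{E}_{x}\left(e^{i\langle u,X_{t}\rangle}\right)=\exp\left(\phi(t,iu)+\langle x,\psi(t,iu)\rangle\right),$$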

where $t\geq 0$ is the time and $X_{0}=x$ the starting point of the process. The general theory of affine processes, including a full characterization on the canonical state space $D=\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$ where $m,n\in\mathbb{N}_{0}$ and $m+n>0$ , was discussed in [14]. In particular, it is shown that the functions $\phi$ and $\psi$ should satisfy certain generalized Riccati equations. Common applications of affine processes in mathematical finance are interest rate models (e.g., the Cox-Ingersoll-Ross, Vašiček or general affine term structure short rate models), option pricing (e.g., the Heston model) and credit risk models, see also [1] and the references therein. After [14], the mathematical theory of affine processes was developed in various directions. Regularity of affine processes was studied in [37] and [38]. Based on a Hörmander-type condition, existence and smoothness of transition densities were obtained in [21]. Exponential moments for affine processes were studied in [30] and [35]. The theory of affine diffusions, i.e., processes without jumps, was developed in [20], while its application to large deviations for affine diffusions was studied in [32]. The possibility to obtain affine processes as multi-parameter time changes of Lévy processes was recently discussed in [12]. It is worthwhile to mention that the above list is, by far, not complete. For further references and additional details on the general theory of affine processes we refer to the book [1].

Below we describe two important sub-classes of affine processes. Continuous-state branching processes with immigration (CBI processes for short) are affine processes with state space $D=\mathbb{R}_{+}^{m}$. Such processes were first introduced in 1958 by Jiřina [26] and then studied in [52, 40, 48], where it was also shown that these processes arise as scaling limits of Galton-Watson processes. Various properties of one-dimensional CBI processes were studied in [25, 17, 11, 34, 22, 13] and [10]. For results applicable in arbitrary dimension we refer to [5], [7] and [19]. Let us mention that CBI processes are also measure-valued Markov processes as studied in [41]. Another important class of affine processes corresponds to the state space $D=\mathbb{R}^{n}$ and consists of processes of Ornstein-Uhlenbeck (OU) type. These processes also include Lévy processes as a particular case.

1.2 Affine processes

Let us describe affine processes in more detail. For $m,n\in\mathbb{N}_{0}$ let $d=n+m$ , and suppose that $d>0$ . In this work we study affine processes on the canonical state space $D=\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$ . Let

$$I=\{1,\dots,m\},\qquad J=\{m+1,\dots,d\}.$$

If $x\in D$ , then let $x_{I}=(x_{i})_{i\in I}$ and $x_{J}=(x_{j})_{j\in J}$ . Denote by $\mathbb{R}^{d\times d}$ the space of $d\times d$ -matrices. For $A\in\mathbb{R}^{d\times d}$ we write

$$A=\begin{pmatrix}A_{II}&A_{IJ}\\ A_{JI}&A_{JJ}\end{pmatrix},$$

where $A_{II}=(a_{ij})_{i,j\in I}$ , $A_{IJ}=(a_{ij})_{i\in I,\ j\in J}$ , $A_{JI}=(a_{ij})_{i\in J,\ j\in I}$ , and $A_{JJ}=(a_{ij})_{i,j\in J}$ . Denote by $S_{d}^{+}$ the space of symmetric and positive semidefinite $d\times d$ -matrices. Finally, let $\delta_{kl}$ , $k,l\in\{1,\dots,d\}$ , stand for the Kronecker-Delta.

Definition 1.1.

We call a tuple $(a,\alpha,b,\beta,\nu,\mu)$ admissible parameters, if they satisfy the following conditions:

(i) $a\in S_{d}^{+}$ with $a_{II}=0$, $a_{IJ}=0$ and $a_{JI}=0$.

(ii) $\alpha=(\alpha_{1},\dots,\alpha_{m})$ with $\alpha_{i}=(\alpha_{i,kl})_{1\leq k,l\leq d}\in S_{d}^{+}$ and $\alpha_{i,kl}=0$ if $k\in I\backslash\{i\}$ or $l\in I\backslash\{i\}$.

(iii) $b\in D$.

(iv) $\beta\in\mathbb{R}^{d\times d}$ is such that $\beta_{ki}-\int_{D}\xi_{k}\mu_{i}(d\xi)\geq 0$ for all $i\in I$ and $k\in I\backslash\{i\}$, and $\beta_{IJ}=0$.

(v) $\nu$ is a Borel measure on $D$ such that $\nu(\{0\})=0$ and

$$\int_{D}\Big(1\wedge|\xi|^{2}+\sum_{i\in I}(1\wedge\xi_{i})\Big)\nu(d\xi)<\infty.$$

(vi) $\mu=(\mu_{1},\dots,\mu_{m})$ where $\mu_{1},\dots,\mu_{m}$ are Borel measures on $D$ such that

$$\mu_{i}(\{0\})=0,\qquad\int_{D}\Big(|\xi|\wedge|\xi|^{2}+\sum_{k\in I\backslash\{i\}}\xi_{k}\Big)\mu_{i}(d\xi)<\infty,\qquad i\in I.$$

In contrast to [14], we do not consider killing for affine processes and, moreover, we suppose that $\mu_{1},\dots,\mu_{m}$ integrate $\mathbbm{1}_{\{|\xi|>1\}}|\xi|$ , i.e., the first moment for big jumps is finite. It is well-known that without killing and under first moment condition for the big jumps of $\mu_{1},\dots,\mu_{m}$ , the corresponding affine process (introduced below) is conservative (see [14, Lemma 9.2]). In this paper we work with Definition 1.1 and thus restrict our study to conservative affine processes. In order to simplify the notation, we have also set $\nu(\{0\})=0$ and $\mu_{i}(\{0\})=0$ , for $i\in I$ . Hence all integrals with respect to the measures $\mu_{1},\dots,\mu_{m},\nu$ can be taken over $D$ instead of $D\backslash\{0\}$ .
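As a rough illustration of Definition 1.1, the following sketch checks the matrix conditions (i), (iii) and (iv) in the jump-free case $\nu=0$, $\mu=0$, together with the eigenvalue condition on $\beta$ used in the ergodicity results. The function names `check_admissible` and `is_subcritical`, the tolerance `tol`, and the restriction to the jump-free case are assumptions made for this example and do not appear in the paper.

```python
import numpy as np

def check_admissible(a, b, beta, m, tol=1e-12):
    """Check conditions (i), (iii), (iv) of Definition 1.1 with nu = 0, mu = 0.

    a, beta: (d, d) arrays; b: (d,) array; m: number of nonnegative
    coordinates (I = first m coordinates, J = the remaining ones).
    """
    a = np.asarray(a, float)
    b = np.asarray(b, float)
    beta = np.asarray(beta, float)
    # (i) a symmetric positive semidefinite with a_II = 0, a_IJ = 0, a_JI = 0
    sym_psd = np.allclose(a, a.T) and np.linalg.eigvalsh((a + a.T) / 2).min() >= -tol
    a_zero = np.allclose(a[:m, :], 0.0) and np.allclose(a[:, :m], 0.0)
    # (iii) b lies in D = R_+^m x R^n
    b_ok = bool(np.all(b[:m] >= 0.0))
    # (iv) beta_IJ = 0 and, because mu = 0, beta_ki >= 0 for k != i in I
    beta_II = beta[:m, :m]
    off_diag = beta_II - np.diag(np.diag(beta_II))
    beta_ok = np.allclose(beta[:m, m:], 0.0) and bool(np.all(off_diag >= 0.0))
    return bool(sym_psd and a_zero and b_ok and beta_ok)

def is_subcritical(beta):
    """True if every eigenvalue of beta has negative real part."""
    return bool(np.all(np.linalg.eigvals(np.asarray(beta, float)).real < 0.0))

# Hypothetical parameters on D = R_+ x R (m = 1, n = 1)
a = [[0.0, 0.0], [0.0, 1.0]]
b = [1.0, 0.0]
beta = [[-1.0, 0.0], [0.5, -2.0]]
print(check_admissible(a, b, beta, m=1), is_subcritical(beta))
```

With nonzero $\mu_{i}$ the sign condition in (iv) would have to be applied to $\beta_{ki}-\int_{D}\xi_{k}\mu_{i}(d\xi)$ instead of $\beta_{ki}$.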

Denote by $B_{b}(D)$ the Banach space of bounded measurable functions over $D$ . This space is equipped with the supremum norm $\|f\|_{\infty}=\sup_{x\in D}|f(x)|$ . Define

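$$\mathcal{U}=\mathbb{C}_{\leq 0}^{m}\times i\mathbb{R}^{n}=\left\{u=(u_{1},u_{2})\in\mathbb{C}^{m}\times\mathbb{C}^{n}\ |\ \mathrm{Re}(u_{1})\leq 0,\ \mathrm{Re}(u_{2})=0\right\}.$$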

Note that $D\ni x\longmapsto e^{\langle u,x\rangle}$ is bounded for any $u\in\mathcal{U}$ . Here $\langle\cdot,\cdot\rangle$ denotes the Euclidean scalar product on $\mathbb{R}^{d}$ . By abuse of notation, we later also use $\langle\cdot,\cdot\rangle$ for the scalar product on $\mathbb{R}^{m}$ or $\mathbb{R}^{n}.$ The following is due to [14].

Theorem 1.2.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. Then there exists a unique conservative Feller semigroup $(P_{t})_{t\geq 0}$ on $B_{b}(D)$ with generator $(L,D(L))$ such that $C_{c}^{2}(D)\subset D(L)$ and, for $f\in C_{c}^{2}(D)$ and $x\in D$ ,

[TABLE]

where $\nabla_{J}=(\frac{\partial}{\partial x_{j}})_{j\in J}$ . Moreover, $C_{c}^{\infty}(D)$ is a core for the generator. Let $P_{t}(x,dx^{\prime})$ be the transition probabilities. Then

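$$\int_{D}e^{\langle u,x^{\prime}\rangle}P_{t}(x,dx^{\prime})=\exp\left(\phi(t,u)+\langle x,\psi(t,u)\rangle\right),\qquad u\in\mathcal{U},\tag{1.1}$$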

where $\phi:\mathbb{R}_{+}\times\mathcal{U}\longrightarrow\mathbb{C}$ and $\psi:\mathbb{R}_{+}\times\mathcal{U}\longrightarrow\mathbb{C}^{d}$ are uniquely determined by the generalized Riccati differential equations: for $u=(u_{1},u_{2})\in\mathbb{C}_{\leq 0}^{m}\times i\mathbb{R}^{n}$ ,

[TABLE]

and $F$ , $R$ are of Lévy-Khintchine form

[TABLE]

Consequently, there exists a unique Feller process $X$ with generator $L$ . This process is called affine process with admissible parameters $(a,\alpha,b,\beta,\nu,\mu)$ .
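To make the exponential-affine transform formula concrete, here is a minimal sketch for the simplest case $m=0$, $n=1$: a Gaussian Ornstein-Uhlenbeck process $dX_{t}=(b+\beta X_{t})dt+\sigma dB_{t}$. The parametrisation with `sigma` and the closed-form expressions for $\phi$ and $\psi$ are the standard ones for this example and are assumptions of the sketch, not formulas quoted from the paper. The code compares the affine form $\exp(\phi(t,iu)+x\psi(t,iu))$ with the characteristic function of the known Gaussian law of $X_{t}$.

```python
import cmath
import math

def ou_phi_psi(t, u, b, beta, sigma):
    """phi(t, iu) and psi(t, iu) for dX = (b + beta*X) dt + sigma dB, beta != 0."""
    psi = 1j * u * math.exp(beta * t)
    phi = (1j * u * b * (math.exp(beta * t) - 1.0) / beta
           - sigma**2 * u**2 * (math.exp(2.0 * beta * t) - 1.0) / (4.0 * beta))
    return phi, psi

def ou_char_fn_affine(t, u, x, b, beta, sigma):
    """Characteristic function written in exponential-affine form."""
    phi, psi = ou_phi_psi(t, u, b, beta, sigma)
    return cmath.exp(phi + x * psi)

def ou_char_fn_gaussian(t, u, x, b, beta, sigma):
    """Characteristic function of the Gaussian law of X_t started at x."""
    mean = x * math.exp(beta * t) + b * (math.exp(beta * t) - 1.0) / beta
    var = sigma**2 * (math.exp(2.0 * beta * t) - 1.0) / (2.0 * beta)
    return cmath.exp(1j * u * mean - 0.5 * u**2 * var)

# Hypothetical parameter values
args = (0.7, 1.3, 2.0, 0.5, -1.2, 0.8)  # t, u, x, b, beta, sigma
print(abs(ou_char_fn_affine(*args) - ou_char_fn_gaussian(*args)))
```

In this example $\psi$ solves the linear equation $\partial_{t}\psi=\beta\psi$ and $\phi$ is obtained by integrating $b\psi+\tfrac{1}{2}\sigma^{2}\psi^{2}$ in time.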

Remark 1.3.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. According to [14, Lemma 10.1 and Lemma 10.2], the martingale problem with generator $L$ and domain $C_{c}^{\infty}(D)$ is well-posed in the Skorokhod space over $D$ equipped with the usual Skorokhod topology. Hence, we can characterise an affine process with admissible parameters $(a,\alpha,b,\beta,\nu,\mu)$ as the unique solution to the martingale problem with generator $L$ and domain $C_{c}^{\infty}(D)$ . In any case, it can be constructed as a Markov process on the Skorokhod space over $D$ .

Affine processes are thus constructed on the canonical state space. In order to prove the main result of this work, we provide in Section 4 a pathwise construction of affine processes. The latter one extends previous cases from the literature such as [15, 20, 43] and [5].

1.3 Ergodicity in Wasserstein distance for affine processes

Let $\mathcal{P}(D)$ be the space of all Borel probability measures over $D$ . By abuse of notation, we extend the transition semigroup $(P_{t})_{t\geq 0}$ (given by Theorem 1.2) onto $\mathcal{P}(D)$ via

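$$(P_{t}\rho)(dx)=\int_{D}P_{t}(\widetilde{x},dx)\rho(d\widetilde{x}),\qquad t\geq 0,\ \rho\in\mathcal{P}(D).\tag{1.3}$$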

Then $P_{t}\rho$ describes the distribution of the affine process at time $t\geq 0$ such that it has at initial time $t=0$ law $\rho$ . Note that $P_{t}\delta_{x}=P_{t}(x,\cdot)$ , and $(P_{t})_{t\geq 0}$ is a semigroup on $\mathcal{P}(D)$ in the sense that $P_{t+s}\rho=P_{t}P_{s}\rho$ , for any $t,s\geq 0$ and $\rho\in\mathcal{P}(D)$ . Such semigroup property is simply a compact notation for the Chapman-Kolmogorov equations satisfied by $P_{t}(x,\cdot)$ . Since the martingale problem with generator $L$ and domain $C_{c}^{\infty}(D)$ is well-posed, and $C_{c}^{\infty}(D)\subset D(L)$ is a core (see Theorem 1.2 and Remark 1.3), it follows from [16, Proposition 9.2] that, for some given $\pi\in\mathcal{P}(D)$ , the following properties are equivalent:

(i) $P_{t}\pi=\pi$, for all $t\geq 0$.

(ii) $\int_{D}(Lf)(x)\pi(dx)=0$, for all $f\in C_{c}^{\infty}(D)$.

(iii) $\int_{D}(P_{t}f)(x)\pi(dx)=\int_{D}f(x)\pi(dx)$, for all $t\geq 0$ and all $f\in B(D)$.

A distribution $\pi\in\mathcal{P}(D)$ which satisfies one of these properties (i) – (iii) is called invariant distribution for the semigroup $(P_{t})_{t\geq 0}$ . In this work we will prove that, under some appropriate assumptions, $(P_{t})_{t\geq 0}$ has a unique invariant distribution $\pi$ , this distribution has some finite $\log$ -moment and, moreover, $P_{t}(x,\cdot)\longrightarrow\pi$ with exponential rate. For this purpose we use the Wasserstein distance on $\mathcal{P}(D)$ introduced below. Given $\rho,\widetilde{\rho}\in\mathcal{P}(D)$ , a coupling $H$ of $(\rho,\widetilde{\rho})$ is a Borel probability measure on $D\times D$ which has marginals $\rho$ and $\widetilde{\rho}$ , respectively, i.e., for $f,g\in B(D)$ it holds that

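$$\int_{D\times D}\left(f(x)+g(\widetilde{x})\right)H(dx,d\widetilde{x})=\int_{D}f(x)\rho(dx)+\int_{D}g(x)\widetilde{\rho}(dx).$$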

Denote by $\mathcal{H}(\rho,\widetilde{\rho})$ the collection of all such couplings. Let us now introduce two different metrics on $D$ as follows:

(a) Define, for $\kappa\in(0,1]$, $d_{\kappa}(x,\widetilde{x})=\left(\mathbbm{1}_{\{n>0\}}|y-\widetilde{y}|^{1/2}+|x-\widetilde{x}|\right)^{\kappa}$, $x=(y,z),\ \widetilde{x}=(\widetilde{y},\widetilde{z})\in\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$, and let

$$\mathcal{P}_{d_{\kappa}}(D)=\left\{\rho\in\mathcal{P}(D)\ \Big|\ \int_{D}|x|^{\kappa}\rho(dx)<\infty\right\}.$$

(b) Introduce $d_{\log}(x,\widetilde{x})=\log(1+\mathbbm{1}_{\{n>0\}}|y-\widetilde{y}|^{1/2}+|x-\widetilde{x}|)$, $x=(y,z),\ \widetilde{x}=(\widetilde{y},\widetilde{z})\in\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$, and let

$$\mathcal{P}_{d_{\log}}(D)=\left\{\rho\in\mathcal{P}(D)\ \Big|\ \int_{D}\log(1+|x|)\rho(dx)<\infty\right\}.$$

Let $d\in\{d_{\log},d_{\kappa}\}$ . The Wasserstein distance on $\mathcal{P}_{d}(D)$ is defined by

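$$W_{d}(\rho,\widetilde{\rho})=\inf\left\{\int_{D\times D}d(x,\widetilde{x})H(dx,d\widetilde{x})\ \Big|\ H\in\mathcal{H}(\rho,\widetilde{\rho})\right\}.\tag{1.4}$$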
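The two metrics and the Wasserstein distance can be made concrete with a short sketch. It assumes that $|\cdot|$ is the Euclidean norm, and it computes the Wasserstein distance only in a special case: two empirical measures on the real line with equally many atoms and cost $|x-\widetilde{x}|$, where sorting both samples gives an optimal coupling. The function names are hypothetical.

```python
import math

def d_kappa(x, x_tilde, m, kappa=1.0):
    """Metric d_kappa on D = R_+^m x R^n; x and x_tilde have length m + n."""
    n = len(x) - m
    diff = math.sqrt(sum((p - q) ** 2 for p, q in zip(x, x_tilde)))
    diff_y = math.sqrt(sum((p - q) ** 2 for p, q in zip(x[:m], x_tilde[:m])))
    # the additional term |y - y_tilde|^(1/2) is present only if n > 0
    extra = math.sqrt(diff_y) if n > 0 else 0.0
    return (extra + diff) ** kappa

def d_log(x, x_tilde, m):
    """Metric d_log, i.e. log(1 + d_1)."""
    return math.log(1.0 + d_kappa(x, x_tilde, m, kappa=1.0))

def w1_empirical_1d(xs, ys):
    """W_1 between two empirical measures on R with equally many atoms
    for the cost |x - y|; pairing the sorted samples is optimal here."""
    if len(xs) != len(ys):
        raise ValueError("samples must have equal length")
    return sum(abs(p - q) for p, q in zip(sorted(xs), sorted(ys))) / len(xs)

print(d_kappa((1.0, 0.0), (0.0, 0.0), m=1))      # m = 1, n = 1
print(d_log((1.0, 0.0), (0.0, 0.0), m=1))
print(w1_empirical_1d([0.0, 1.0], [2.0, 1.0]))
```

For the concave costs $d_{\kappa}$ with $\kappa<1$ and $d_{\log}$ the sorted pairing is in general not optimal, so the last function should not be used with those costs.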

The appearance of the additional term $\mathbbm{1}_{\{n>0\}}|y-\widetilde{y}|^{1/2}$ is purely technical; it is a consequence of the estimates proved in Section 6. By the general theory of Wasserstein distances, $(\mathcal{P}_{d}(D),W_{d})$ is a complete separable metric space, see, e.g., [50, Theorem 6.18]. Convergence with respect to these distances is explained in the following remark, see also [50, Theorem 6.9].

Remark 1.4.

Let $d\in\{d_{\log},d_{\kappa}\}$, $(\rho_{n})_{n\in\mathbb{N}}\subset\mathcal{P}_{d}(D)$ and $\rho\in\mathcal{P}_{d}(D)$. The following are equivalent:

(i) $W_{d}(\rho_{n},\rho)\longrightarrow 0$ as $n\to\infty$.

(ii) For each continuous function $f:D\longrightarrow\mathbb{R}$ with $|f(x)|\leq C_{f}(1+d(x,0))$, it holds that

$$\int_{D}f(x)\rho_{n}(dx)\longrightarrow\int_{D}f(x)\rho(dx),\qquad n\to\infty.$$

(iii) $\rho_{n}\longrightarrow\rho$ weakly as $n\to\infty$, and

$$\int_{D}d(x,0)\rho_{n}(dx)\longrightarrow\int_{D}d(x,0)\rho(dx),\qquad n\to\infty.$$

(iv) $\rho_{n}\longrightarrow\rho$ weakly as $n\to\infty$, and

$$\lim_{R\to\infty}\limsup_{n\to\infty}\int_{D}d(x,0)\mathbbm{1}_{\{d(x,0)\geq R\}}\rho_{n}(dx)=0.$$

For simplicity of notation, we let $\mathcal{P}_{\kappa}(D)=\mathcal{P}_{d_{\kappa}}(D)$, $\mathcal{P}_{\log}(D)=\mathcal{P}_{d_{\log}}(D)$, $W_{\kappa}=W_{d_{\kappa}}$, and $W_{\log}=W_{d_{\log}}$. Then it is easy to see that $\mathcal{P}_{\kappa}(D)\subset\mathcal{P}_{\log}(D)$ and $W_{\log}\leq C_{\kappa}W_{\kappa}$, for some constant $C_{\kappa}>0$, i.e., $W_{\kappa}$ is stronger than $W_{\log}$. The following is our main result.

Theorem 1.5.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. Suppose that $\beta$ has only eigenvalues with negative real parts, and

$$\int_{|\xi|>1}\log(|\xi|)\nu(d\xi)<\infty.\tag{1.5}$$

Then $(P_{t})_{t\geq 0}$ has a unique invariant distribution $\pi$ and the following assertions hold:

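(a) $\pi\in\mathcal{P}_{\log}(D)$ and there exist constants $K,\delta>0$ such that, for all $\rho\in\mathcal{P}_{\log}(D)$,

$$W_{\log}(P_{t}\rho,\pi)\leq K\min\left\{e^{-\delta t},W_{\log}(\rho,\pi)\right\}+Ke^{-\delta t}W_{\log}(\rho,\pi),\qquad t\geq 0.\tag{1.6}$$

(b) If there exists $\kappa\in(0,1]$ satisfying

$$\int_{|\xi|>1}|\xi|^{\kappa}\nu(d\xi)<\infty,\tag{1.7}$$

then $\pi\in\mathcal{P}_{\kappa}(D)$ and there exist constants $K^{\prime},\delta^{\prime}>0$ such that, for all $\rho\in\mathcal{P}_{\kappa}(D)$,

$$W_{\kappa}(P_{t}\rho,\pi)\leq K^{\prime}W_{\kappa}(\rho,\pi)e^{-\delta^{\prime}t},\qquad t\geq 0.\tag{1.8}$$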
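The difference between the $\log$-moment condition (1.5) and the $\kappa$-moment condition (1.7) can be seen on a hypothetical example with $D=\mathbb{R}_{+}$: take the big-jump part of $\nu$ to be $\mathbbm{1}_{\{\xi>1\}}\xi^{-1-\alpha}d\xi$ for some $\alpha>0$. This choice of $\nu$, and the function names below, are assumptions made for illustration. The sketch checks numerically that the $\log$-moment equals $1/\alpha^{2}$ and returns the closed form $1/(\alpha-\kappa)$ of the $\kappa$-moment, which is finite only for $\kappa<\alpha$.

```python
import math

def tail_log_moment(alpha, steps=200_000):
    """Numerically integrate log(xi) * xi^(-1-alpha) over xi > 1.

    Substituting xi = exp(s) turns this into the integral of
    s * exp(-alpha*s) over s > 0, approximated by the trapezoidal
    rule on [0, 60/alpha] (the neglected tail is of order exp(-60)).
    """
    s_max = 60.0 / alpha
    h = s_max / steps
    total = 0.0
    for k in range(steps + 1):
        s = k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * s * math.exp(-alpha * s)
    return total * h

def tail_power_moment(alpha, kappa):
    """Closed form of the integral of xi^kappa * xi^(-1-alpha) over xi > 1."""
    return 1.0 / (alpha - kappa) if kappa < alpha else math.inf

alpha = 0.5
print(tail_log_moment(alpha), 1.0 / alpha**2)  # log-moment is finite
print(tail_power_moment(alpha, 1.0))           # first moment is infinite
print(tail_power_moment(alpha, 0.25))          # kappa-moment for kappa < alpha
```

In this example (1.5) holds for every $\alpha>0$, while (1.7) holds exactly for those $\kappa\in(0,1]$ with $\kappa<\alpha$.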

It is worthwhile to mention that to our knowledge a convergence rate solely under a $\log$ -moment condition on the state-independent jump measure was not even obtained for one-dimensional CBI processes. In order that $W_{\log}(P_{t}\rho,\pi)$ and $W_{\kappa}(P_{t}\rho,\pi)$ are well-defined, we need to show that $P_{t}\rho$ belongs to $\mathcal{P}_{\log}(D)$ or $\mathcal{P}_{\kappa}(D)$ , respectively. This will be shown in Section 5, where general moment estimates for affine processes are studied. Using $P_{t}\delta_{x}=P_{t}(x,\cdot)$ combined with Remark 1.4 we conclude the following.

Remark 1.6.

Under the conditions of Theorem 1.5, there exist constants $\delta,K>0$ such that

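$$W_{d}(P_{t}(x,\cdot),\pi)\leq Ke^{-\delta t}\left(1+W_{d}(\delta_{x},\pi)\right),\qquad t\geq 0,\ x\in D,\tag{1.9}$$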

where $d\in\{d_{\kappa},d_{\log}\}$ . Let $W_{d\wedge 1}$ be the Wasserstein distance given by (1.4) with $d$ replaced by $d\wedge 1$ . Then similarly to Remark 1.4, convergence with respect to $W_{d\wedge 1}$ is equivalent to weak convergence of probability measures on $\mathcal{P}(D)$ . Since $W_{d\wedge 1}\leq W_{d}$ , we conclude from (1.9) that $P_{t}(x,\cdot)\longrightarrow\pi$ weakly as $t\to\infty$ with exponential rate.

Let $X=(X_{t})_{t\geq 0}$ be an affine process. For the parameter estimation of affine models, see, e.g., [3], [42] and [2], it is often necessary to prove a Birkhoff ergodic theorem, i.e.,

$$\frac{1}{t}\int_{0}^{t}f(X_{s})ds\longrightarrow\int_{D}f(x)\pi(dx),\qquad t\to\infty\tag{1.10}$$

holds almost surely for sufficiently many test functions $f$ . Using classical theory, see, e.g., [45, Theorem 17.1.7] and [47], such convergence is implied by the ergodicity in the total variation distance, i.e., by

$$\lim_{t\to\infty}\|P_{t}(x,\cdot)-\pi\|_{\mathrm{TV}}=0,\qquad x\in D,\tag{1.11}$$

where $\|\cdot\|_{\mathrm{TV}}$ denotes the total variation distance. Unfortunately, it is typically a very difficult mathematical task to prove (1.11) even for particular models. An extension of (1.10) applicable in the case where $P_{t}(x,\cdot)\longrightarrow\pi$ holds in the Wasserstein distance generated by the metric $d(x,\widetilde{x})=1\wedge|x-\widetilde{x}|$ was recently studied in [47]. Applying the main result of [47] to the case of affine processes and using the fact that each affine process can be obtained as a pathwise unique strong solution to some stochastic equation with jumps (see Section 4), yields the following corollary.

Corollary 1.7.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. Suppose that $\beta$ has only eigenvalues with negative real parts, and (1.5) is satisfied. Let $(X_{t})_{t\geq 0}$ be the corresponding affine process constructed as the pathwise unique strong solution on a complete probability space $(\Omega,\mathcal{F},\mathbb{P})$ in Section 4. Let $f\in L^{p}(D,\pi)$ for some $p\in[1,\infty)$ , then (1.10) holds in $L^{p}(\Omega,\mathbb{P})$ .

Although we have formulated (1.10) in continuous time, the discrete-time analog can be obtained in the same manner.
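The ergodic averages in (1.10) can be illustrated numerically in the simplest subcritical case, a Gaussian Ornstein-Uhlenbeck process $dX_{t}=(b+\beta X_{t})dt+\sigma dB_{t}$ with $\beta<0$ (so $m=0$, $n=1$). The sketch below is only an illustration under these assumptions: the process is sampled exactly on a time grid, the time integral is replaced by a Riemann sum, and $f(x)=x$ is used as test function, for which the limit is the stationary mean $-b/\beta$. The function name and parameter values are hypothetical.

```python
import math
import random

def ou_time_average(x0, b, beta, sigma, dt, n_steps, seed=0):
    """Riemann-sum approximation of (1/T) * int_0^T X_s ds, T = n_steps * dt,
    for dX = (b + beta*X) dt + sigma dB sampled exactly on a grid of mesh dt."""
    rng = random.Random(seed)
    decay = math.exp(beta * dt)
    mean_shift = b * (decay - 1.0) / beta
    # standard deviation of the Gaussian transition over one grid step
    step_std = sigma * math.sqrt((decay**2 - 1.0) / (2.0 * beta))
    x, running_sum = x0, 0.0
    for _ in range(n_steps):
        running_sum += x
        x = decay * x + mean_shift + step_std * rng.gauss(0.0, 1.0)
    return running_sum / n_steps

b, beta, sigma = 1.0, -1.0, 1.0
avg = ou_time_average(x0=5.0, b=b, beta=beta, sigma=sigma, dt=0.1, n_steps=200_000)
print(avg, -b / beta)  # time average versus stationary mean
```

The average is taken over the discrete skeleton of the process, in line with the discrete-time analog mentioned above.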

1.4 Comparison with related works

Consider an Ornstein-Uhlenbeck process on $\mathbb{R}^{n}$ , i.e., an affine process on state space $D=\mathbb{R}^{n}$ with admissible parameters $(a,\alpha=0,b,\beta,\nu,\mu=0)$ . If $\beta$ has only eigenvalues with negative real parts and (1.5) is satisfied, then [49] is applicable and hence the corresponding Ornstein-Uhlenbeck process satisfies, for all $x\in\mathbb{R}^{n}$ , $P_{t}(x,\cdot)\longrightarrow\pi$ weakly as $t\to\infty$ . Under additional technical conditions on the measure $\nu$ , it follows that the corresponding process also satisfies (1.11) with exponential rate, see [51]. Since in view of Theorem 1.5 the convergence (in the Wasserstein distance) has already exponential rate, we conclude that the additional restriction on $\nu$ imposed in [51] is only used to guarantee that convergence takes place in the stronger total variation distance, i.e., it is not necessary for the speed of convergence.

Consider a subcritical multi-type CBI process on $\mathbb{R}_{+}^{m}$ , i.e., an affine process on state space $D=\mathbb{R}_{+}^{m}$ for which the parameter $\beta$ has only eigenvalues with negative real parts. In dimension $m=1$ , Pinsky [46] announced (without proof) the existence of a limiting distribution under condition (1.5). A proof of this fact was then given in [36, Theorem 3.16], while in [41, Theorem 3.20 and Corollary 3.21] it was shown that $P_{t}(x,\cdot)\longrightarrow\pi$ is equivalent to (1.5). Some properties of the invariant distribution $\pi$ have been studied in [34]. In [42] exponential ergodicity in total variation distance, see (1.11), was established for one-dimensional subcritical CBI processes with $\nu=0$ , while some other related results for stochastic equations on $\mathbb{R}_{+}$ have been recently considered in [18]. An extension of the techniques from [42] to arbitrary dimension $m\geq 2$ is still an interesting open problem. Recently, in [44] another approach for the exponential ergodicity in the total variation distance for affine processes on cones, including multi-type CBI processes, was provided. Their techniques were closely related to stochastic stability of Markov processes in the sense of Meyn and Tweedie [45], see also the references therein. More precisely, it was shown that each subcritical CBI process $X$ which is $\nu$ -irreducible, aperiodic and has finite second moments, where $\nu$ is a reference measure with its support having non-empty interior, is exponentially ergodic in the total variation distance. As such a result is formulated in a very general way, it becomes a delicate mathematical task to show that such conditions are satisfied for CBI processes with jumps of infinite activity or with degenerate diffusion components. Moreover, assuming that $X$ has at least finite second moments rules out some natural examples as studied in [42] for $m=1$ and in Section 2 of this work. 
In contrast, our results can be applied in arbitrary dimension without the need to prove irreducibility or aperiodicity, paying the price that we use the Wasserstein distance instead. Let us mention that recently also asymptotic results for supercritical CBI processes have been obtained in [33, 9, 8].

Consider now the general case of an affine process on the canonical state space $D=\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$. Based on the stability theory of Markov chains in the sense of Meyn and Tweedie, the long-time behavior of some particular two-dimensional models on state space $D=\mathbb{R}_{+}\times\mathbb{R}$ was studied in [4, 28]. These results have been further developed in [53] for arbitrary dimensions, where also functional limit theorems were obtained. In order to prove irreducibility and aperiodicity, the authors supposed that the diffusion component is non-degenerate and that $\nu$ and $\mu_{1},\dots,\mu_{m}$ are probability measures, i.e., the corresponding affine process has only jumps of finite variation. Independently in [29] the following result was obtained.

Theorem 1.8.

([29]) Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. Suppose that $\beta$ has only eigenvalues with negative real parts and (1.5) is satisfied. Then there exists a unique invariant distribution $\pi$ for $(P_{t})_{t\geq 0}$. Moreover, $\pi$ has Laplace transform

$$\int_{D}e^{\langle u,x\rangle}\pi(dx)=\exp\left(\int_{0}^{\infty}F(\psi(t,u))dt\right),\qquad u\in\mathcal{U},$$

and one has, for all $x\in D$ , $P_{t}(x,\cdot)\longrightarrow\pi$ weakly as $t\to\infty$ .

The proof of Theorem 1.8 is based on a fine stability analysis of the Riccati equations (1.2). Comparing with our main result Theorem 1.5, the authors have, in addition, established a formula for the Laplace transform of $\pi$ , but have not studied any convergence rate. We emphasize that the main aim of our Theorem 1.5 is to establish the exponential convergence speed (1.6) and (1.8) with respect to the corresponding Wasserstein metrics. However, in the process of proving (1.6) we also obtain the existence and uniqueness of an invariant distribution as a natural by-product. Moreover, in Theorem 1.5 and Theorem 1.8 existence and uniqueness of an invariant distribution is shown by essentially different techniques.

1.5 Main idea of proof and structure of the work

The proof of Theorem 1.5 is divided into four steps as explained below.

Step 1. Provide a stochastic description of conservative affine processes. More precisely, in Section 3 we discuss a stochastic equation for multi-type CBI processes and recall a comparison principle due to [5]. In Section 4 we prove that each affine process can be obtained as the pathwise unique strong solution $(X_{t}(x))_{t\geq 0}$ to a certain stochastic equation, where $x=(y,z)\in\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$ denotes the initial condition. The particular structure of this equation shows that the process takes the form $X_{t}(x)=(Y_{t}(y),Z_{t}(x))$ , where $(Y_{t}(y))_{t\geq 0}\subset\mathbb{R}_{+}^{m}$ is a CBI process with initial condition $y$ and $(Z_{t}(x))_{t\geq 0}$ is an OU-type process with initial condition $z$ whose coefficients depend on the process $(Y_{t}(y))_{t\geq 0}$ .

Step 2. Let $(X_{t})_{t\geq 0}$ be an affine process. Based on the stochastic equation from the first step, we study in Section 5 finiteness of the moments $\mathbb{E}(|X_{t}|^{\kappa})$ and $\mathbb{E}(\log(1+|X_{t}|))$ . Since the proofs in this section are rather standard, we only outline the main steps, while technical details are postponed to the appendix.

Step 3. Let $\left(X_{t}(x)\right)_{t\geq 0}$ and $\left(X_{t}(\widetilde{x})\right)_{t\geq 0}$ be the affine processes with initial states $x,\ \widetilde{x}\in\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$ , respectively, obtained as the unique strong solutions to the stochastic equation discussed in Section 4. Suppose that (1.7) is satisfied for $\kappa=1$ . The following key estimate is proved in Section 6:

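$$\mathbb{E}\left(|X_{t}(x)-X_{t}(\widetilde{x})|\right)\leq Ke^{-\delta t}\left(\mathbbm{1}_{\{n>0\}}|y-\widetilde{y}|^{1/2}+|x-\widetilde{x}|\right),\qquad t\geq 0,\tag{1.13}$$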

where $K,\delta>0$ are some constants. Indeed, write $X_{t}(x)=(Y_{t}(y),Z_{t}(x))$ and $X_{t}(\widetilde{x})=(Y_{t}(\widetilde{y}),Z_{t}(\widetilde{x}))$ , respectively. Using the comparison principle for the CBI component we prove that

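$$\mathbb{E}\left(|Y_{t}(y)-Y_{t}(\widetilde{y})|\right)\leq d|y-\widetilde{y}|e^{-\delta^{\prime}t},\tag{1.14}$$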

where $\delta^{\prime}>0$ is some constant. From this and the particular structure of the stochastic equation solved by $(X_{t}(x))_{t\geq 0}$ and $(X_{t}(\widetilde{x}))_{t\geq 0}$ we then easily deduce (1.13). In the literature the proof of inequalities similar to (1.13) and (1.14) is often based on the construction of a successful coupling, which is typically a difficult task. In the framework of affine processes a surprisingly simple proof of such estimates is given in Section 6 by using monotone couplings as explained above.
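The monotone coupling behind the estimate for the CBI component can be illustrated in the simplest one-dimensional case, a Cox-Ingersoll-Ross diffusion $dY_{t}=(b+\beta Y_{t})dt+\sigma\sqrt{Y_{t}}dW_{t}$ with $\beta<0$. The sketch below is a Monte Carlo illustration under these assumptions and not the construction of Section 6: two Euler paths started at $y$ and $\widetilde{y}$ are driven by the same Brownian increments. For this model the comparison principle and the linear drift suggest $\mathbb{E}|Y_{t}(y)-Y_{t}(\widetilde{y})|=|y-\widetilde{y}|e^{\beta t}$, which the estimate should reproduce up to discretisation and sampling error. The function name and parameter values are hypothetical.

```python
import math
import random

def coupled_cir_gap(y, y_tilde, b, beta, sigma, t,
                    n_steps=100, n_paths=2000, seed=0):
    """Monte Carlo estimate of E|Y_t(y) - Y_t(y_tilde)| for two Euler paths of
    dY = (b + beta*Y) dt + sigma*sqrt(Y) dW driven by the same increments."""
    rng = random.Random(seed)
    dt = t / n_steps
    sqrt_dt = math.sqrt(dt)
    total = 0.0
    for _ in range(n_paths):
        u, v = y, y_tilde
        for _ in range(n_steps):
            dw = sqrt_dt * rng.gauss(0.0, 1.0)  # shared Brownian increment
            # max(., 0) keeps the square root defined if Euler dips below 0
            u += (b + beta * u) * dt + sigma * math.sqrt(max(u, 0.0)) * dw
            v += (b + beta * v) * dt + sigma * math.sqrt(max(v, 0.0)) * dw
        total += abs(u - v)
    return total / n_paths

gap = coupled_cir_gap(y=2.0, y_tilde=1.0, b=1.0, beta=-1.0, sigma=0.5, t=1.0)
print(gap, math.exp(-1.0))  # estimate versus |y - y_tilde| * exp(beta * t)
```

The exponential decay of the gap in $t$ is the one-dimensional analogue of (1.14) with rate $\delta^{\prime}=-\beta$.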

Step 4. The results obtained in Steps 1 – 3 provide us all necessary tools to give a full proof of Theorem 1.5 in Section 7. For the sake of simplicity, we explain below how (1.8) is shown. Estimate (1.6) can be obtained in the same way. Using classical arguments, we may deduce assertion (1.8) from the contraction estimate

[TABLE]

Next observe that, by the convexity of the Wasserstein distance (see Lemma 8.4) combined with (1.3), property (1.15) is implied by

[TABLE]

Let $(P_{t}^{0})_{t\geq 0}$ be the transition semigroup for the affine process with admissible parameters $(a=0,\alpha,b=0,\beta,\nu=0,\mu)$. In view of (1.1) one has $P_{t}(x,\cdot)=P_{t}^{0}(x,\cdot)\ast P_{t}(0,\cdot)$, where $\ast$ denotes the usual convolution of measures. A similar decomposition for affine processes was also used in [29]. Applying now Lemma 8.3 and the Jensen inequality gives
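The convolution decomposition can be checked on the level of Laplace transforms in a one-dimensional example, since the transform of a convolution is the product of the transforms. The sketch assumes the Cox-Ingersoll-Ross diffusion $dY_{t}=(b+\beta Y_{t})dt+\sigma\sqrt{Y_{t}}dW_{t}$ on $D=\mathbb{R}_{+}$ and uses its standard closed-form Laplace transform; this model and the function name are assumptions of the example. Setting $b=0$ gives the transform of $P_{t}^{0}(y,\cdot)$, and setting $y=0$ gives the transform of $P_{t}(0,\cdot)$.

```python
import math

def cir_laplace(lam, t, y, b, beta, sigma):
    """E[exp(-lam * Y_t)] for dY = (b + beta*Y) dt + sigma*sqrt(Y) dW,
    Y_0 = y, lam >= 0, beta != 0."""
    c = sigma**2 * (math.exp(beta * t) - 1.0) / (2.0 * beta)
    denom = 1.0 + lam * c
    immigration = denom ** (-2.0 * b / sigma**2)               # depends on b only
    branching = math.exp(-lam * y * math.exp(beta * t) / denom)  # depends on y only
    return immigration * branching

lam, t, y, b, beta, sigma = 0.7, 1.5, 2.0, 1.0, -1.0, 0.5
full = cir_laplace(lam, t, y, b, beta, sigma)
from_p0 = cir_laplace(lam, t, y, 0.0, beta, sigma)   # law P_t^0(y, .)
from_zero = cir_laplace(lam, t, 0.0, b, beta, sigma)  # law P_t(0, .)
print(full, from_p0 * from_zero)
```

The factor depending on $y$ corresponds to $\exp(\langle x,\psi(t,u)\rangle)$ in (1.1) and the factor depending on $b$ to $\exp(\phi(t,u))$, evaluated at $u=-\lambda$.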

[TABLE]

where the last inequality follows from Step 3 applied to $(P_{t}^{0})_{t\geq 0}$ .

2 Examples

2.1 Anisotropic $(\gamma_{1},\gamma_{2})$ -root process

Let $Z_{1},Z_{2}$ be independent one-dimensional Lévy processes with symbols

[TABLE]

where $\gamma_{1},\gamma_{2}\in(1,2)$ . Let $S=(S_{1},S_{2})$ be another $2$ -dimensional Lévy process with symbol

[TABLE]

where $\nu$ is a measure on $\mathbb{R}_{+}^{2}$ with $\nu\left(\left\{0\right\}\right)=0$ and

[TABLE]

Suppose that $Z$ and $S$ are independent. Applying the results of [5] to this particular case shows that, for each $x\in\mathbb{R}_{+}^{2}$ , there exists a pathwise unique strong solution to

[TABLE]

This process is an affine process on $D=\mathbb{R}_{+}^{2}$ with admissible parameters

[TABLE]

and corresponding Lévy measures $\nu$ ,

[TABLE]

Applying our main result to this particular case gives the following.

Corollary 2.1.

If $\beta$ has only eigenvalues with negative real parts and $\nu$ satisfies

[TABLE]

then the assertions of Theorem 1.5 are true.

Convergence in total variation distance for a similar one-dimensional model was studied in [42]. Similar two-dimensional processes were also studied in [4] and [27]. In view of our main result Theorem 1.5, it is straightforward to extend this model to arbitrary dimension $d\geq 2$ , with possibly non-vanishing diffusion part and more general driving noise of Lévy type.
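
The spectral condition on $\beta$ in Corollary 2.1 is easy to check for a real $2\times 2$ matrix: both eigenvalues have negative real parts exactly when the trace is negative and the determinant is positive. The following is a minimal Python sketch; the function name and the two matrices are hypothetical and chosen only for illustration, they are not taken from the model above.

```python
def has_stable_spectrum_2x2(beta):
    """Return True if both eigenvalues of the real 2x2 matrix beta have
    negative real part (equivalently: trace < 0 and determinant > 0)."""
    (b11, b12), (b21, b22) = beta
    trace = b11 + b22
    det = b11 * b22 - b12 * b21
    return trace < 0 and det > 0


# Hypothetical drift matrices, chosen only for illustration.
beta_stable = [[-1.0, 0.3], [0.2, -0.5]]    # trace -1.5, determinant 0.44
beta_unstable = [[-1.0, 2.0], [1.0, -0.5]]  # trace -1.5, determinant -1.5

print(has_stable_spectrum_2x2(beta_stable))
print(has_stable_spectrum_2x2(beta_unstable))
```

The second matrix shows that negative diagonal entries alone are not enough once the off-diagonal coupling is strong.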

2.2 Stochastic volatility model

Let $D=\mathbb{R}_{+}\times\mathbb{R}$ , i.e., $m=n=1$ . Let $(V,Y)$ be the unique strong solution to

[TABLE]

where $b_{1}\geq 0$ , $b_{2}\in\mathbb{R}$ , $\beta_{11},\beta_{22}\in\mathbb{R}$ , $\rho\in(-1,1)$ is the correlation coefficient, $B=(B_{1},B_{2})$ is a two-dimensional Brownian motion, $J_{1}$ is a one-dimensional Lévy subordinator with Lévy measure $\nu_{1}$ , and $J_{2}$ a one-dimensional Lévy process with Lévy measure $\nu_{2}$ . Suppose that $B,\ J_{1}$ and $J_{2}$ are mutually independent. It is not difficult to see that $(V,Y)$ is an affine process with admissible parameters

[TABLE]

and measures

[TABLE]

Then we obtain the following.

Corollary 2.2.

If $\beta_{11},\beta_{22}<0$ and

[TABLE]

then the assertions of Theorem 1.5 are true.

It is straightforward to extend this model to higher dimensions and more general driving noises.

3 Stochastic equation for multi-type CBI processes

In this section we recall some results for the particular case of multi-type CBI processes, i.e. affine processes on state space $D=\mathbb{R}_{+}^{m}$ (that is, $n=0$ ). For further references and additional explanations we refer to [5] and [8]. Let $(\Omega,\mathcal{F},\mathbb{P})$ be a complete probability space rich enough to support the following objects:

(B1)

An $m$ -dimensional Brownian motion $\left(W_{t}\right)_{t\geq 0}:=(W_{t,1},\dots,W_{t,m})_{t\geq 0}$ .

(B2)

A Poisson random measure $M_{I}(ds,d\xi)$ on $\mathbb{R}_{+}\times\mathbb{R}_{+}^{m}$ with compensator $\widehat{M}_{I}(ds,d\xi)=ds\nu_{I}(d\xi)$ , where $\nu_{I}$ is a Borel measure supported on $\mathbb{R}_{+}^{m}$ satisfying

[TABLE]

(B3)

Poisson random measures $N_{1}^{I}(ds,d\xi,dr),\dots,N_{m}^{I}(ds,d\xi,dr)$ on $\mathbb{R}_{+}\times\mathbb{R}_{+}^{m}\times\mathbb{R}_{+}$ with compensators $\widehat{N}_{i}^{I}(ds,d\xi,dr)=ds\mu_{i}^{I}(d\xi)dr$ , $i\in I$ , where $\mu_{1}^{I},\dots,\mu_{m}^{I}$ are Borel measures supported on $\mathbb{R}_{+}^{m}$ satisfying

[TABLE]

The objects $W,M_{I},N_{1}^{I},\dots,N_{m}^{I}$ are supposed to be mutually independent. Let $\widetilde{M}_{I}(ds,d\xi)=M_{I}(ds,d\xi)-\widehat{M}_{I}(ds,d\xi)$ and $\widetilde{N}_{i}^{I}(ds,d\xi,dr)=N_{i}^{I}(ds,d\xi,dr)-\widehat{N}_{i}^{I}(ds,d\xi,dr)$ be the corresponding compensated Poisson random measures. Here and below we consider the natural augmented filtration generated by $W,M_{I},N_{1}^{I},\dots,N_{m}^{I}$ . Finally let

(a)

$b\in\mathbb{R}_{+}^{m}$ .

(b)

$\beta=(\beta_{ij})_{i,j\in I}$ such that $\beta_{ji}-\int_{\mathbb{R}_{+}^{m}}\xi_{j}\mu_{i}^{I}(d\xi)\geq 0$ , for $i\in I$ and $j\in I\backslash\{i\}$ .

(c)

A matrix $\sigma(y)=\mathrm{diag}(\sqrt{2c_{1}y_{1}},\cdots,\sqrt{2c_{m}y_{m}})\in\mathbb{R}^{m\times m}$ , where $c_{1},\dots,c_{m}\geq 0$ .

For $y\in\mathbb{R}_{+}^{m}$ , consider the stochastic equation

[TABLE]

where $\widetilde{\beta}_{ji}=\beta_{ji}-\int_{|\xi|>1}\xi_{j}\mu_{i}^{I}(d\xi)$ . Pathwise uniqueness for a slightly more complicated equation was recently obtained in [5], while (3.1) in this form appeared first in [8]. The following is essentially due to [5].

Proposition 3.1.

Let $(b,\beta,\sigma)$ be as in (a) – (c), and consider objects $W,M_{I},N_{1}^{I},\dots,N_{m}^{I}$ that are given in (B1) – (B3). Then the following assertions hold:

(a)

For each $y\in\mathbb{R}_{+}^{m}$ , there exists a pathwise unique strong solution $Y=(Y_{t})_{t\geq 0}$ to (3.1).

(b)

Let $Y$ be any solution to (3.1). Then $Y$ is a multi-type CBI process starting from $y$ , and the generator $L_{Y}$ of $Y$ is of the following form: for $f\in C_{c}^{2}(\mathbb{R}_{+}^{m}),$

[TABLE]

Conversely, given any multi-type CBI process $\widetilde{Y}$ with generator $L_{Y}$ and starting point $y$ , we can find a solution $Y$ to (3.1) such that $Y$ and $\widetilde{Y}$ have the same law.

The proof of the pathwise uniqueness is based on a comparison principle for multi-type CBI processes, see [5, Lemma 4.2]. This comparison principle is stated below.

Lemma 3.2.

[5, Lemma 4.2] Let $(Y_{t})_{t\geq 0}$ be a weak solution to (3.1) with parameters $(b,\beta,\sigma)$ , let $(Y_{t}^{\prime})_{t\geq 0}$ be another weak solution to (3.1) with parameters $(b^{\prime},\beta,\sigma)$ , where $(b,\beta,\sigma)$ and $(b^{\prime},\beta,\sigma)$ satisfy (a) – (c). Both solutions are supposed to be defined on the same probability space and with respect to the same noises $W,M_{I},N_{1}^{I},\dots,N_{m}^{I}$ that satisfy (B1) – (B3). Suppose that, for all $j\in\{1,\dots,m\}$ , $y_{j}\leq y_{j}^{\prime}$ and $b_{j}\leq b_{j}^{\prime}$ . Then

[TABLE]

4 Stochastic equation for affine processes

Below we show that any affine process can also be obtained as the pathwise unique strong solution to a certain stochastic equation. In the two-dimensional case $D=\mathbb{R}_{+}\times\mathbb{R}$ such a result was first obtained in [15]. Independently, the case of affine diffusions on the canonical state space $D=\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$ (i.e., processes without jumps) was studied in [20]. The main obstacle there is related to the diffusion component, which is degenerate at the boundary but also has a nontrivial structure in higher dimensions. In order to take this into account we use, compared to [20], another representation of the diffusion matrix (see (A0) and (A1) below). Such a representation is used to decompose the corresponding affine process into a CBI and an OU component, which are then treated separately. Consequently, based on the available results for CBI processes, the proofs in this section become relatively simple.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. For the parameters $a$ and $\alpha=(\alpha_{1},\dots,\alpha_{m})$ consider the following objects:

(A0)

An $n\times n$ -matrix $\sigma_{a}$ such that $\sigma_{a}\sigma_{a}^{\top}=a_{JJ}$ .

(A1)

Matrices $\sigma_{1},\dots,\sigma_{m}\in\mathbb{R}^{d\times d}$ such that, for all $j\in I$ , $\sigma_{j}\sigma_{j}^{\top}=\alpha_{j}$ and

[TABLE]

Let us remark the following.

Remark 4.1.

(i)

The first condition is simple to check. Indeed, by definition, one has $a=\begin{pmatrix}0&0\\ 0&a_{JJ}\end{pmatrix}\in S_{d}^{+}$ , thus $a_{JJ}$ is symmetric and positive semidefinite. Hence $\sigma_{a}$ denotes the non-negative square root of $a_{JJ}$ .

(ii)

Concerning the second condition, recall that $\alpha_{j}\in S_{d}^{+}$ and hence $\alpha_{j,II}$ is positive semidefinite. Moreover, by definition of admissible parameters, $\alpha_{j,II}$ is everywhere zero except at the entry $(j,j)$ . Hence $\alpha_{j,jj}^{1/2}$ is well-defined. Existence of $\sigma_{j}$ satisfying (4.1) follows from the characterization of positive semidefiniteness for symmetric block matrices, see, e.g., [23, Theorem 16.1]. The latter result is based on pseudo-inverses and properties of the Schur-complement for block matrices.
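
To illustrate condition (A0) in the smallest non-trivial case $n=2$, the non-negative square root of a symmetric positive semidefinite $2\times 2$ matrix can be written in closed form. The sketch below uses the standard identity $\sqrt{A}=(A+\delta I)/s$ with $\delta=\sqrt{\det A}$ and $s=\sqrt{\operatorname{tr}A+2\delta}$, which follows from the Cayley-Hamilton theorem; the function names and the sample matrix are hypothetical and not taken from the paper.

```python
import math


def psd_sqrt_2x2(a):
    """Non-negative square root of a symmetric positive semidefinite 2x2 matrix.

    Uses sqrt(A) = (A + delta*I) / s with delta = sqrt(det A) and
    s = sqrt(trace A + 2*delta).
    """
    (a11, a12), (a21, a22) = a
    delta = math.sqrt(max(a11 * a22 - a12 * a21, 0.0))  # clamp rounding noise
    s = math.sqrt(a11 + a22 + 2.0 * delta)
    if s == 0.0:  # only happens for the zero matrix
        return [[0.0, 0.0], [0.0, 0.0]]
    return [[(a11 + delta) / s, a12 / s], [a21 / s, (a22 + delta) / s]]


def times_own_transpose(m):
    """Return m @ m^T for a 2x2 matrix m given as nested lists."""
    return [[sum(m[i][k] * m[j][k] for k in range(2)) for j in range(2)]
            for i in range(2)]


a_JJ = [[2.0, 1.0], [1.0, 2.0]]           # hypothetical PSD diffusion block
sigma_a = psd_sqrt_2x2(a_JJ)
recovered = times_own_transpose(sigma_a)  # should reproduce a_JJ
print(sigma_a)
print(recovered)
```

In higher dimensions one would typically rely on an eigendecomposition or a Cholesky-type factorization instead; any factor with $\sigma_{a}\sigma_{a}^{\top}=a_{JJ}$ satisfies (A0).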

Below we describe the noises appearing in the stochastic equation for affine processes. Let $(\Omega,\mathcal{F},\mathbb{P})$ be a complete probability space rich enough to support the following objects:

(A2)

An $n$ -dimensional Brownian motion $B=(B_{t})_{t\geq 0}$ .

(A3)

For each $i\in I$ , a $d$ -dimensional Brownian motion $W^{i}=(W_{t}^{i})_{t\geq 0}$ .

(A4)

A Poisson random measure $M(ds,d\xi)$ with compensator $\widehat{M}(ds,d\xi)=ds\nu(d\xi)$ on $\mathbb{R}_{+}\times D$ .

(A5)

For each $i\in I$ , a Poisson random measure $N_{i}(ds,d\xi,dr)$ with compensator $\widehat{N}_{i}(ds,d\xi,dr)=ds\mu_{i}(d\xi)dr$ on $\mathbb{R}_{+}\times D\times\mathbb{R}_{+}$ .

We suppose that all objects $B,W^{1},\dots,W^{m},M,N_{1},\dots,N_{m}$ are mutually independent. Denote by $\widetilde{M}(ds,d\xi)=M(ds,d\xi)-\widehat{M}(ds,d\xi)$ and $\widetilde{N}_{i}(ds,d\xi,dr)=N_{i}(ds,d\xi,dr)-\widehat{N}_{i}(ds,d\xi,dr)$ , $i\in I$ , the corresponding compensated Poisson random measures. Here and below we consider the natural augmented filtration generated by these noise terms. For $x\in D$ , consider the stochastic equation

[TABLE]

where $\widetilde{b}$ and $\widetilde{\beta}=(\widetilde{\beta}_{ki})_{k,i\in\{1,\dots,d\}}$ are, for $i,k\in\{1,\dots,d\}$ , given by

[TABLE]

Note that we have changed the drift coefficients to $\widetilde{b}$ and $\widetilde{\beta}$ in order to change the compensators in the stochastic integrals. Such change is, under the given moment conditions on $\mu=(\mu_{1},\dots,\mu_{m})$ , always possible and does not affect our results. Concerning existence and uniqueness for (4.2), we obtain the following.

Theorem 4.2.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. Then, for each $x\in D$ , there exists a pathwise unique $D$ -valued strong solution $X=(X_{t})_{t\geq 0}$ to (4.2).

This result will be proved later in this section. Let us first relate (4.2) to affine processes.

Proposition 4.3.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. Then each solution $X$ to (4.2) is an affine process with admissible parameters $(a,\alpha,b,\beta,\nu,\mu)$ and starting point $x$ .

Proof.

Let $X$ be a solution to (4.2) and $f\in C_{c}^{2}(D)$ . Applying the Itô formula shows that

[TABLE]

is a local martingale. Note that $Lf$ is bounded. Hence

[TABLE]

and we conclude that $(M_{f}(t))_{t\geq 0}$ is a true martingale. It follows from Remark 1.3 that $X$ is an affine process with admissible parameters $(a,\alpha,b,\beta,\nu,\mu)$ . ∎

The rest of this section is devoted to the proof of Theorem 4.2. As often in the theory of stochastic equations, existence of weak solutions is the easy part.

Lemma 4.4.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. Then, for each $x\in D$ , there exists a weak solution $X$ to (4.2).

Proof.

Since existence of a solution to the martingale problem with sample paths in the Skorokhod space over $D$ is known, the assertion is a consequence of [39], namely, the equivalence between weak solutions to stochastic equations and martingale problems. Alternatively, following [14, p.993] we can show that each solution to the martingale problem with generator $L$ and domain $C_{c}^{2}(D)$ is a semimartingale and compute its semimartingale characteristics (see [14, Theorem 2.12]). The assertion is then a consequence of the equivalence between weak solutions to stochastic equations and semimartingales (see [31, Chapter III, Theorem 2.26]). ∎

In view of the Yamada-Watanabe Theorem (see [6]), Theorem 4.2 is proved, provided we can show pathwise uniqueness for (4.2). For this purpose we rewrite (4.2) into its components $X=(Y,Z)$ , where $Y\in\mathbb{R}_{+}^{m}$ and $Z\in\mathbb{R}^{n}$ . Introduce the notation $\xi=(\xi_{I},\xi_{J})\in D$ , where $\xi_{I}=(\xi_{i})_{i\in I}$ and $\xi_{J}=(\xi_{j})_{j\in J}$ . Moreover, let $W_{s}^{i}=(W_{s,I}^{i},W_{s,J}^{i})$ and write $x=(y,z)\in D$ for the initial condition. Finally, let $e_{1},\dots,e_{d}$ denote the canonical basis vectors in $\mathbb{R}^{d}$ . Then (4.2) is equivalent to the system of equations

[TABLE]

Observe that the first equation for $Y$ does not involve $Z$ . We will show that (4.4) is precisely (3.1), i.e., $Y$ is a multi-type CBI process and pathwise uniqueness holds for $Y$ . The second equation for $Z$ describes an OU-type process with random coefficients depending on $Y$ . If we regard $Y$ as conditionally fixed, then pathwise uniqueness for (4.5) is obvious. These ideas are summarized in the next lemma.

Lemma 4.5.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. Then pathwise uniqueness holds for (4.4) and (4.5), and hence for (4.2).

Proof.

Let $X=(Y,Z)$ and $X^{\prime}=(Y^{\prime},Z^{\prime})$ be two solutions to (4.2) with the same initial condition $x=(y,z)\in D$ both defined on the same probability space. Then $Y$ and $Y^{\prime}$ both satisfy (4.4). Let us show that (4.4) is precisely (3.1), from which we deduce $\mathbb{P}(Y_{t}=Y_{t}^{\prime},\ \ t\geq 0)=1$ . Set $\mathrm{pr}_{I}:D\longrightarrow\mathbb{R}_{+}^{m}$ , $\mathrm{pr}_{I}(x)=(x_{i})_{i\in I}$ , and define

•

An $m$ -dimensional Brownian motion $W_{t}:=(W_{t,1}^{1},\dots,W_{t,m}^{m})$ .

•

A Poisson random measure $M_{I}(ds,d\xi)$ on $\mathbb{R}_{+}\times\mathbb{R}_{+}^{m}$ by

[TABLE]

where $0\leq s<t$ and $A\subset\mathbb{R}_{+}^{m}$ is a Borel set.

•

Poisson random measures $N_{1}^{I},\dots,N_{m}^{I}$ on $\mathbb{R}_{+}\times\mathbb{R}_{+}^{m}\times\mathbb{R}_{+}$ by

[TABLE]

where $0\leq s<t$ , $0\leq c<d$ and $A\subset\mathbb{R}_{+}^{m}$ is a Borel set.

Note that the random objects $W,M_{I},N_{1}^{I},\dots,N_{m}^{I}$ are mutually independent. Moreover, it is not difficult to see that $M_{I}$ and $N_{1}^{I},\dots,N_{m}^{I}$ have compensators

[TABLE]

where $\nu_{I}=\nu\circ\mathrm{pr}_{I}^{-1}$ and $\mu_{i}^{I}=\mu_{i}\circ\mathrm{pr}_{I}^{-1}$ . Finally let $c_{j}=\alpha_{j,jj}$ , $j\in\{1,\dots,m\}$ , and

[TABLE]

Then (4.4) is precisely (3.1), and it follows from Proposition 3.1.(a) that $\mathbb{P}(Y_{t}=Y_{t}^{\prime},\ \ t\geq 0)=1$ .

It remains to prove pathwise uniqueness for (4.5). Define, for $l\geq 1$ , a stopping time $\tau_{l}=\inf\{t>0\ |\ \max\{|Z_{t}|,|Z_{t}^{\prime}|\}>l\}$ . Since $Z$ and $Z^{\prime}$ both satisfy (4.5) for the same $Y$ , we obtain

[TABLE]

and hence, for some constant $C>0$ ,

[TABLE]

The Gronwall lemma gives $\mathbb{P}(Z_{t\wedge\tau_{l}}=Z^{\prime}_{t\wedge\tau_{l}})=1$ , for all $t\geq 0$ and $l\geq 1$ . Note that $Z$ and $Z^{\prime}$ have no explosion. Taking $l\to\infty$ proves the assertion. ∎
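
The Gronwall step can be spelled out as follows. The sketch assumes that the preceding estimate has the form $u(t)\leq C\int_{0}^{t}u(s)\,ds$ for a non-negative function $u$ that is bounded by some $M<\infty$ on $[0,T]$; the symbols $u$, $M$ and $T$ are introduced here only for illustration and do not appear in the original argument.

```latex
u(t)\;\le\; C\int_{0}^{t}u(s)\,ds
\;\le\; C^{2}\int_{0}^{t}\!\int_{0}^{s}u(r)\,dr\,ds
\;\le\;\dots\;\le\; M\,\frac{(Ct)^{n}}{n!}
\;\xrightarrow[\,n\to\infty\,]{}\;0 ,
\qquad t\in[0,T].
```

Hence $u\equiv 0$ on $[0,T]$, which is the form of the Gronwall lemma needed for a uniqueness statement.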

5 Moments for affine processes

The stochastic equation introduced in Section 4 can be used to provide a simple proof for the finiteness of moments for affine processes. The following is our main result for this section.

Proposition 5.1.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters. For $x\in D$ , let $X$ be the unique solution to (4.2).

(a)

Suppose that there exists $\kappa>0$ such that

[TABLE]

Then there exists a constant $C_{\kappa}>0$ (independent of $x$ and $X$ ) such that

[TABLE]

(b)

Suppose that (1.5) is satisfied. Then there exists a constant $C>0$ (independent of $x$ and $X$ ) such that

[TABLE]

Proof.

Define $V_{1}(h)=(1+|h|^{2})^{\kappa/2}$ and $V_{2}(h)=\log(1+|h|^{2})$ , where $h\in D$ . Applying the Itô formula for $V_{j}$ , $j\in\{1,2\}$ , gives

[TABLE]

where $(\mathcal{M}_{j}(t))_{t\geq 0}$ and $\mathcal{A}_{j}(\cdot)$ are given by

[TABLE]

where $\widetilde{b}$ was defined in (4.3). Define, for $l\geq 1$ , a stopping time $\tau_{l}=\inf\{t\geq 0\ |\ |X_{t}|>l\}$ . Then it is not difficult to see that $(\mathcal{M}_{j}(t\wedge\tau_{l}))_{t\geq 0}$ is a martingale, for any $l\geq 1$ . Moreover, we will prove in the appendix that there exists a constant $C>0$ such that

[TABLE]

Hence taking expectations in (5.1) gives

[TABLE]

Applying the Gronwall lemma gives $\mathbb{E}(V_{j}(X_{t\wedge\tau_{l}}))\leq(V_{j}(x)+Ct)e^{Ct}\leq(1+V_{j}(x))e^{C^{\prime}t}$ , for all $t\geq 0$ and some constant $C^{\prime}>0$ . Since $(X_{t})_{t\geq 0}$ has càdlàg paths and $C^{\prime}$ is independent of $l$ , we may take the limit $l\to\infty$ and conclude the assertion by Fatou's lemma. ∎
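
The second inequality in the Gronwall bound can be verified directly. Since $V_{j}\geq 0$ and $1+Ct\leq e^{Ct}$, one admissible (not necessarily optimal) choice is $C^{\prime}=2C$:

```latex
\bigl(V_{j}(x)+Ct\bigr)\,e^{Ct}
\;\le\;\bigl(1+V_{j}(x)\bigr)\,(1+Ct)\,e^{Ct}
\;\le\;\bigl(1+V_{j}(x)\bigr)\,e^{2Ct},
\qquad t\ge 0 .
```

The first step uses $(1+V_{j}(x))(1+Ct)=1+V_{j}(x)+Ct+V_{j}(x)Ct\geq V_{j}(x)+Ct$.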

We close this section with a formula for the first moment of general affine processes. The particular case of multi-type CBI processes was treated in [5, Lemma 3.4], while recursion formulas for higher-order moments of multi-type CBI processes were provided in [7].

Lemma 5.2.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters and suppose that

[TABLE]

Let $(X_{t})_{t\geq 0}$ be an affine process obtained from (4.2) with $X_{0}=x\in D$ . Then

[TABLE]

where $\overline{b}_{i}=b_{i}+\int_{|\xi|>1}\xi_{i}\nu(d\xi)+\mathbbm{1}_{I}(i)\int_{|\xi|\leq 1}\xi_{i}\nu(d\xi)$ . Moreover, if $x=(y,z)\in\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$ and $X=(Y,Z)\in\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$ , then

[TABLE]

Proof.

First observe that, by definition of admissible parameters and (5.3), we may apply Proposition 5.1 (a) and deduce that $X_{t}$ has finite first moment. Taking expectations in (4.2) gives

[TABLE]

Solving this equation gives the desired formula for $\mathbb{E}(X_{t})$ . Taking expectations in (3.1) (or (4.4)) gives

[TABLE]

which implies the desired formula for $\mathbb{E}(Y_{t})$ . Finally, taking expectations in (4.5) gives

[TABLE]

Solving this equation and using the previous formula for $\mathbb{E}(Y_{s})$ , we obtain the assertion. ∎
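
The step "solving this equation" amounts to solving a linear ordinary differential equation for the mean. The sketch below treats the scalar case and assumes that the first moment $m(t)=\mathbb{E}(X_{t})$ satisfies $m^{\prime}(t)=\overline{b}+\beta m(t)$ with $m(0)=x$, whose solution is $m(t)=e^{t\beta}x+\overline{b}\,(e^{t\beta}-1)/\beta$; the function names and the parameter values are hypothetical and serve only as an illustration.

```python
import math


def mean_closed_form(x, b_bar, beta, t):
    """Scalar solution of m'(t) = b_bar + beta * m(t), m(0) = x, beta != 0."""
    return math.exp(beta * t) * x + b_bar * (math.exp(beta * t) - 1.0) / beta


def mean_euler(x, b_bar, beta, t, steps=100_000):
    """Explicit Euler integration of the same linear ODE, as a cross-check."""
    h = t / steps
    m = x
    for _ in range(steps):
        m += h * (b_bar + beta * m)
    return m


# Hypothetical parameters with subcritical drift beta < 0.
x0, b_bar, beta, t = 2.0, 0.5, -1.5, 3.0
exact = mean_closed_form(x0, b_bar, beta, t)
approx = mean_euler(x0, b_bar, beta, t)
stationary_mean = -b_bar / beta  # limit of m(t) as t -> infinity when beta < 0

print(exact, approx, stationary_mean)
```

For $\beta<0$ the mean converges exponentially fast to $-\overline{b}/\beta$, independently of the starting point, which is the first-moment counterpart of the ergodicity statement.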

6 Contraction estimate for trajectories of affine processes

The following is our main estimate for this section.

Proposition 6.1.

Let $(a,\alpha,b,\beta,\nu,\mu)$ be admissible parameters, suppose that (5.3) is satisfied, and assume that $\beta$ has only eigenvalues with negative real parts. Let $x=(y,z),\widetilde{x}=(\widetilde{y},\widetilde{z})\in\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$ , and let $X(x)=(Y(y),Z(x))$ and $X(\widetilde{x})=(Y(\widetilde{y}),Z(\widetilde{x}))$ be the unique strong solutions to (4.2) with initial condition $x$ and $\widetilde{x}$ , respectively. Then there exist constants $K,\delta,\delta^{\prime}>0$ independent of $X(x)$ and $X(\widetilde{x})$ such that, for all $t\geq 0$ ,

[TABLE]

Proof.

Let us first prove (6.1). Note that $Y(y)$ and $Y(\widetilde{y})$ are multi-type CBI processes with the same parameters. If $\widetilde{y}_{j}\leq y_{j}$ for all $j\in\{1,\dots,m\}$ , then we obtain from Lemma 3.2 and Lemma 5.2

[TABLE]

where we have used that $\beta_{II}$ has only eigenvalues with negative real parts (since $\beta$ has this property and $\beta_{IJ}=0$ ). For general $y,\widetilde{y}$ , let $y^{0},\dots,y^{m}\in\mathbb{R}_{+}^{m}$ be such that

[TABLE]

where $e_{1},\dots,e_{m}$ denote the canonical basis vectors in $\mathbb{R}^{m}$ . Then, for each $j\in\{0,\dots,m-1\}$ , the elements $y^{j},y^{j+1}$ are comparable in the sense that $y_{k}^{j}=y_{k}^{j+1}$ if $k\neq j+1$ , and either $y_{j+1}^{j}\leq y_{j+1}^{j+1}$ or $y_{j+1}^{j}\geq y_{j+1}^{j+1}$ . In any case, we obtain from the previous consideration

[TABLE]

where we have used $|y^{j}-y^{j+1}|=|y_{j+1}-\widetilde{y}_{j+1}|$ . This completes the proof of (6.1).
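
The interpolation argument can be sketched in code. The example assumes the orientation $y^{0}=y$ and $y^{m}=\widetilde{y}$, with $y^{j+1}$ obtained from $y^{j}$ by replacing coordinate $j+1$ by that of $\widetilde{y}$; the orientation, the function names and the sample points are assumptions made for illustration.

```python
def interpolation_chain(y, y_tilde):
    """Return [y^0, ..., y^m] with y^0 = y and y^m = y_tilde, where y^{j+1}
    is obtained from y^j by replacing one coordinate with that of y_tilde."""
    chain = [list(y)]
    for j in range(len(y)):
        nxt = list(chain[-1])
        nxt[j] = y_tilde[j]
        chain.append(nxt)
    return chain


def comparable(u, v):
    """True if u <= v componentwise or v <= u componentwise."""
    return (all(a <= b for a, b in zip(u, v))
            or all(a >= b for a, b in zip(u, v)))


y = [1.0, 4.0, 0.5]        # hypothetical starting points in R_+^3
y_tilde = [2.0, 3.0, 0.5]
chain = interpolation_chain(y, y_tilde)

print(comparable(y, y_tilde))  # the endpoints themselves need not be comparable
print([comparable(chain[j], chain[j + 1]) for j in range(len(y))])
```

Each consecutive pair differs in a single coordinate, so the comparison principle applies to every link of the chain even when $y$ and $\widetilde{y}$ are not ordered.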

If $n=0$ , then (6.2) is trivial. Suppose that $n>0$ . Applying the Itô formula to $e^{-t\beta}X_{t}(x)$ and $e^{-t\beta}X_{t}(\widetilde{x})$ , and then taking the difference, gives

[TABLE]

Here and below we denote by $K>0$ a generic constant which may vary from line to line. Moreover, we find $\delta_{0}>0$ and $\delta\in(0,\delta^{\prime})$ such that

[TABLE]

The stochastic integral against the Brownian motion is estimated by the BDG-inequality as follows

[TABLE]

where we have used (6.1) and (6.3). For the stochastic integral against $\widetilde{N}_{i}$ we consider the cases $|\xi|\leq 1$ and $|\xi|>1$ separately. For $|\xi|\leq 1$ we apply first the BDG-inequality and then the Jensen inequality to obtain, for each $i\in I$ ,

[TABLE]

For $|\xi|>1$ , we apply first the BDG-inequality and then use the sub-additivity of $a\longmapsto a^{1/2}$ to obtain

[TABLE]

where we have used $|y-\widetilde{y}|\leq|x-\widetilde{x}|$ . Collecting all estimates proves the assertion. ∎

7 Proof of Theorem 1.5

7.1 The $\log$ -Wasserstein estimate

Based on the results of Section 6, we first deduce the following estimate with respect to the $\log$ -Wasserstein distance.

Proposition 7.1.

Let $(P_{t})_{t\geq 0}$ be the transition semigroup with admissible parameters $(a,\alpha,b,\beta,\nu,\mu)$ , suppose that $\beta$ has only eigenvalues with negative real parts, and (1.5) is satisfied. Then there exist constants $K,\delta>0$ such that, for any $\rho,\widetilde{\rho}\in\mathcal{P}_{\log}(D)$ , one has

[TABLE]

Proof.

Let $\left(P_{t}^{0}(x,\cdot)\right)_{t\geq 0}$ be the transition semigroup with admissible parameters $(a=0,\alpha,b=0,\beta,\nu=0,\mu)$ given by Theorem 1.2. Take $x=(y,z),\widetilde{x}=(\widetilde{y},\widetilde{z})\in\mathbb{R}_{+}^{m}\times\mathbb{R}^{n}$ and let $X^{0}(x)=(Y^{0}(y),Z^{0}(x))$ and $X^{0}(\widetilde{x})=(Y^{0}(\widetilde{y}),Z^{0}(\widetilde{x}))$ , respectively, be the corresponding affine processes obtained from (4.2) with admissible parameters $(a=0,\alpha,b=0,\beta,\nu=0,\mu)$ . Since $X_{t}^{0}(x)$ has law $P_{t}^{0}(x,\cdot)$ and $X_{t}^{0}(\widetilde{x})$ has law $P_{t}^{0}(\widetilde{x},\cdot)$ , there exist by Proposition 6.1 constants $K,\delta>0$ such that

[TABLE]

Next observe that, for $u\in\mathcal{U}$ , one has

[TABLE]

Combining this with (1.1) proves $P_{t}(x,\cdot)=P_{t}^{0}(x,\cdot)\ast P_{t}(0,\cdot)$ , where $\ast$ denotes the convolution of measures on $D$ . Let us now prove the desired $\log$ -estimate. Using Lemma 8.3 from the appendix and then the Jensen inequality for the concave function $\mathbb{R}_{+}\ni a\longmapsto\log(1+a)$ , gives for some generic constant $K>0$

[TABLE]

where we have used, for $a,b\geq 0$ , the elementary inequality

[TABLE]

which is proved in the appendix. Applying now Lemma 8.4 from the appendix gives for any $H\in\mathcal{H}(\rho,\widetilde{\rho})$

[TABLE]

Choosing $H$ as the optimal coupling of $(\rho,\widetilde{\rho})$ , i.e.,

[TABLE]

proves the assertion. ∎

Based on the previous proposition, the proof of Theorem 1.5 is easy. It is given below.

Lemma 7.2.

Let $(P_{t})_{t\geq 0}$ be the transition semigroup with admissible parameters $(a,\alpha,b,\beta,\nu,\mu)$ . Suppose that $\beta$ has only eigenvalues with negative real parts, and (1.5) is satisfied. Then $(P_{t})_{t\geq 0}$ has a unique invariant distribution $\pi$ . Moreover, this distribution belongs to $\mathcal{P}_{\log}(D)$ and, for any $\rho\in\mathcal{P}_{\log}(D)$ , one has (1.6).

Proof.

Let us first prove existence of an invariant distribution $\widetilde{\pi}\in\mathcal{P}_{\log}(D)$ . Observe that, by Proposition 5.1, we easily deduce that $P_{t}\mathcal{P}_{\log}(D)\subset\mathcal{P}_{\log}(D)$ , for any $t\geq 0$ . Fix any $\rho\in\mathcal{P}_{\log}(D)$ and let $k,l\in\mathbb{N}$ with $k>l$ . Then

[TABLE]

Since the right-hand side tends to zero as $k,l\to\infty$ , we conclude that $(P_{k}\rho)_{k\in\mathbb{N}}$ is a Cauchy sequence in $(\mathcal{P}_{\log}(D),W_{\log})$ . In particular, there exists a limit $\pi\in\mathcal{P}_{\log}(D)$ , i.e., $W_{\log}(P_{k}\rho,\pi)\longrightarrow 0$ as $k\to\infty$ . Let us show that $\pi$ is an invariant distribution for $P_{t}$ . Indeed, take $h\geq 0$ , then

[TABLE]

Since $W_{\log}(P_{k}\rho,\pi)\longrightarrow 0$ as $k\to\infty$ , we conclude that all terms tend to zero. Hence $W_{\log}(P_{h}\pi,\pi)=0$ , i.e., $P_{h}\pi=\pi$ , for all $h\geq 0$ . Next we prove that $\pi$ is the unique invariant distribution. Let $\pi_{0},\pi_{1}$ be any two invariant distributions and define $W_{\log}^{\leq 1}$ as in (1.4) with $d_{\log}$ replaced by $d_{\log}\wedge 1$ . Then we obtain, for any $t\geq 0$ and all $x,\widetilde{x}\in D$ , by the proof of Proposition 7.1 (see (7.1))

[TABLE]

Fix any $H\in\mathcal{H}(\pi_{0},\pi_{1})$ , then using the invariance of $\pi_{0},\pi_{1}$ together with the convexity of the Wasserstein distance gives

[TABLE]

By dominated convergence we deduce that the right-hand side tends to zero as $t\to\infty$ and hence $\pi_{0}=\pi_{1}$ . The last assertion can now be deduced from

[TABLE]

where we have first used the invariance of $\pi$ and then Proposition 7.1. ∎

7.2 The $\kappa$ -Wasserstein estimate

As before, we start with an estimate with respect to the Wasserstein distance $W_{\kappa}$ .

Proposition 7.3.

Let $(P_{t})_{t\geq 0}$ be the transition semigroup with admissible parameters $(a,\alpha,b,\beta,\nu,\mu)$ . Suppose that $\beta$ has only eigenvalues with negative real parts, and (1.7) is satisfied for some $\kappa\in(0,1]$ . Then there exist constants $K,\delta>0$ such that, for any $\rho,\widetilde{\rho}\in\mathcal{P}_{\kappa}(D)$ , one has

[TABLE]

Proof.

Let $\left(P_{t}^{0}(x,\cdot)\right)_{t\geq 0}$ be the transition semigroup with admissible parameters $(a=0,\alpha,b=0,\beta,\nu=0,\mu)$ given by Theorem 1.2. Arguing as in the proof of Proposition 7.1, we obtain

[TABLE]

and $P_{t}(x,\cdot)=P_{t}^{0}(x,\cdot)\ast P_{t}(0,\cdot)$ . Then we obtain from Lemma 8.3 from the appendix

[TABLE]

where the second inequality follows from the Jensen inequality and the third is a consequence of (7.2). Using now Lemma 8.4 from the appendix, we conclude that

[TABLE]

This proves the assertion. ∎

Based on the previous proposition, the $W_{\kappa}$ -estimate in Theorem 1.5 can be deduced by exactly the same arguments as in Lemma 7.2. So Theorem 1.5 is proved.

8 Appendix

8.1 Moment estimates for $V_{1}$ and $V_{2}$

In this section we prove (5.2).

Lemma 8.1.

Suppose that the same conditions as in Proposition 5.1 (a) are satisfied. Then there exists a constant $C=C_{\kappa}>0$ such that

[TABLE]

Proof.

Observe that $\nabla V_{1}(x)=\kappa x(1+|x|^{2})^{\frac{\kappa-2}{2}}$ . Using $|x|\leq(1+|x|^{2})^{1/2}$ gives $|\nabla V_{1}(x)|\leq\kappa(1+|x|^{2})^{\frac{\kappa-1}{2}}$ , and hence we obtain for some generic constant $C=C_{\kappa}>0$

[TABLE]

For the second order term we first observe that, for $k,l\in\{1,\dots,d\}$ ,

[TABLE]

where $\delta_{kl}$ denotes the Kronecker delta. Using $x_{k}x_{l}\leq\frac{x_{k}^{2}+x_{l}^{2}}{2}\leq|x|^{2}\leq(1+|x|^{2})$ gives $\left|\frac{\partial^{2}V_{1}(x)}{\partial x_{k}\partial x_{l}}\right|\leq C(1+|x|^{2})^{\frac{\kappa-2}{2}}$ . This implies that

[TABLE]

Let us now estimate the integrals against $\nu$ and $\mu_{1},\dots,\mu_{m}$ . Consider first the case $|\xi|>1$ . The mean value theorem gives

[TABLE]

where we have used $\left\langle\xi,x+t\xi\right\rangle\leq|\xi||x+t\xi|\leq|\xi|(1+|x+t\xi|^{2})^{1/2}$ in the last inequality. If $\kappa>1$ , then

[TABLE]

If $\kappa\in(0,1]$ , then $|\xi|(1+|x+t\xi|^{2})^{\frac{\kappa-1}{2}}\leq|\xi|$ . In any case, we obtain, for $|\xi|>1$ ,

[TABLE]

Using $\left\langle\xi,\nabla V_{1}(x)\right\rangle\leq|\xi||\nabla V_{1}(x)|\leq C|\xi|(1+|x|^{2})^{\frac{\kappa-1}{2}}$ and

[TABLE]

for the integral against $\nu$ , gives

[TABLE]

where we have used $x_{i}\leq|x|\leq(1+|x|^{2})^{1/2}$ , $i\in\{1,\dots,m\}$ . It remains to estimate the corresponding integrals for $|\xi|\leq 1$ . Applying twice the mean value theorem gives

[TABLE]

where we have used $\xi_{k}\xi_{l}\leq\frac{\xi_{k}^{2}+\xi_{l}^{2}}{2}\leq|\xi|^{2}$ . Using, for $i\in I$ and $|\xi|\leq 1$ ,

[TABLE]

we conclude that

[TABLE]

Collecting all estimates proves the desired estimate for $\mathcal{A}_{1}$ . ∎
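
The gradient formula and the gradient bound used at the start of this proof can be checked numerically. The following Python sketch compares the analytic gradient of $V_{1}$ with a central finite-difference approximation and tests the bound $|\nabla V_{1}(x)|\leq\kappa(1+|x|^{2})^{\frac{\kappa-1}{2}}$ at one point; the value of $\kappa$, the evaluation point and all function names are hypothetical.

```python
import math


def V1(x, kappa):
    """V_1(x) = (1 + |x|^2)^(kappa/2)."""
    return (1.0 + sum(c * c for c in x)) ** (kappa / 2.0)


def grad_V1(x, kappa):
    """Analytic gradient kappa * x * (1 + |x|^2)^((kappa - 2)/2)."""
    w = kappa * (1.0 + sum(c * c for c in x)) ** ((kappa - 2.0) / 2.0)
    return [w * c for c in x]


def grad_fd(f, x, h=1e-6):
    """Central finite-difference approximation of the gradient of f at x."""
    g = []
    for k in range(len(x)):
        xp, xm = list(x), list(x)
        xp[k] += h
        xm[k] -= h
        g.append((f(xp) - f(xm)) / (2.0 * h))
    return g


kappa = 0.7           # hypothetical moment exponent
x = [0.3, -1.2, 2.5]  # hypothetical point with m = 1, n = 2
analytic = grad_V1(x, kappa)
numeric = grad_fd(lambda z: V1(z, kappa), x)
grad_norm = math.sqrt(sum(g * g for g in analytic))
bound = kappa * (1.0 + sum(c * c for c in x)) ** ((kappa - 1.0) / 2.0)

print(analytic, numeric)
print(grad_norm, bound)
```

A single evaluation point is of course only a sanity check of the formulas, not a substitute for the estimate proved above.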

Let us now prove the desired estimate for $\mathcal{A}_{2}$ .

Lemma 8.2.

Suppose that the same conditions as in Proposition 5.1 (b) are satisfied. Then there exists a constant $C>0$ such that

[TABLE]

Proof.

Observe that $\nabla V_{2}(x)=\frac{2x}{1+|x|^{2}}$ . Hence we obtain for some generic constant $C>0$

[TABLE]

Observe that, for $k,l\in\{1,\dots,d\}$ ,

[TABLE]

Using $x_{k}x_{l}\leq C(1+|x|^{2})$ gives $\left|\frac{\partial^{2}V_{2}(x)}{\partial x_{k}\partial x_{l}}\right|\leq\frac{C}{1+|x|^{2}}$ . This implies that

[TABLE]

Let us estimate the integrals against $\nu$ and $\mu_{1},\dots,\mu_{m}$ . Consider first the case $|\xi|>1$ . Then

[TABLE]

and hence we obtain

[TABLE]

From the mean value theorem we obtain

[TABLE]

In view of $x_{i}\leq x_{i}+t\xi_{i}\leq|x_{I}+t\xi_{I}|\leq|x+t\xi|$ for $i\in I$ , we obtain $x_{i}(V_{2}(x+\xi)-V_{2}(x))\leq 2|\xi|$ . Using $\left\langle\xi,\nabla V_{2}(x)\right\rangle\leq|\xi||\nabla V_{2}(x)|\leq C|\xi|$ gives

[TABLE]

It remains to estimate the corresponding integrals for $|\xi|\leq 1$ . As in (8.1), we get

[TABLE]

This implies

[TABLE]

For $i\in I$ , by $x_{i}\leq|x+s\xi|$ , we get $\frac{x_{i}}{1+|x+s\xi|^{2}}\leq 1$ and hence

[TABLE]

Collecting all estimates proves the desired estimate for $\mathcal{A}_{2}$ . ∎
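
For reference, the second derivatives of $V_{2}(x)=\log(1+|x|^{2})$ used in this proof can be computed directly from $\nabla V_{2}(x)=\frac{2x}{1+|x|^{2}}$. The constant $6$ below is one admissible choice obtained from $|x_{k}x_{l}|\leq|x|^{2}\leq 1+|x|^{2}$ and is not claimed to coincide with the generic constant $C$ used above.

```latex
\frac{\partial^{2}V_{2}(x)}{\partial x_{k}\partial x_{l}}
 = \frac{2\delta_{kl}}{1+|x|^{2}}-\frac{4x_{k}x_{l}}{(1+|x|^{2})^{2}},
\qquad
\left|\frac{\partial^{2}V_{2}(x)}{\partial x_{k}\partial x_{l}}\right|
 \le \frac{2}{1+|x|^{2}}+\frac{4|x|^{2}}{(1+|x|^{2})^{2}}
 \le \frac{6}{1+|x|^{2}} .
```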

8.2 Some estimates on the Wasserstein distance

Here and below we let $d\in\{d_{\kappa},d_{\log}\}$ . Below we provide two simple and known estimates for Wasserstein distances.

Lemma 8.3.

Let $f,\widetilde{f},g\in\mathcal{P}_{d}(D)$ . Then

[TABLE]

Proof.

Using the Kantorovich duality (see [50, Theorem 5.10, Case 5.16]), we obtain

[TABLE]

where $\|h\|=\sup_{x\neq x^{\prime}}\frac{|h(x)-h(x^{\prime})|}{d(x,x^{\prime})}$ . Using now the definition of the convolution on the right-hand side gives

[TABLE]

where $\widetilde{h}(x)=\int_{D}h(x+x^{\prime})g(dx^{\prime})$ . Since $\|\widetilde{h}\|\leq 1$ , we conclude that

[TABLE]

where we have used again the Kantorovich duality. This completes the proof. ∎
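
The effect described by Lemma 8.3, namely that convolving both measures with the same third measure does not increase the Wasserstein distance, can be illustrated numerically. The sketch below assumes the one-dimensional case with $d(x,x^{\prime})=|x-x^{\prime}|$ (that is, $\kappa=1$) and uses empirical measures with equally weighted atoms, for which the optimal coupling is the sorted matching; the sample distributions and function names are hypothetical.

```python
import random


def w1_equal_weights(xs, ys):
    """W_1 distance between two empirical measures on R with the same number
    of equally weighted atoms (the optimal coupling is the sorted matching)."""
    assert len(xs) == len(ys)
    return sum(abs(a - b) for a, b in zip(sorted(xs), sorted(ys))) / len(xs)


def convolve_atoms(xs, zs):
    """Atoms of the convolution of two equally weighted empirical measures."""
    return [x + z for x in xs for z in zs]


rng = random.Random(0)                              # fixed seed for reproducibility
f = [rng.gauss(0.0, 1.0) for _ in range(50)]        # hypothetical samples
f_tilde = [rng.gauss(0.5, 2.0) for _ in range(50)]
g = [rng.expovariate(1.0) for _ in range(40)]

lhs = w1_equal_weights(convolve_atoms(f, g), convolve_atoms(f_tilde, g))
rhs = w1_equal_weights(f, f_tilde)
print(lhs <= rhs + 1e-9)
```

Here `lhs` is the distance between the two convolved measures and `rhs` the distance between the original ones; the small tolerance only absorbs floating-point rounding.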

The next estimate shows that the Wasserstein distance is convex. For additional details we refer to [50, Theorem 4.8].

Lemma 8.4.

Let $P(x,\cdot)$ be a Markov transition function on $D\times\mathcal{P}_{d}(D)$ . Then, for any $f,g\in\mathcal{P}_{d}(D)$ and any coupling $H$ of $(f,g)$ , it holds that

[TABLE]

8.3 Proof of the elementary inequality with respect to $\log$

Below we prove the following inequality.

Lemma 8.5.

For any $a,b\geq 0$ one has

[TABLE]

Proof.

Using the elementary inequality $\log(e+ab)\leq\log(e+a)\log(e+b)$ , see [24], we easily obtain

[TABLE]

from which we readily deduce

[TABLE]

Fix any $\varepsilon>0$ . If $a\geq\varepsilon$ , then we obtain

[TABLE]

The case $b\geq\varepsilon$ can be treated in the same way. Finally, if $0\leq a,b\leq\varepsilon$ , then we obtain

[TABLE]

Collecting both estimates gives, for all $a,b\geq 0$ , the estimate

[TABLE]

where $g(\varepsilon)=\min\left\{\log(e+\varepsilon),\frac{\log(e+\varepsilon)}{\log(1+\varepsilon)}\right\}$ . A simple extreme value analysis shows that $g$ attains its maximum at $\varepsilon=e-1$ which gives $\inf\limits_{\varepsilon>0}g(\varepsilon)=g(e-1)=\log(2e-1)$ . ∎
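
The numerical value of the constant can be double-checked. The following Python sketch evaluates the two expressions in the definition of $g$ at $\varepsilon=e-1$, where they coincide, and additionally spot-checks the inequality $\log(e+ab)\leq\log(e+a)\log(e+b)$ quoted from [24] on a small grid. The function names and the grid are hypothetical, and the grid test is only a sanity check, not a proof.

```python
import math


def first_branch(eps):
    """First expression in the definition of g: log(e + eps)."""
    return math.log(math.e + eps)


def second_branch(eps):
    """Second expression in the definition of g: log(e + eps) / log(1 + eps)."""
    return math.log(math.e + eps) / math.log(1.0 + eps)


eps_star = math.e - 1.0
value = math.log(2.0 * math.e - 1.0)

# Spot check of log(e + ab) <= log(e + a) * log(e + b) on a coarse grid.
grid = [0.0, 0.01, 0.1, 0.5, 1.0, 2.0, 10.0, 100.0]
submultiplicative = all(
    math.log(math.e + a * b)
    <= math.log(math.e + a) * math.log(math.e + b) + 1e-12
    for a in grid for b in grid
)

print(first_branch(eps_star), second_branch(eps_star), value)
print(submultiplicative)
```

The first expression increases in $\varepsilon$ while the second decreases, so the two curves cross exactly once, at $\varepsilon=e-1$.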

Acknowledgments

The authors would like to thank Jonas Kremer for several discussions on affine processes and pointing out some interesting references on this topic. Peng Jin is supported by the STU Scientific Research Foundation for Talents (No. NTF18023).

Bibliography

[1] Aurélien Alfonsi. Affine diffusions and related processes: simulation, theory and applications, volume 6 of Bocconi & Springer Series. Springer, Cham; Bocconi University Press, Milan, 2015.
[2] Mátyás Barczy, Mohamed Ben Alaya, Ahmed Kebaier, and Gyula Pap. Asymptotic properties of maximum likelihood estimator for the growth rate for a jump-type CIR process based on continuous time observations. Stochastic Process. Appl., 128(4):1135–1164, 2018.
[3] Mátyás Barczy, Leif Döring, Zenghu Li, and Gyula Pap. On parameter estimation for critical affine processes. Electron. J. Stat., 7:647–696, 2013.
[4] Mátyás Barczy, Leif Döring, Zenghu Li, and Gyula Pap. Stationarity and ergodicity for an affine two-factor model. Adv. in Appl. Probab., 46(3):878–898, 2014.
[5] Mátyás Barczy, Zenghu Li, and Gyula Pap. Stochastic differential equation with jumps for multi-type continuous state and continuous time branching processes with immigration. ALEA Lat. Am. J. Probab. Math. Stat., 12(1):129–169, 2015.
[6] Mátyás Barczy, Zenghu Li, and Gyula Pap. Yamada-Watanabe results for stochastic differential equations with jumps. Int. J. Stoch. Anal., pages Art. ID 460472, 23, 2015.
[7] Mátyás Barczy, Zenghu Li, and Gyula Pap. Moment formulas for multitype continuous state and continuous time branching process with immigration. J. Theoret. Probab., 29(3):958–995, 2016.
[8] Mátyás Barczy, Sandra Palau, and Gyula Pap. Almost sure, $L_{1}$ - and $L_{2}$ -growth behavior of supercritical multi-type continuous state and continuous time branching processes with immigration. arXiv:1803.10176 [math.PR], 2018.


