Subexponential upper and lower bounds in Wasserstein distance for Markov processes
Ari Arapostathis, Guodong Pang, and Nikola Sandri\'c

TL;DR
This paper establishes subexponential and exponential convergence bounds in Wasserstein distance for various Markov processes, providing sharp characterizations of convergence rates under different conditions.
Contribution
It introduces new subexponential bounds for Wasserstein convergence in Markov processes using Foster-Lyapunov conditions and applies these to specific stochastic models.
Findings
Subexponential convergence bounds for irreducible, aperiodic Markov processes.
Exponential ergodicity under asymptotic flatness for Itô processes.
Sharp rate characterizations for Langevin, Ornstein-Uhlenbeck, and recurrence time chains.
Abstract
In this article, relying on Foster-Lyapunov drift conditions, we establish subexponential upper and lower bounds on the rate of convergence in the -Wasserstein distance for a class of irreducible and aperiodic Markov processes. We further discuss these results in the context of Markov L\'evy-type processes. In the lack of irreducibility and/or aperiodicity properties, we obtain exponential ergodicity in the -Wasserstein distance for a class of It\^{o} processes under an asymptotic flatness (uniform dissipativity) assumption. Lastly, applications of these results to specific processes are presented, including Langevin tempered diffusion processes, piecewise Ornstein-Uhlenbeck processes with jumps under constant and stationary Markov controls, and backward recurrence time chains, for which we provide a sharp characterization of the rate of convergence via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Stochastic processes and financial applications · Stochastic processes and statistical mechanics
Subexponential upper and lower bounds in Wasserstein distance
for Markov processes
Nikola Sandrićlabel=e1][email protected] [
Ari Arapostathislabel=e2][email protected] [
Guodong Panglabel=e3][email protected] [ University of Zagreb\thanksmarkm1, University of Texas at Austin\thanksmarkm2,
and Pennsylvania State University\thanksmarkm3
Department of Mathematics, University of Zagreb
Bijenička cesta 30, 10000 Zagreb, Croatia
Department of Electrical and Computer Engineering
University of Texas at Austin, 2501 Speedway, EER 7.824, Austin, TX 78712
Department of Computational and Applied Mathematics
Rice University, Houston, TX 77005
Abstract
In this article, relying on Foster-Lyapunov drift conditions, we establish subexponential upper and lower bounds on the rate of convergence in the -Wasserstein distance for a class of irreducible and aperiodic Markov processes. We further discuss these results in the context of Markov Lévy-type processes. In the lack of irreducibility and/or aperiodicity properties, we obtain exponential ergodicity in the -Wasserstein distance for a class of Itô processes under an asymptotic flatness (uniform dissipativity) assumption. Lastly, applications of these results to specific processes are presented, including Langevin tempered diffusion processes, piecewise Ornstein–Uhlenbeck processes with jumps under constant and stationary Markov controls, and backward recurrence time chains, for which we provide a sharp characterization of the rate of convergence via matching upper and lower bounds.
60J05; 60J25,
60H10; 60J75,
exponential and subexponential ergodicity,
Wasserstein distance,
Itô process,
Foster–Lyapunov condition,
asymptotic flatness (uniform dissipativity),
Langevin diffusion process,
Ornstein-Uhlenbeck process,
keywords:
[class=MSC]
keywords:
\startlocaldefs\endlocaldefs
,
and
1 Introduction
One of the classical directions in the analysis of Markov processes centers around their ergodic properties. In this article, we focus on both qualitative and quantitative aspects of this problem. Let be a locally compact Polish space, i.e. a locally compact separable completely metrizable topological space. Denote the corresponding metric by , and let or be the time parameter set. We endow with its Borel -algebra . Further, let \bigl{(}\Omega,\mathcal{F},\{\mathcal{F}_{t}\}_{t\in{\mathbb{T}}},\{\theta_{t}\}_{t\in{\mathbb{T}}},\{X(t)\}_{t\in{\mathbb{T}}},\{{\mathbb{P}}_{x}\}_{x\in\mathbb{X}}\bigr{)}, denoted by in the sequel, be a time-homogeneous conservative strong Markov process with càdlàg sample paths (when ) and state space \bigl{(}\mathbb{X},\mathfrak{B}(\mathbb{X})\bigr{)}, in the sense of [10]. Here, is a family of probability spaces and satisfies , is a filtration on (non-decreasing family of sub--algebras of ) and is a family of shift operators on satisfying for all . Recall, is said to be conservative if for all and . In the present article, we present (sharp) sufficient conditions under which admits a unique invariant probability measure , and which ensure that the marginals of converge to , as , in the -Wasserstein distance at exponential and subexponential rates.
1.1 Summary of the results
Before stating the main results of this article, we introduce some notation we need in the sequel. Denote by for and , the transition kernel of . We endow with the standard (Euclidean Borel in the case when , and discrete when ) -algebra. The process is called
- (i)
irreducible if there exists a -finite measure on such that whenever we have for all , where stands for the Lebesgue measure on when , and the counting measure when ; 2. (ii)
transient if it is irreducible, and there exist and a covering of , such that for all and ; 3. (iii)
recurrent if it is irreducible, and implies that for all ; 4. (iv)
aperiodic if there exists such that is irreducible, in the case when ; and there does not exist a partition with of such that for all and all , and for all , in the case when .
Let us remark that if is irreducible, then it is either transient or recurrent (see [79, Theorem 2.3]). A Borel measure on is called invariant for if for all . It is well known that if is recurrent, then it possesses a unique (up to constant multiples) invariant measure (see [79, Theorem 2.6]). If the invariant measure is finite, then it may be normalized to a probability measure. If is recurrent with finite invariant measure, then it is called positive recurrent; otherwise it is called null recurrent. Note that a transient Markov process cannot have a finite invariant measure. A set is called petite for if there exist a probability measure on and a non-trivial Borel measure on , such that
[TABLE]
for all and Recall that petite sets play a role of singletons for Markov processes on general state spaces (see [64, Chapter 5] for a detailed discussion). Denote by the class of all Borel probability measures on , and for (the space of real-valued Borel measurable functions on ) let denote the class of all with the property that . When f(x)=\bigl{(}\mathsf{d}(x_{0},x)\bigr{)}^{p} for some and , we denote this as . We adopt the usual notation
[TABLE]
for , , and . Therefore, with denoting the Dirac measure concentrated at , we have . Finally, recall that the -Wasserstein distance on with is defined by
[TABLE]
where is the family of couplings of and , i.e. if, and only if, is a probability measure on having and as its marginals. It is well known that is a complete separable metric space under the metric [82, Theorem 6.18]. The topology generated by on is finer than the Prokhorov topology, i.e. the topology of weak convergence.
We now state the main results of this article.
Theorem 1.1**.**
Suppose that is irreducible and aperiodic, and there exist a continuous , a constant , a nondecreasing differentiable concave function , and a (topologically) closed petite set such that
[TABLE]
for all . Assume further that , and
[TABLE]
for some and some (and therefore any) . Then admits a unique invariant . In addition, with and , the following hold.
- (i)
If , then for some we have
[TABLE] 2. (ii)
If , then for any there exists such that
[TABLE]
for all , where \overline{m}_{\eta}=\uppi\bigl{(}\bigl{(}\mathsf{d}(x_{0},\cdot\,)\bigr{)}^{\eta}\bigr{)}. 3. (iii)
If for some , then there exist and , such that
[TABLE]
In addition, for any there exists such that
[TABLE]
The results in Theorem 1.1 should be compared to equations (2.3) and (2.5) in [12, Theorems 2.1 and 2.4] (see also [28, Theorem 3 (ii)] and [53, Chapter 4]). The underlying metric is assumed to be bounded in [12]. The starting point is a Foster-Lyapunov condition of the form in Eq. 1.1, and the irreducibility and aperiodicity assumptions are replaced by a closely related structural property: the metric is contracting, and the sublevel sets of are -small (see (3) and (4) in [12, Theorems 2.1 and 2.4]). Then an analogous estimate to Eq. 1.3 holds for the corresponding -distance. Observe that when is bounded, the relation in Eq. 1.1 trivially holds for any . Provided is irreducible and aperiodic, this gives an analogous result to the one obtained in [12, Theorems 2.1 and 2.4] (in -distance) without assuming either contraction properties of or -smallness of the sublevel sets of . The proof of Theorem 1.1 relies on [24, Theorem 3.2] and [25, Theorem 2.8], where, under the assumptions of Theorem 1.1, the authors show ergodicity of in the -norm with rate and , for any pair of Young’s functions. Recall, for a signed Borel measure on and a function the so-called -norm of is defined as
[TABLE]
generalizing the usual total variation norm \lVert\upmu\rVert_{\mathrm{TV}}\coloneqq\sup_{g\in{\mathcal{B}}(\mathbb{X}),\,\lvert g\rvert\leq 1}\,\bigl{\lvert}\upmu(g)\bigr{\rvert}. We remark here that convergence in the -norm does not in general imply convergence in the -distance, and vice versa (see Section 3 for examples of such Markov processes).
In the following theorem we establish a lower bound for -convergence, which matches the upper bounds obtained in Eqs. 1.3 and 1.5. For (the space of continuous mappings from to ) let
[TABLE]
The space is called a length space if
[TABLE]
Theorem 1.2**.**
Assume that is a length space, satisfies Eq. 1.1, and there exist a Lipschitz continuous function and constants and , such that
[TABLE]
In addition, suppose that admits an invariant such that \int_{\mathbb{X}}\bigl{(}L(x)\bigr{)}^{\vartheta+\varepsilon}\,\uppi(\mathrm{d}x)=\infty for some . Then, for each , and , there exist a constant and a diverging increasing sequence , depending on these parameters, such that
[TABLE]
Note that the parameters , , , and are such that the exponent in the above expression is always strictly negative. Obtaining lower bound for the convergence in the total variation norm is discussed in [38, Theorem 5.1 and Corollary 5.2]. Applications of Theorem 1.2 are discussed in Section 3.
1.2 Ergodicity of a class of Lévy-type processes
Here, we discuss ergodic properties of a class of Markov processes on the Euclidean space (endowed with the standard Euclidean metric) generated by a (Lévy-type) operator given by
[TABLE]
Here, is Borel measurable, is a symmetric non-negative definite matrix-valued Borel measurable function, is a nonnegative Borel kernel on , called the Lévy kernel, satisfying
[TABLE]
and
[TABLE]
The symbol stands for the domain of , i.e. the set of functions for which Eq. 1.10 is well defined, and denote the standard inner product and the corresponding Euclidean norm on , stands for the trace of a square matrix , and denotes the Hessian of . An open (resp. closed) ball of radius centered at is denoted by (resp. ). If , we write (resp. ), and the unit open (resp. closed) ball centered at [math] is denoted by (resp. ). Observe that , where , , denotes the space of times differentiable functions such that all derivatives up to order are bounded. We also denote by \lVert M\rVert\coloneqq\bigl{(}\operatorname{Tr}MM^{\prime}\bigr{)}^{\nicefrac{{1}}{{2}}} the Hilbert–Schmidt norm of a matrix , where stands for the transpose of .
We introduce the following assumption:
- (MP)
There exists a conservative strong Markov process with càdlàg sample paths such that
[TABLE]
is a -martingale (with respect to ) for any (the space of smooth functions with compact support).
Define
[TABLE]
and observe that
[TABLE]
for all and where denotes the Fourier transform of . In other words, is a pseudo-differential operator with symbol . According to [52, Theorem 1.1], (MP) is satisfied if
- (LB)
The functions , , and x\mapsto\int_{{\mathbb{R}}^{n}}\bigl{(}1\wedge|y|^{2}\bigr{)}\,\upnu(x,\mathrm{d}y) are locally bounded.
- (SG)
is continuous for all , and is locally uniformly continuous at , i.e.
[TABLE]
Observe that the second condition in (SG) essentially means that the coefficients , , and have a sublinear growth. Namely, it is satisfied if
[TABLE]
In order to allow linear growth of the coefficients, we replace (LB) and (SG) by
- (LG)
\mathcal{L}\bigl{(}C_{c}^{\infty}({\mathbb{R}}^{n})\bigr{)}\subseteq C_{\infty}({\mathbb{R}}^{n}), is continuous for all , and
[TABLE]
(see [50, Corollary 3.2]).
Here, stands for the space of continuous functions vanishing at infinity. Clearly, the last condition in (LG) follows from
[TABLE]
Let us also remark that due to [51, Theorem A1] the map is continuous for all if and are continuous, and for any , and ,
[TABLE]
Furthermore, under the continuity of (for all ) in the same reference it has been shown that \mathcal{L}\bigl{(}C_{c}^{\infty}({\mathbb{R}}^{n})\bigr{)}\subseteq C_{b}({\mathbb{R}}^{n}). In addition, if
[TABLE]
we easily see that \mathcal{L}\bigl{(}C_{c}^{\infty}({\mathbb{R}}^{n})\bigr{)}\subseteq C_{\infty}({\mathbb{R}}^{n}).
Definition 1.1**.**
Let denote the class of positive definite matrices in . For , let for , and be some nonnegative, symmetric convex function such that for . For and , we define
[TABLE]
Further, let
[TABLE]
and when , let .
We now discuss ergodic properties of the Lévy-type process .
Theorem 1.3**.**
Assume (LB) and (MP), and suppose that is irreducible and aperiodic, and that every compact set is petite for . Then the following hold.
- (i)
If ,
[TABLE]
for some , and there exist and such that
[TABLE]
then admits a unique invariant . In addition, if , then Theorem 1.1 (i) and (ii) hold with , and . 2. (ii)
If , Eq. 1.13 holds for some , and there exists such that
[TABLE]
then admits a unique invariant . In addition, if , then the conclusion of Theorem 1.1 (iii) holds with and . 3. (iii)
Suppose that is bounded, and there exist and , such that
[TABLE]
and
[TABLE]
Then the conclusion of Theorem 1.1 (iii) holds with for any sufficiently small and any .
Irreducibility and aperiodicity are crucial structural properties of the underlying process in Theorems 1.1, 1.2 and 1.3. Roughly speaking, they ensure that the process does not show singular behavior in its motion, and together with the Foster-Lyapunov condition in Eq. 1.1 (which ensures controllability of the -modulated moment of return-times to the petite set , see [24, Theorem 4.1]) they lead to the ergodic properties stated.
Under an asymptotic flatness (uniform dissipativity) property (see Eq. 1.16), we use a completely different approach to this problem, the so-called synchronous coupling method (see [14, Example 2.16] for details), to obtain ergodic properties for a class of Itô processes which are not necessarily irreducible and aperiodic. Recall that an Itô process is a solution to a stochastic differential equation (SDE) of the following form
[TABLE]
where , and are Borel measurable, is a standard -dimensional Brownian motion, and is a Poisson random measure on \mathfrak{B}({\mathbb{R}})\otimes\mathfrak{B}\bigl{(}[0,\infty)\bigr{)}, with intensity measure (a -finite measure on ). According to [13, Theorem 3.33], every Itô process is a semimartingale Hunt process. In particular, it is a conservative strong Markov process with càdlàg sample paths. Conversely, again by [13, Theorem 3.33], for every -dimensional semimartingale Hunt process , and every -finite nonfinite and nonatomic measure on , there exist , , , , and as above (possibly defined on an enlargement of the initial stochastic basis), such that satisfies Eq. 1.15. By setting
[TABLE]
and
[TABLE]
Eq. 1.15 reads as
[TABLE]
Set , and let be as in Eq. 1.10. According to [40, Theorem II.2.42] (with ), for any , the process , defined as in Eq. 1.11, is a -local martingale for every . In addition, if (LB) holds true, then is a -local martingale for every and every , i.e. (MP) is satisfied.
For define
[TABLE]
[TABLE]
If (resp. , or ), then of course (resp. , or ) is equal to zero.
Theorem 1.4**.**
Assume that and are locally bounded and satisfy the linear growth condition in Eq. 1.12, and that is such that . If for some there exist , and a -finite nonfinite and nonatomic measure on such that Eq. 1.15 admits a unique strong solution , and
[TABLE]
for some and all , where is given in Eq. 1.15, then
[TABLE]
for all and , where () stands for the largest (smallest) eigenvalue of . Furthermore, admits a unique invariant , and
[TABLE]
for all and .
In addition, if , a constant, , , and Eq. 1.16 holds for some , then Eqs. 1.17 and 1.18 remain valid.
We remark that ergodic properties of a Markov process with respect to the -distance are invariant under the Bochner’s random time-change method. Recall that a subordinator is a nondecreasing Lévy process on with Laplace transform \mathbb{E}\bigl{[}\mathrm{e}^{-uS_{t}}\bigr{]}=\mathrm{e}^{-t\psi(u)}, . The characteristic (Laplace) exponent is a Bernstein function, i.e. it is of class and for all . It is well known that every Bernstein function admits a unique (Lévy-Khintchine) representation
[TABLE]
where is the drift parameter and is a Lévy measure, i.e. a Borel measure on \mathfrak{B}\bigl{(}(0,\infty)\bigr{)} satisfying . For additional reading on subordinators and Bernstein functions we refer the reader to the monograph [74]. Suppose is a Markov process on \bigl{(}\mathbb{X},\mathfrak{B}(\mathbb{X})\bigr{)} with transition kernel , and let be a subordinator with characteristic exponent , independent of . The process X^{\psi}(t)\coloneqq X\bigl{(}S(t)\bigr{)}, , obtained from by a random time change through , is referred to as the subordinate process with subordinator in the sense of Bochner. It is easy to see that is again a Markov process with transition kernel
[TABLE]
where . It is also elementary to check that if is an invariant measure for , then it is also invariant for the subordinate process .
Proposition 1.1**.**
Assume that admits an invariant such that for some , and all and , where is Borel measurable, and . Then,
[TABLE]
where r_{\psi}(t)\coloneqq\Bigl{(}\mathbb{E}\Bigl{[}\bigl{(}r(S(t))\bigr{)}^{p}\Bigr{]}\Bigr{)}^{\nicefrac{{1}}{{p}}}.
Ergodic properties of Markov processes under subordination in the -norm are discussed in [20, 21, 22].
1.3 Literature review
Our work contributes to the understanding of the ergodic properties of Markov processes. Most of the existing literature focuses on characterizing the exponential or subexponential ergodicity under the -norm, and in particular the total variation norm, see [3, 19, 24, 25, 26, 32, 33, 35, 58, 64, 65, 66, 78] and the references therein. However, there have been some recent developments in understanding ergodic properties of Markov processes (both continuous and discrete time) under the Wasserstein distances; see [12, 28, 34, 53, 57, 56, 29, 30, 85, 59, 60]. As already mentioned, exponential and subexponential convergence rates in the -distance for general Markov processes that are (possibly) not irreducible or aperiodic are established in [12, 28, 53], under the Foster-Lyapunov condition in Eq. 1.1, contractivity of the underlying metric, and smallness of sublevel sets of the corresponding Lyapunov function. Using the coupling approach, the authors in [29, 30, 59] studied exponential ergodicity with respect to a class of Wasserstein distances for SDEs driven by an additive Brownian noise term and a drift term satisfying an asymptotic flatness property at infinity. Under the same assumption on the drift term, these results have been extended in [60, 85] to allow for more general additive Lévy noises. Subexponential ergodicity with respect to the -distance for stochastic differential equations driven by an additive Lévy noise term, with a drift term satisfying asymptotic flatness property at zero, has been studied in [56]. By combining the Foster-Lyapunov method with the coupling approach, exponential ergodicity with respect to a class of -norms and Wasserstein distances (given in terms of the underlying Lyapunov function) is established in [57] for a class of Mckean-Vlasov SDE with Lévy noise. Lastly, exponential ergodicity with respect to the -distance for one-dimensional positive-valued stochastic differential equations with jumps and the drift term satisfying asymptotic flatness property has been studied in [34].
Our results on both exponential and subexponential ergodicity under the -distance contribute to this active research topic. Of particular interest is the result obtained in Theorem 1.2 which seems to be completely new in the literature, and which, in some cases, allows one to conclude that the obtained upper bound on the rate of convergence is sharp.
As we have already remarked, irreducibility and aperiodicity are crucial structural properties of the underlying process used in Theorems 1.1, 1.2 and 1.3. There is a vast literature on these, and related questions such as the strong Feller property and heat kernel estimates of Markov processes. In particular, we refer the readers to [8, 15, 16, 17, 18, 36, 42, 43, 45, 46, 47, 48, 55, 67, 71, 77] for the case of a class of Markov Lévy-type processes with bounded coefficients, and to [7, 9, 39, 44, 56, 62, 63, 68, 69, 72, 76, 86] for the case of a class of Itô processes.
Recall that the Foster-Lyapunov condition in Eq. 1.1 implies that for any the -modulated moment of the -shifted hitting time of of (with respect to ) is finite and controlled by (see [24, Theorem 4.1]). However, this property in general does not immediately imply ergodicity of . Namely, we also need to ensure that a similar property holds for any other “reasonable” set. If is irreducible with irreducibility measure , then indeed for any the -modulated moment of , for any with , is again finite and controlled by (see [24, the discussion after Theorem 4.1]). However, can also show certain cyclic behavior which destroys ergodicity (see [65, Section 5] and [64, Chapter 5]). By assuming aperiodicity, which excludes this type of behavior, (sub)exponential ergodicity in the -distance of follows as discussed in Theorem 1.1, and in the -norm as discussed in [33, Theorem 1].
1.4 Organization of the article
In Section 2, we give the proofs of Theorems 1.1, 1.2, 1.3, 1.4 and 1.1 together with some auxiliary lemmas. Applications of the main results to several classes of Markov processes, including Langevin tempered diffusion processes, Ornstein-Uhlenbeck processes with jumps, piecewise Ornstein-Uhlenbeck processes with jumps under constant and stationary Markov controls, state-space models, and backward recurrence time chains, are contained in Section 3.
2 Proofs of the main results
We start with the proof of Theorem 1.1.
Proof of Theorem 1.1.
We consider the case when only. The case when proceeds in an analogous way, by employing the results from [25, Theorem 2.8] and [64, Theorem 15.0.2].
First, under the assumptions of the theorem, it has been shown in [24, Proposition 3.1] and [66, Theorem 4.2] that admits a unique invariant . This, together with Eq. 1.2, implies that . We continue now with the proof of part (i). By the Kantorovich-Rubinstein theorem, we have
[TABLE]
where the supremum is taken over all Lipschitz continuous functions with Lipschitz constant . We apply [24, Theorem 3.2],
[TABLE]
Note that if is such that and , then . Thus
[TABLE]
(recall the definition of the -norm in Eq. 1.8). Now, from [24, (3.5) and (3.6)] we have
[TABLE]
for some , and all and , which proves Eqs. 1.3 and 1.4, respectively.
We next prove part (ii). Applying Eq. 1.3 and [24, (3.5)] with , and , we obtain , for some , and all and . Hence
[TABLE]
Further, for , , and , we have
[TABLE]
Using Eqs. 2.1 and 2.2, and the bound \int_{{\mathscr{B}}_{t}^{c}(x_{0})}\bigl{(}\mathsf{d}(x,x_{0})\bigr{)}^{p}\,\uppi(\mathrm{d}x)\leq t^{p-\eta}\,\overline{m}_{\eta}, we have
[TABLE]
and combining this with Eq. 1.3 we obtain
[TABLE]
for all and , from which Eq. 1.5 follows with .
Moving on to the proof of part (iii), note that according to [65, Proposition 6.1], [66, Theorem 4.2], and [26, Theorem 5.2], there exist constants and , such that
[TABLE]
Equation 1.6 now follows from the Kantorovich-Rubinstein theorem and Eq. 1.2. Let . First, from Eq. 2.3 we obtain , for some , and all and , which again implies Eq. 2.1. By Eqs. 2.1 and 2.2, we have
[TABLE]
and combining this with Eq. 1.6 we obtain
[TABLE]
from which Eq. 1.7 follows again with . This completes the proof. ∎
We proceed with the proof of Theorem 1.2.
Proof of Theorem 1.2.
We again consider the case when only. The case when proceeds in a similar manner.
Fix some , and . For , define by
[TABLE]
We have
[TABLE]
Since, by assumption, \int_{\mathbb{X}}\bigl{(}L(x)\bigr{)}^{\vartheta+\varepsilon}\,\uppi(\mathrm{d}{x})=\infty, there exists an increasing diverging sequence such that
[TABLE]
Note also that \bigr{(}f_{s}(x)\bigl{)}^{p}\leq 2^{\theta-p}s^{p-\theta}\,\bigl{(}L(x)\bigr{)}^{\theta}\leq\frac{2^{\theta-p}}{c}\,s^{p-\theta}\,{\mathscr{V}}(x) for all and . This follows from the facts that for and such that ,
[TABLE]
and . Thus, by the Foster-Lyapunov equation Eq. 1.1 (see [66, Theorem 1.1]), we obtain
[TABLE]
Select a sequence such that
[TABLE]
Combining Eqs. 2.4, 2.5, 2.6 and 2.7 above we have
[TABLE]
The result then follows by [82, Proposition 7.29], which asserts that
[TABLE]
for all and Lipschitz with Lipschitz constant \mathrm{Lip}\bigl{(}f\bigr{)}. ∎
For the proof of Theorem 1.3 we need two auxiliary results given in Lemmas 2.1 and 2.2 below. First, recall that is said to be conservative if for all and , and note that this is equivalent to
[TABLE]
where for (here it is also essential that has càdlàg sample paths). Namely, for and it holds that
[TABLE]
Lemma 2.1**.**
Assume (LB) and (MP). Then for any and any nonnegative such that the map is locally bounded, is a -local martingale (with respect to ).
Proof.
For , let be such that and for . Then, for any , and , , [31, Theorem 2.2.13] implies that
[TABLE]
Next, by employing the monotone and dominated convergence theorems, we easily see that
[TABLE]
and
[TABLE]
Hence, for each , and , is integrable. Also,
[TABLE]
for all , , and . The assertion now follows from the conservativeness of . ∎
For we let
[TABLE]
[TABLE]
whenever the integrals are well defined.
Lemma 2.2**.**
Suppose that , and that Eq. 1.13 holds for some . Then, we have the following:
- (i)
If , and satisfies
[TABLE]
then vanishes at infinity. 2. (ii)
If , and satisfies
[TABLE]
then vanishes at infinity when , and the map is bounded when . 3. (iii)
If Eq. 1.14 holds for some , then there exist and , such that for any \zeta\in\bigl{(}0,\frac{1}{2}\theta\lVert Q\rVert^{-\nicefrac{{1}}{{2}}}\bigr{)} we have
[TABLE]
Proof.
The proof of parts (i) and (ii) follows as a straightforward adaptation of [7, Lemma 5.1] by setting
[TABLE]
To prove part (iii), we use the identity
[TABLE]
Consider the set
[TABLE]
On this set we have the bound
[TABLE]
for some . Since , and on the set , there exists such that
[TABLE]
for all and . Hence, using Eqs. 2.10 and 2.11 and Fubini’s theorem, we have
[TABLE]
for all . Next, since on the set , we have a bound of the form
[TABLE]
for all and , where, without loss of generality, we use the same constant as in Eq. 2.10. Since , it is clear that there exists , independent of , such that
[TABLE]
Thus, by Eqs. 1.14, 2.13 and 2.14, there exists such that
[TABLE]
The estimate in Eq. 2.8 follows from Eqs. 1.14, 2.9, 2.12 and 2.15. This completes the proof. ∎
We next prove Theorem 1.3.
Proof of Theorem 1.3.
In cases (i) and (ii), we take , while in case (iii) we use with sufficiently small. Then, in view of Lemma 2.2 it is straightforward to see that there exist constants , , and , such that
[TABLE]
in case (i), and
[TABLE]
in cases (ii) and (iii). Observe that the above relations, together with [66, Theorem 2.1] and Lemma 2.1, imply that is conservative. Finally, according to Lemma 2.1 and [24, Theorem 3.4] the process satisfies Eq. 1.1 with in case (i), and in cases (ii) and (iii) (for some and closed petite set ). ∎
The proof of Theorem 1.4 is based on the following lemma.
Lemma 2.3**.**
Let be an Itô process with locally bounded coefficients and and satisfying the linear growth condition in Eq. 1.12, and such that . Then, for any , there exists a constant such that
[TABLE]
Proof.
Let be such that and for , and for . Further, for , let be such that , , and , as , for every . Then, according to Itô’s formula and the conservativeness of we have
[TABLE]
for all , , and , where the constants depend on , , , and the quantities
[TABLE]
for large enough. Clearly, the functions can be chosen such that . Now, since the function t\mapsto{\mathbb{E}}_{x}\bigl{[}\varphi_{k}(X\bigl{(}t\wedge\tau_{k})\bigr{)}\bigr{]} is bounded and càdlàg, Gronwall’s lemma implies that
[TABLE]
for all , , and . By letting , Fatou’s lemma and the conservativeness of imply that
[TABLE]
Finally, we have that
[TABLE]
This completes the proof. ∎
We next prove Theorem 1.4.
Proof of Theorem 1.4.
For , define , , and
[TABLE]
Calculating , using Eq. 1.16, we obtain
[TABLE]
[TABLE]
for all . Next, for , let (possibly ), where denotes the solution to Eq. 1.15 with for . By Itô’s formula and the conservativeness of we obtain
[TABLE]
for all and , since, for , a.s. by the pathwise uniqueness of the solution to Eq. 1.15. From this and Lemma 2.3 we conclude that the function t\mapsto{\mathbb{E}}\bigl{[}{\mathscr{V}}_{p}\bigl{(}X^{x+z}(t\wedge\tau\wedge\tau_{k})-X^{x}(t\wedge\tau\wedge\tau_{k})\bigr{)}\bigr{]} is differentiable a.e. on . Note that |\tilde{{\mathcal{L}}}{\mathscr{V}}_{p}\bigl{(}x;z\bigr{)}|\leq c|z|^{p} for some and all , We conclude now that
[TABLE]
for all . Thus by Gronwall’s lemma, it follows that
[TABLE]
and Fatou’s lemma implies that
[TABLE]
for all and . Next, from the bound we obtain
[TABLE]
for all and , thus establishing Eq. 1.17.
Finally, in order to establish Eq. 1.18, we follow the idea from [59, Proof of Corollary 1.8] or [49, Proof of Theorem 2.1]. Observe first that, according to Lemma 2.3, for any , for all . Next, let be arbitrary. According to Eq. 1.17, we have
[TABLE]
Fix such that
[TABLE]
Then, the mapping is a contraction on . Thus, since is a complete metric space, the Banach fixed point theorem entails that there exists a unique such that . By defining , we can easily see that for all , i.e. is an invariant probability measure for . By employing Lemma 2.1 again, we also see that . Finally, for any we have
[TABLE]
which also proves uniqueness of .
To prove the second assertion, we adapt the proof of [4, Lemma 7.3.4], where an analogous result is shown for . Define
[TABLE]
and observe that in this case reduces to
[TABLE]
Calculating , using Eq. 1.16, we obtain
[TABLE]
As before, by Itô’s formula and the conservativeness of , combined with the fact that the Lévy noise does not depend on the state, we obtain
[TABLE]
for all and , and
[TABLE]
for all . Thus by Gronwall’s and Fatou’s lemmas it follows that
[TABLE]
for all and . Taking limits as , and using monotone convergence, the assertion follows. ∎
In what follows we give an alternative proof of Theorem 1.4 in the case when and . Let for . Clearly, is again an Itô process which satisfies
[TABLE]
where is an -dimensional pure-jump and zero-drift Lévy process determined by . The corresponding transition probability satisfies
[TABLE]
Thus, we have
[TABLE]
Now, in [11] it has been shown that Eq. 2.16 implies that
[TABLE]
for all and . Finally we get
[TABLE]
for all and , which is Eq. 1.17.
Lastly, we prove Proposition 1.1.
Proof of Proposition 1.1.
According to [82, Theorem 4.1], for each there exists such that . Now, we have that
[TABLE]
which completes the proof. ∎
3 Examples
In this section, we consider applications of the main results to several classes of Markov processes, including Langevin tempered diffusion processes, Ornstein-Uhlenbeck processes with jumps, piecewise Ornstein-Uhlenbeck processes with jumps under constant and stationary Markov controls, state-space models and backward recurrence time chains. Further examples can be found in [24, 25, 32, 33, 78].
3.1 Langevin tempered diffusion processes
We first consider a class of Langevin tempered diffusion processes. Let , and let be strictly positive, for some and all , and . Further, for and , let
[TABLE]
and
[TABLE]
Then, in [33, Proposition 15] it has been shown that the SDE
[TABLE]
admits a weak solution , which is a conservative strong Markov process with continuous sample paths. Moreover, it is irreducible, aperiodic, every compact set is petite, and is its unique invariant probability measure. Here, is a standard -dimensional Brownian motion. Note also that according to Itô’s formula satisfies (MP) with
[TABLE]
Proposition 3.1**.**
- (i)
If \beta\in\bigl{[}\alpha,\frac{1}{2}(1+\alpha(1-n))\bigr{)}, then the assertions of Theorem 1.1 (iii) hold with and for any . 2. (ii)
If and , then the assertions of Theorem 1.1 (i) and (ii) hold with
[TABLE]
for any . 3. (iii)
Under the assumptions of (ii), for \iota\in\bigl{(}0,\alpha^{-1}(1-\alpha(n+1))\bigr{)}. Let and \varepsilon\in\bigl{[}\alpha^{-1}\rho,2\alpha^{-1}(\alpha-\beta)\bigr{)} be fixed. Then, for every p\in\bigl{[}1,\alpha^{-1}(1-n\alpha-\rho)\bigr{]} and there exist a positive constant and a diverging increasing sequence , depending on the above parameters, such that Eq. 1.9 in Theorem 1.2 holds with as above, , and .
Proof.
- (i)
In [33, Theorem 16 (i)] it has been shown that for \beta\in\bigl{[}\alpha,\frac{1}{2}(1+\alpha(1-n))\bigr{)} and the Foster-Lyapunov condition in Eq. 1.1 holds with as above, and for some large enough. Also, the relation in Eq. 1.2 easily follows from the form of and , and the choice of .
- (ii)
In [33, Theorem 16 (ii)] it has been shown that for , and , the Foster-Lyapunov condition in Eq. 1.1 holds with and as above and for some large enough. The relation in Eq. 1.2 can again be easily verified due to the form of and , and the choice of .
- (iii)
Since , we have . The assertion now follows from Theorem 1.2 by taking .
This completes the proof. ∎
Remark 3.1*.*
Observe that the rates obtained in Proposition 3.1 (ii) and (iii) match. Also, in Proposition 3.1 (ii) we assume that \alpha\in\bigl{(}0,(n+1)^{-1}\bigr{)}. Namely, for \alpha\in\bigl{[}(n+1)^{-1},n^{-1}\bigr{)} it holds that , and hence convergence in the -distance cannot hold. On the other hand, in this case, [33, Theorem 16 (ii)] shows subexponential convergence in the -norm. In the following subsections we give examples of Markov processes which are ergodic in the -distance but not in the -norm. For additional results on ergodic properties of Langevin tempered diffusion processes with respect to the -norm see [24] and [33].
3.2 Ornstein-Uhlenbeck processes with jumps
We next consider a class of Itô processes with linear drift. Let be an matrix, and let be an -dimensional Lévy process determined by Lévy triplet \bigl{(}b_{L},a_{L},\upnu_{L}(\mathrm{d}y)\bigr{)}. It is well known that the SDE
[TABLE]
admits a unique conservative strong solution which is a strong Markov process with càdlàg sample paths (see e.g. [2, Theorem 3.1 and Proposition 4.2]). In particular, is an Itô process satisfying (MP) with , , and . This process is known as an Ornstein-Uhlenbeck process with jumps. In the case when is a standard Brownian motion, is the classical Ornstein-Uhlenbeck process. If is a Hurwitz matrix (a square matrix whose eigenvalues have all strictly negative real parts), it has been shown in [73, Theorems 4.1 and 4.2] that admits a unique invariant if, and only if,
[TABLE]
Moreover, if this is the case, then \lim_{t\to\infty}\updelta_{x}P_{t}\bigl{(}f\bigr{)}\,=\,\uppi\bigl{(}f\bigr{)} for all and , i.e. for any , the transition kernel converges weakly, as , to . However, this is not enough for -convergence of to (see [82, Theorem 6.9]). Assume additionally that , and let . Since is Hurwitz, there exists such that (see [27, Lemma 2.2]). The left-hand side of Eq. 1.16 then reads
[TABLE]
Now, by setting
[TABLE]
the assertions of Theorem 1.4 follow. We remark here that this result does not necessarily imply ergodicity of in the -norm. Indeed, let , and take for . Then it is easy to see that for . Thus, , and converges to , as , in -distance for any , but clearly this convergence cannot hold in the -norm.
If , and satisfies the assumptions in [7, Theorem 3.1] (which ensure that is irreducible and aperiodic, and that the support of the corresponding irreducibility measure has nonempty interior), then according to [2, Proposition 4.3] and [79, Theorems 5.1 and 7.1] (which imply that every compact set is petite for ) the conclusions of Theorem 1.3 (ii) hold true for any . If , then under the same assumptions as above, [26, Theorem 5.2], [65, Proposition 6.1], and [66, Theorem 4.2] (and [2, Proposition 4.3], and [79, Theorems 5.1 and 7.1]) imply that for any the process is exponentially ergodic in the -norm with . However, this does not necessarily imply ergodicity of in the -distance. To see this take again , and let be a one-dimensional symmetric -stable Lévy process with and symbol (characteristic exponent) . Thus, , and . We claim that . Assume this is not the case. Then,
[TABLE]
In particular, for every it holds that , -a.e. On the other hand, according to [73, Theorem 3.1], we have
[TABLE]
for all , and where is a probability measure on with characteristic function
[TABLE]
and denotes the space of bounded functions in . Hence, is the law of a symmetric -stable random variable. Now, the monotone convergence theorem implies that
[TABLE]
which is impossible.
Let us mention that ergodic properties of Ornstein-Uhlenbeck processes with jumps in the -norm, and in particular in the total variation norm, have been considered in [41, 61, 75, 83, 84].
3.3 Piecewise Ornstein-Uhlenbeck processes with jumps
We extend the results from the previous subsection to a class of Itô processes with a piecewise linear drift. Consider an -dimensional SDE of the form
[TABLE]
where
- (i)
the function is given by
[TABLE]
where , has nonnegative components and satisfies with , is a nonsingular M-matrix such that the vector has nonnegative components, and with for ; 2. (ii)
is a standard -dimensional Brownian motion, and the covariance function is locally Lipschitz continuous and satisfies, for some ,
[TABLE] 3. (iii)
is a -dimensional pure-jump Lévy process specified by a drift and Lévy measure .
Recall that a matrix is called an M-matrix if it can be expressed as for some and some nonnegative matrix with the property that , where and denote the identity matrix and the spectral radius of , respectively. Clearly, the matrix is nonsingular if . It is well known that the SDE in Eq. 3.1 admits a unique conservative strong solution which is a strong Markov process with càdlàg sample paths (see e.g. [2, Theorem 3.1 and Proposition 4.2]). In particular, is an Itô process satisfying (MP) with , , and . This process is often called a piecewise Ornstein-Uhlenbeck process with jumps. It arises as a limit of the suitably scaled queueing processes of multiclass many-server queueing networks with heavy-tailed (bursty) arrivals and/or asymptotically negligible service interruptions. In these models, if the scheduling policy is based on a static priority assignment on the queues, then the vector in the limiting diffusion Eq. 3.1 corresponds to a constant control. The process also arises in many-server queues with phase-type service times, where the constant vector corresponds to the probability distribution of the phases. For a multiclass queueing network with independent heavy-tailed arrivals, the process is an anisotropic Lévy process consisting of independent one-dimensional symmetric -stable components. Under service interruptions, is either a compound Poisson process, or an anisotropic Lévy process described above together with a compound Poisson component. More details on these queueing models can be found in [7, Section 4].
We first discuss the case when . This corresponds to the case when the control gives lowest priority to queues whose abandonment rate is zero. When , we define
[TABLE]
Proposition 3.2**.**
In addition to the assumptions of [7, Theorem 3.1] (which ensure that is irreducible and aperiodic with irreducibility measure having support with nonempty interior), suppose that , , and \bigl{\langle}e,M^{-1}\tilde{l}\bigr{\rangle}<0.
- (i)
If
[TABLE]
then there exists such that the assertions of Theorem 1.3 (i) hold true with . 2. (ii)
If is bounded, and for some , then there exists such that the assertions of Theorem 1.3 (iii) hold.
Proof.
- (i)
In [7, Theorem 3.2 (i)] it has been shown that there exist , , and , such that for any , we have
[TABLE]
It is easy to see that the above relation implies that there exist , , and , such that
[TABLE]
with . The assertion now follows from Theorem 1.3 (i), together with [2, Proposition 4.3], [24, Theorem 3.4], and [79, Theorems 5.1 and 7.1].
- (ii)
Let . As shown in the proof of [7, Theorem 3.2 (ii)], there exist , and , such that for any \zeta\in\bigl{(}0,\theta\lVert Q\rVert^{-\nicefrac{{1}}{{2}}}\bigr{)},
[TABLE]
This together with Lemma 2.2 (iii) imply that, for any sufficiently small, there exist and , such that
[TABLE]
Again, It is straightforward to see that the above relation implies that there exist , and , such that
[TABLE]
The assertion now follows from Theorem 1.3 (iii), and the results from [2, 24, 79] cited in part (i).∎
Remark 3.2*.*
It has been shown in [7, Theorem 3.3 (b) and Lemma 5.7] that the assumptions and are both necessary for the existence of an invariant probability measure of . Using this, we can exhibit an example where we have ergodicity with respect to the -norm but not with respect to -distance. Suppose that , , satisfies Eq. 3.3, and is a rotationally invariant -stable process with . Then [7, Theorem 3.1 (i)] shows that admits a unique invariant for , and
[TABLE]
for all and . Here, stands for the total variation norm, i.e. the -norm with . However, by [7, Theorem 3.4 (b)], so we cannot have convergence in -distance.
We next exhibit a lower bound on the polynomial rate of convergence in Proposition 3.2 (i), which is analogous to [7, Theorem 3.4]. We let
[TABLE]
Note that, in general, . In [7] it is assumed that is a compound Poisson process with drift , and Lévy measure which is supported on a half-line of the form with , and satisfies Eq. 3.3. This implies that , and subsequently, this equality is used in the proof of [7, Lemma 5.7 (b)] to establish that, provided , \int_{{\mathbb{R}}^{n}}\bigl{(}\langle e,M^{-1}x\rangle^{+}\bigr{)}^{p-1}\,\uppi(\mathrm{d}x)<\infty implies for . We use this fact, namely that the conclusions of [7, Lemma 5.7 (b)] hold under the weaker assumption that in the proof of the following proposition.
Proposition 3.3**.**
In addition to the assumptions of [7, Theorem 3.1], assume that , \bigl{\langle}e,M^{-1}\tilde{l}\bigr{\rangle}<0, and . Then, due to Proposition 3.2 (i), admits a unique invariant , . Next, fix and . Then, for any and there exist and a diverging increasing sequence , depending on these parameters, such that Eq. 1.9 holds with , , and , where is given in Proposition 3.2 (i).
Proof.
Observe first that . Thus, according to [7, Lemma 5.7 (b)], we have
[TABLE]
The assertion now follows from the proof of Proposition 3.2 (i) (together with [2, Proposition 4.3], [24, Theorem 3.4], and [79, Theorems 5.1 and 7.1]), and Theorem 1.2 by setting and . ∎
We now discuss the case when . For , we write () to indicate that all components of are nonnegative (nonnegative and at least one is strictly positive). Also, for we write if, and only if, .
Proposition 3.4**.**
In addition to the assumptions of [7, Theorem 3.1], suppose that ,
[TABLE]
and that one of the following holds:
- (i)
; 2. (ii)
* with , , and .*
Then there exists such that the assertions of Theorem 1.3 (ii) hold true.
Proof.
In [7, Theorem 3.5] it has been shown that there exist , , and , such that for any , we have
[TABLE]
As in Proposition 3.2, it is easy to see that the above relation implies that there exist , and , such that
[TABLE]
with . The assertion now follows from Theorem 1.3 (ii), together with the results from [2, 24, 79] cited in the proof of Proposition 3.3. ∎
In the case when (under (i) or (ii) in Proposition 3.4) the dynamics are contractive in the -distance. This is shown by establishing an asymptotic flatness (uniform dissipativity) property for . As a consequence, we assert exponential ergodicity of with respect to , without assuming irreducibility and aperiodicity, i.e. we allow the SDE in Eq. 3.1 to be degenerate.
Proposition 3.5**.**
Suppose that , is Lipschitz continuous, and either (i) or (ii) in Proposition 3.4 holds. Then there exists such that the matrices
[TABLE]
are in . Let denote the smallest eigenvalue of the positive definite matrices in Eq. 3.6, and , denote the largest, smallest eigenvalue of , respectively. For , let
[TABLE]
where is the Lipschitz constant of with respect to the Hilbert-Schmidt norm, and suppose that for some . Then the assertions of Theorem 1.4 hold true. If and , the assertions of Theorem 1.4 hold true for any .
Proof.
Existence of the matrix has been proven in [7, Theorem 3.5]. We prove that Eq. 1.16 holds with defined above. First, clearly,
[TABLE]
for all . We next discuss the term \bigl{\langle}\varDelta_{y-x}\tilde{b}(x),Q(y-x)\bigr{\rangle} for . Clearly, for . With , we have . If both and are on the same half-space, i.e. and , or the opposite, then
[TABLE]
So suppose, without loss of generality, that and . Then we have
[TABLE]
We distinguish two cases.
- (i)
\bigl{\langle}y-x,QM\hat{v}e^{\prime}x\bigr{\rangle}\leq 0. Then of course subtracting Eq. 3.8a from Eq. 3.8b, we obtain
[TABLE] 2. (ii)
\bigl{\langle}y-x,QM\hat{v}e^{\prime}x\bigr{\rangle}>0. Since , we must have \bigl{\langle}y-x,QM\hat{v}\bigr{\rangle}>0. This in turn implies, since , that
[TABLE]
Adding Eqs. 3.8a and 3.9 and subtracting Eq. 3.8b from the sum, we obtain
[TABLE]
Finally, combining Eqs. 3.7 and 3.10, we obtain
[TABLE]
thus completing the proof. ∎
The hypothesis in Proposition 3.5 that is, of course, always true if , in which case we have . This is the scenario for multiclass queueing models with service interruptions described in [7, Section 4.2].
Some examples of degenerate SDEs of the form Eq. 3.1 for which Proposition 3.5 is applicable are the following.
- (i)
is given by for , where has rank smaller than , and is a -dimensional Lévy process. As a special case may be composed of mutually independent -stable processes. This is the case in the queueing example described below. 2. (ii)
is a degenerate subordinate Brownian motion, as studied in [87].
The following is an example of a degenerate SDE that arises in applications for which Proposition 3.4 is applicable. Consider a two class queue with class-1 jobs having a Poisson process, and class-2 jobs having a heavy-tailed renewal arrival process. Service and patience times are exponentially distributed with rates and for , respectively. Assume that the arrival, service and abandonment processes are mutually independent, and that the number of servers is . Consider a sequence of such models indexed by , operating in the critically loaded asymptotic modified Halfin-Whitt regime as . Let denote the arrival process for class , with arrival rates . Assume that and for are independent of , and that as , for . The arrival process satisfies a functional central limit theorem (FCLT) with a Brownian motion limit , where is a standard Brownian motion, i.e.
[TABLE]
Here, denotes the convergence in the space of càdlàg functions endowed with the Skorokhod topology. We assume that the arrival process satisfies a FCLT with a symmetric -stable Lévy process , , in the limit, i.e.
[TABLE]
Here, denotes the convergence in the space with the topology. Let and for . The modified Halfin-Whitt regime requires the parameters satisfy
[TABLE]
In addition, we assume that as for . Next, let denote the number of class- jobs in the system. Define the scaled processes for . Let be the scheduling control process, representing allocations of service capacity to class . Let \hat{X}^{k}(t)=\bigl{(}\hat{X}^{k}_{1}(t),\hat{X}^{k}_{2}(t)\bigr{)}^{\prime} and U^{k}(t)=\bigl{(}U^{k}_{1}(t),U^{k}_{2}(t)\bigr{)}^{\prime} for . We consider work conserving and preemptive scheduling policies resulting in constant controls at the limit, i.e. , where for with being a probability vector. Then, as in [7, Theorem 4.1], it can shown that , where the limit process is a solution to the following two-dimensional degenerate -stable driven SDE:
[TABLE]
which is Eq. 3.1 with , , , , and for . Observe that the process does not fall into any of the four categories in [7, Theorem 3.1]. In fact, one can consider multiple classes of jobs with all heavy-tailed arrival processes that have different scaling parameters ’s for , in their corresponding FCLTs. The centered queueing process should be scaled as , where , and the limit process has the components driven by independent -stable processes if the arrival process of class has the parameter equal to the minimum , and the other components are degenerate without stochastic driving terms.
We remark here that without assuming irreducibility and aperiodicity, establishing subgeometric ergodicity in the case is difficult. Consider the following example. Let , , for , and
[TABLE]
Clearly, satisfies all the assumptions in [7], and
[TABLE]
A straightforward calculation shows that
[TABLE]
Let
[TABLE]
Then it is easy to see that the conditions (1)–(3) in [12, Theorem 2.4] hold. However, condition (4) does not hold. Namely, for arbitrary let . Then, \mathsf{d}\bigl{(}X^{x}(t),X^{y}(t)\bigr{)}=\mathsf{d}(x,y) for all .
Let us mention that ergodic properties of piecewise Ornstein-Uhlenbeck processes with jumps in the total variation norm have been considered in [7, 23, 70].
3.4 Piecewise Ornstein-Uhlenbeck processes with jumps under
stationary Markov controls
In Subsection 3.3 we consider a model with a constant control, i.e. with the vector being constant and fixed. If the scheduling policy (control) is a function of the state of the system, then in the limiting SDE Eq. 3.1 is, in general, a Borel measurable map from to . We call such a a stationary Markov control and denote the set of such controls by . If for , or it is a compound Poisson process, it follows from the results in [37] that, under any , Eq. 3.1 admits a unique conservative strong solution which is a strong Markov process with càdlàg sample paths. In the general case, we consider the subclass of stationary Markov controls for which
[TABLE]
is locally Lipschitz continuous. We let denote the class of such controls. Clearly, for any , the drift has at most linear growth. Other parameters are as in Subsection 3.3. Again, the SDE of the form Eq. 3.1, with replaced by , admits a unique conservative strong solution which is a strong Markov process with càdlàg sample paths. Also, it is an Itô process satisfying (MP) with , , and .
Recently, in [6] the authors have studied ergodic properties with respect to the total variation norm of this model with being either (or a combination of) a rotationally invariant -stable Lévy process, an anisotropic Lévy process consisting of independent one-dimensional symmetric -stable components, or a compound Poisson process. Observe that in this situation we cannot follow the procedure from the constant control case. Namely, the matrices used in constructing the appropriate Lyapunov functions depend on .
Proposition 3.6**.**
Grant the assumptions of [7, Theorem 3.1], and suppose that , with for
- (i)
Assume that the diagonal components of are strictly positive, satisfies Eq. 3.5, and is either a rotationally invariant -stable Lévy process, an anisotropic Lévy process consisting of independent one-dimensional symmetric -stable components (in both cases we assume that ), or a compound Poisson process satisfying . We allow to have a drift. Then, for any and , the assertions of Theorem 1.1 (iii) hold true with , and {\mathscr{V}}(x)=\bigl{(}\bar{\mathscr{V}}(x)\bigr{)}^{\theta}+1, where (given explicitly in **[6, Definition 1])* is bounded from below away from zero, is Lipschitz continuous, and satisfies*
[TABLE] 2. (ii)
Assume \bigl{\langle}e,M^{-1}\tilde{l}\bigr{\rangle}<0, where is given in Eq. 3.2, satisfies Eq. 3.3, and is a pure-jump Lévy process (possibly with drift) satisfying . Then, for any and , the assertions of Theorem 1.1 (i) and (ii) hold true with , , and as in (i). 3. (iii)
In addition to the assumptions in (ii) assume that , where is given in Eq. 3.4. Then, due to (ii), for any , admits a unique invariant for . Next, fix and . Then, for any such that a.e., and , there exist and a diverging increasing sequence , depending on these parameters, such that Eq. 1.9 holds for the corresponding with , , and as above.
Proof.
- (i)
Observe first that in the case when is a rotationally invariant -stable Lévy process or an anisotropic Lévy process consisting of independent one-dimensional symmetric -stable components, . In [6, Theorem 3 and the discussion after Theorem 5] it has been shown that for any and there exist and , such that
[TABLE]
It is easy to see that the above relation implies that there exist , , and , such that
[TABLE]
The assertion then follows from Theorem 1.1 (iii), together with [2, Proposition 4.3], [24, Theorem 3.4], and [79, Theorems 5.1 and 7.1].
- (ii)
In Theorem 5 and the discussion following the proof of this theorem in [6] it has been shown that for any and there exist , , and , such that
[TABLE]
It is easy to see that the above relation implies that there exist , , and , such that
[TABLE]
with given as above. The assertion now follows from Theorem 1.1 (i) and (ii), together with the results from [2, 24, 79] cited in part (i).
- (iii)
Clearly, . Thus, according to [7, Lemma 5.7 (b)],
[TABLE]
The assertion now follows from (the proof of) (ii) (together with the results from [2, 24, 79] cited in part (i)), and Theorem 1.2 by setting and .∎
As discussed in Subsection 3.3, the hypothesis that is true if is a compound Poisson process (possibly with drift) with Lévy measure supported on a half-line of the form with .
Ergodic properties in the -norm of piecewise Ornstein-Uhlenbeck processes with jumps under stationary Markov controls have been considered in [5, 6].
3.5 State-space models
Let be continuous, and such that for some and all Further, let be an -valued random variable, and let be a sequence of i.i.d. -valued random variables independent of . Assume that the common distribution of has a nontrivial absolutely continuous component which is bounded away from zero in a neighborhood of the origin. Then the Markov process defined by
[TABLE]
is irreducible, aperiodic, and all compact sets are petite (see [78, Proposition 5.2]). Further, assume that there exist constants , , , and , such that
[TABLE]
Proposition 3.7**.**
Under the above assumptions, the assertions of Theorem 1.1 (i) and (ii) hold with , , and .
Proof.
In [78, Proposition 5.2] it has been proved that the Foster-Lyapunov condition in Eq. 1.1 holds with and as above, and for some . The result now follows from Theorem 1.1 (i) and (ii). ∎
Ergodic properties of state-space models in the -norm have been studied in [78, 32].
3.6 Backward recurrence time chain
Let be such that , for , and , as . Let be a Markov process on defined by the transition kernel for . The process is irreducible and aperiodic, and it admits a unique invariant if, and only, if
[TABLE]
In this case, , and for .
Proposition 3.8**.**
- (i)
If there exist and , such that for , then the assertions of Theorem 1.1 (i) and (ii) hold with
[TABLE] 2. (ii)
Under the assumptions in (i), for . Next, fix and . Then, for every and there exist a positive constant and a diverging increasing sequence , depending on these parameters, such that Eq. 1.9 holds with as above, , and .
Proof.
- (i)
In [25, Section 3] it has been shown that the Foster-Lyapunov condition in Eq. 1.1 holds with a Lyapunov function which asymptotically behaves like , as above, and being a finite set for any and . Taking into account Eq. 1.2, the assertion follows.
- (ii)
From the assumptions on the sequence we see that . Now, since , we have . The assertion now follows from Theorem 1.2 by taking .∎
4 Concluding Remarks
We remark on some other approaches in the study of exponential or subexponential ergodicity of Markov processes. By analyzing polynomial moments of hitting times of compact sets directly, polynomial ergodicity results are established in [80, Theorem 6] for a class of irreducible (with respect to the Lebesgue measure) and aperiodic diffusion processes. In a follow-up work, by using analogous techniques, the same author established polynomial ergodicity of a class of diffusion processes without directly assuming irreducibility and aperiodicity of the process, but employing instead a so-called (local) Dobrushin condition (also known as Markov-Dobrushin condition) [81, Theorem 6]. This approach is based on a Foster-Lyapunov condition of the form Eq. 1.1, and instead of assuming irreducibility and aperiodicity of , it is assumed that (i) has precompact sub-level sets, and (ii) for every there exists such that
[TABLE]
(see [53, Chapter 3]). Observe that this condition actually means that for each satisfying the probability measures and are not mutually singular. Intuitively, the Dobrushin condition encodes irreducibility and aperiodicity of , and petiteness of sub-level sets of . Based on these assumptions, and using an appropriate Markov coupling of , it follows that the -modulated moment of the corresponding coupling time is finite and controlled by . This then implies (sub)geometric ergodicity of in the total variation norm (see [38, Theorem 4.1] or [53, Chapter 3]).
We remark that irreducibility and aperiodicity (together with Eq. 1.1) imply that the Dobrushin condition holds on the Cartesian product of any petite set with itself. Namely, according to [65, Proposition 6.1], for any petite set there exists such that for the measure (in the definition of petiteness) the Dirac measure in can be taken (together with some non-trivial measure ). Thus, for any and , which implies that
[TABLE]
If, in addition, is -Feller (i.e. is continuous and bounded for any and any continuous and bounded function ), and the support of the corresponding irreducibility measure has nonempty interior, then every compact set is petite (see [79, Theorems 5.1 and 7.1]) and thus Eq. 4.1 holds for any bounded set . This shows that, at least in this particular situation, the approach based on the Dobrushin condition is more general than the approach based on irreducibility and aperiodicity. Situations where it has a clear advantage are discussed in [54, 1]. In [54], the author considers a Markov process obtained as a solution to a Lévy-driven SDE with highly irregular coefficients and noise term; while in [1], a diffusion process with highly irregular (discontinuous) drift function and uniformly elliptic diffusion coefficient has been considered. In these concrete situations it is not clear whether one can obtain irreducibility and aperiodicity of the processes, whereas the authors obtain Eq. 4.1 for any compact set (see [54, Theorem 1.3] and [1, Lemma 3]). For additional results on ergodic properties of Markov processes based on the Dobrushin condition we refer the readers to [38, 53].
Acknowledgements
We thank the anonymous referee for the helpful comments that have led to significant improvements of the results in the article. This research was supported in part by the Army Research Office through grant W911NF-17-1-001, and in part by the National Science Foundation through grants DMS-1715210, CMMI-1635410, and DMS-1715875. Financial support through the Alexander von Humboldt Foundation (No. HRV 1151902 HFST-E) and the Croatian Science Foundation under the project 8958 (for N. Sandrić) is gratefully acknowledged.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] {barticle} [author] \bauthor \bsnm Abourashchi, \bfnm N. \binits N. and \bauthor \bsnm Veretennikov, \bfnm A. Yu. \binits A. Y. ( \byear 2010). \btitle On stochastic averaging and mixing. \bjournal Theory Stoch. Process. \bvolume 16 \bpages 111–129. \bmrnumber 2779833 \endbibitem
- 2[2] {barticle} [author] \bauthor \bsnm Albeverio, \bfnm Sergio \binits S., \bauthor \bsnm Brzeźniak, \bfnm Zdzisław \binits Z. and \bauthor \bsnm Wu, \bfnm Jiang-Lun \binits J.-L. ( \byear 2010). \btitle Existence of global solutions and invariant measures for stochastic differential equations driven by Poisson type noise with non-Lipschitz coefficients. \bjournal J. Math. Anal. Appl. \bvolume 371 \bpages 309–322. \bdoi 10.1016/j.jmaa.2010.05.039 \bmrnumber 2661009 \endbibitem
- 3[3] {barticle} [author] \bauthor \bsnm Andrieu, \bfnm Christophe \binits C., \bauthor \bsnm Fort, \bfnm Gersende \binits G. and \bauthor \bsnm Vihola, \bfnm Matti \binits M. ( \byear 2015). \btitle Quantitative convergence rates for subgeometric Markov chains. \bjournal J. Appl. Probab. \bvolume 52 \bpages 391–404. \bdoi 10.1239/jap/1437658605 \bmrnumber 3372082 \endbibitem
- 4[4] {bbook} [author] \bauthor \bsnm Arapostathis, \bfnm Ari \binits A., \bauthor \bsnm Borkar, \bfnm Vivek S. \binits V. S. and \bauthor \bsnm Ghosh, \bfnm Mrinal K. \binits M. K. ( \byear 2012). \btitle Ergodic control of diffusion processes. \bseries Encyclopedia of Mathematics and its Applications \bvolume 143. \bpublisher Cambridge University Press, Cambridge. \bmrnumber 2884272 \endbibitem
- 5[5] {barticle} [author] \bauthor \bsnm Arapostathis, \bfnm A. \binits A., \bauthor \bsnm Hmedi, \bfnm H. \binits H. and \bauthor \bsnm Pang, \bfnm G. \binits G. ( \byear 2020). \btitle On uniform exponential ergodicity of Markovian multiclass many-server queues in the Halfin-Whitt regime. \bjournal Math. Oper. Res. \bnote (to appear). \endbibitem
- 6[6] {bincollection} [author] \bauthor \bsnm Arapostathis, \bfnm A. \binits A., \bauthor \bsnm Hmedi, \bfnm H. \binits H., \bauthor \bsnm Pang, \bfnm G. \binits G. and \bauthor \bsnm Sandrić, \bfnm N. \binits N. ( \byear 2019). \btitle Uniform polynomial rates of convergence for a class of Lévy-driven controlled SD Es arising in multiclass many-server queues. In \bbooktitle Modeling, stochastic control, optimization, and applications. \bseries IMA Vol. Math. Appl. \bvolume 164 \b
- 7[7] {barticle} [author] \bauthor \bsnm Arapostathis, \bfnm Ari \binits A., \bauthor \bsnm Pang, \bfnm Guodong \binits G. and \bauthor \bsnm Sandrić, \bfnm Nikola \binits N. ( \byear 2019). \btitle Ergodicity of a Lévy-driven SDE arising from multiclass many-server queues. \bjournal Ann. Appl. Probab. \bvolume 29 \bpages 1070–1126. \bdoi 10.1214/18-AAP 1430 \bmrnumber 3910024 \endbibitem
- 8[8] {barticle} [author] \bauthor \bsnm Arisawa, \bfnm Mariko \binits M. ( \byear 2009). \btitle Homogenization of a class of integro-differential equations with Lévy operators. \bjournal Comm. Partial Differential Equations \bvolume 34 \bpages 617–624. \bdoi 10.1080/03605300902963518 \bmrnumber 2560294 \endbibitem
