Subexponential upper and lower bounds in Wasserstein distance for Markov processes

Ari Arapostathis, Guodong Pang, and Nikola Sandrić

arXiv:1907.05250·math.PR·February 28, 2022

TL;DR

This paper establishes subexponential and exponential convergence bounds in Wasserstein distance for various Markov processes, providing sharp characterizations of convergence rates under different conditions.

Contribution

It introduces new subexponential bounds for Wasserstein convergence in Markov processes using Foster-Lyapunov conditions and applies these to specific stochastic models.

Findings

1. Subexponential upper and lower convergence bounds in Wasserstein distance for irreducible, aperiodic Markov processes.

2. Exponential ergodicity under asymptotic flatness for Itô processes.

3. Sharp rate characterizations for Langevin tempered diffusions, piecewise Ornstein-Uhlenbeck processes with jumps, and backward recurrence time chains.

Abstract

In this article, relying on Foster-Lyapunov drift conditions, we establish subexponential upper and lower bounds on the rate of convergence in the $L^{p}$ -Wasserstein distance for a class of irreducible and aperiodic Markov processes. We further discuss these results in the context of Markov Lévy-type processes. In the lack of irreducibility and/or aperiodicity properties, we obtain exponential ergodicity in the $L^{p}$ -Wasserstein distance for a class of Itô processes under an asymptotic flatness (uniform dissipativity) assumption. Lastly, applications of these results to specific processes are presented, including Langevin tempered diffusion processes, piecewise Ornstein-Uhlenbeck processes with jumps under constant and stationary Markov controls, and backward recurrence time chains, for which we provide a sharp characterization of the rate of convergence via…

Equations

$$\int_{{\mathbb{T}}}p(t,x,B)\,\upchi(\mathrm{d}t)\,\geq\,\upnu_{\upchi}(B)$$

$$\upmu P_{t}(\mathrm{d}y)=\int_{\mathbb{X}}p(t,x,\mathrm{d}y)\,\upmu(\mathrm{d}x)\,,\qquad\text{and}\qquad\upmu(f)=\int_{\mathbb{X}}f(x)\,\upmu(\mathrm{d}x)$$

$${\mathscr{W}}_{p}(\upmu_{1},\upmu_{2})\,\coloneqq\,\inf_{\Pi\in\mathcal{C}(\upmu_{1},\upmu_{2})}\biggl(\int_{\mathbb{X}\times\mathbb{X}}\bigl(\mathsf{d}(x,y)\bigr)^{p}\,\Pi(\mathrm{d}x,\mathrm{d}y)\biggr)^{\nicefrac{1}{p}}\,,$$

$${\mathbb{E}}_{x}\bigl[{\mathscr{V}}(X(t))\bigr]-{\mathscr{V}}(x)\,\leq\,b\int_{[0,t)}{\mathbb{E}}_{x}\bigl[\mathds{1}_{C}(X(s))\bigr]\,\uptau(\mathrm{d}s)-\int_{[0,t)}{\mathbb{E}}_{x}\bigl[\phi\circ{\mathscr{V}}(X(s))\bigr]\,\uptau(\mathrm{d}s)$$

$$c\,\coloneqq\,\inf_{x\in\mathbb{X}}\,\dfrac{\phi\circ{\mathscr{V}}(x)}{\bigl(1+\mathsf{d}(x,x_{0})\bigr)^{\eta}}\,>\,0$$

$$\left(1\vee\bigl(r(t)\bigr)^{\nicefrac{(\eta-1)}{\eta}}\right)\,{\mathscr{W}}_{1}\bigl(\updelta_{x}P_{t},\uppi\bigr)$$

$$\int_{{\mathbb{T}}}\left(1\vee\bigl(r(t)\bigr)^{\nicefrac{(\eta-1)}{\eta}}\right)\,{\mathscr{W}}_{1}\bigl(\updelta_{x}P_{t},\updelta_{y}P_{t}\bigr)\,\uptau(\mathrm{d}t)$$

$$\left(1\vee\left(t^{\nicefrac{(\eta-p)}{p}}\wedge t^{\nicefrac{(1-p)}{p}}\right)\bigl(r(t)\bigr)^{\nicefrac{(\eta-1)}{p\eta}}\right)\,{\mathscr{W}}_{p}(\updelta_{x}P_{t},\uppi)\,\leq\,\tilde{c}\,\bigl({\mathscr{V}}(x)+\overline{m}_{\eta}\bigr)$$

$$\mathrm{e}^{\gamma t}\,{\mathscr{W}}_{1}\bigl(\updelta_{x}P_{t},\uppi\bigr)\,\leq\,\check{c}\,{\mathscr{V}}(x)\qquad\forall\,(t,x)\in{\mathbb{T}}\times\mathbb{X}\,.$$

$$\bigl(1\vee t^{\nicefrac{\eta}{p}-1}\bigr)\,{\mathscr{W}}_{p}(\updelta_{x}P_{t},\uppi)\,\leq\,\breve{c}\,\bigl({\mathscr{V}}(x)+\overline{m}_{\eta}\bigr)^{\nicefrac{1}{p}}\qquad\forall\,(t,x)\in{\mathbb{T}}\times\mathbb{X}\,.$$

$$\lVert\upmu\rVert_{f}\,\coloneqq\,\sup_{g\in{\mathcal{B}}(\mathbb{X}),\,\lvert g\rvert\leq f}\,\bigl\lvert\upmu(g)\bigr\rvert\,,$$

$$\Lambda(\gamma)\,\coloneqq\,\sup_{k\in{\mathbb{N}}}\,\sup_{0=u_{0}<u_{1}<\dotsb<u_{k-1}<u_{k}=1}\Bigl(\mathsf{d}\bigl(\gamma(u_{0}),\gamma(u_{1})\bigr)+\cdots+\mathsf{d}\bigl(\gamma(u_{k-1}),\gamma(u_{k})\bigr)\Bigr)\,.$$

$$\mathsf{d}(x,y)\,=\,\inf_{\gamma\in C([0,1];\mathbb{X})}\bigl\{\Lambda(\gamma)\,\colon\gamma(0)=x\,,\ \gamma(1)=y\bigr\}\qquad\forall\,x,y\in\mathbb{X}\,.$$

$${\mathscr{V}}(x)\,\geq\,c\,\bigl(L(x)\bigr)^{\theta}\,,\quad\text{and}\quad\phi\circ{\mathscr{V}}(x)\,\geq\,c\,\bigl(L(x)\bigr)^{\vartheta}\qquad\forall\,x\in\mathbb{X}\,.$$

$${\mathscr{W}}_{p}(\updelta_{x}P_{t_{n}},\uppi)\,\geq\,\bar{c}\,\bigl(t_{n}+{\mathscr{V}}(x)\bigr)^{-\frac{\vartheta-p+\varepsilon+\iota}{(\theta-\vartheta-\varepsilon-\iota)p}}\qquad\forall\,n\in{\mathbb{N}}\,.$$

$$\mathcal{L}f(x)\,=\,\bigl\langle b(x),\nabla f(x)\bigr\rangle+\frac{1}{2}\operatorname{Tr}\bigl(a(x)\nabla^{2}f(x)\bigr)+\int_{{\mathbb{R}}^{n}}\mathfrak{d}_{1}f(x;y)\,\upnu(x,\mathrm{d}y)\,,\qquad x\in{\mathbb{R}}^{n}\,.$$

$$\upnu(x,\{0\})\,=\,0\,,\quad\text{and}\quad\int_{{\mathbb{R}}^{n}}\bigl(1\wedge\lvert y\rvert^{2}\bigr)\,\upnu(x,\mathrm{d}y)\,<\,\infty\qquad\forall\,x\in{\mathbb{R}}^{n}\,,$$

$$\mathfrak{d}_{1}f(x;y)\,\coloneqq\,f(x+y)-f(x)-\mathds{1}_{{\mathscr{B}}}(y)\,\langle y,\nabla f(x)\rangle\,,\qquad x,y\in{\mathbb{R}}^{n}\,,\ f\in C^{1}({\mathbb{R}}^{n})\,.$$

$$M_{f}(t)\,\coloneqq\,f\bigl(X(t)\bigr)-f\bigl(X(0)\bigr)-\int_{0}^{t}\mathcal{L}f\bigl(X(s)\bigr)\,\mathrm{d}s\,,\qquad t\geq 0\,,$$

$$q(x,\xi)\,\coloneqq\,-i\langle\xi,b(x)\rangle+\frac{1}{2}\langle\xi,a(x)\xi\rangle+\int_{{\mathbb{R}}^{n}}\bigl(1-\mathrm{e}^{i\langle\xi,y\rangle}+i\langle\xi,y\rangle\mathds{1}_{{\mathscr{B}}}(y)\bigr)\,\upnu(x,\mathrm{d}y)\,,\qquad x,\xi\in{\mathbb{R}}^{n}\,,$$

$$\mathcal{L}f(x)\,=\,-\int_{{\mathbb{R}}^{n}}\mathrm{e}^{i\langle\xi,x\rangle}\,q(x,\xi)\,\hat{f}(\xi)\,\mathrm{d}\xi$$

$$\lim_{\rho\to\infty}\,\sup_{x\in{\mathscr{B}}_{\rho}}\,\sup_{\xi\in{\mathscr{B}}_{\nicefrac{1}{\rho}}}\,\bigl\lvert q(x,\xi)\bigr\rvert\,=\,0\,.$$

$$\lim_{\rho\to\infty}\,\Biggl(\frac{\sup_{x\in{\mathscr{B}}_{\rho}}\lvert b(x)\rvert}{\rho}+\frac{\sup_{x\in{\mathscr{B}}_{\rho}}\lvert a(x)\rvert}{\rho^{2}}+\sup_{x\in{\mathscr{B}}_{\rho}}\,\sup_{\xi\in{\mathscr{B}}_{\nicefrac{1}{\rho}}}\,\int_{{\mathscr{B}}^{c}}\bigl(1-\mathrm{e}^{i\langle\xi,y\rangle}\bigr)\,\upnu(x,\mathrm{d}y)\Biggr)\,=\,0\,.$$

$$\limsup_{\lvert x\rvert\to\infty}\,\sup_{\xi\in{\mathscr{B}}_{\nicefrac{1}{\lvert x\rvert}}}\,\bigl\lvert q(x,\xi)\bigr\rvert\,<\,\infty$$

$$\limsup_{\lvert x\rvert\to\infty}\,\Biggl(\frac{\lvert b(x)\rvert}{\lvert x\rvert}+\frac{\lVert a(x)\rVert}{\lvert x\rvert^{2}}+\frac{\int_{{\mathscr{B}}}\lvert y\rvert^{2}\,\upnu(x,\mathrm{d}y)}{\lvert x\rvert^{2}}+\upnu(x,{\mathscr{B}}^{c})\Biggr)\,<\,\infty\,.$$

$$\lim_{\rho\to\infty}\,\sup_{y\in{\mathscr{B}}_{r}}\,\upnu(y,{\mathscr{B}}_{\rho}^{c})\,=\,0\,,\qquad\lim_{\rho\to 0}\,\sup_{y\in{\mathscr{B}}_{r}}\,\int_{{\mathscr{B}}_{\rho}}\lvert z\rvert^{2}\,\upnu(y,\mathrm{d}z)\,=\,0\,,$$

$$\lim_{y\to x}\,\int_{{\mathbb{R}}^{n}}f(z)\,\upnu(y,\mathrm{d}z)\,=\,\int_{{\mathbb{R}}^{n}}f(z)\,\upnu(x,\mathrm{d}z)\,.$$

$$\lim_{\lvert x\rvert\to\infty}\,\upnu\bigl(x,{\mathscr{B}}_{r}(-x)\bigr)\,=\,0\qquad\forall\,r>0\,,$$

$${\mathscr{V}}_{Q,\zeta}(x)\,\coloneqq\,\bigl(\chi_{Q}(x)\bigr)^{\zeta}\,,\quad\text{and}\quad\widetilde{\mathscr{V}}_{Q,\zeta}(x)\,\coloneqq\,\mathrm{e}^{\zeta\chi_{Q}(x)}\,,\qquad x\in{\mathbb{R}}^{n}\,.$$

$$\Theta_{\upnu}\,\coloneqq\,\left\{\theta\geq 0\,\colon\sup_{x\in{\mathbb{R}}^{n}}\int_{{\mathbb{R}}^{n}}\bigl(\lvert y\rvert^{2}\,\mathds{1}_{{\mathscr{B}}}(y)+\lvert y\rvert^{\theta}\,\mathds{1}_{{\mathscr{B}}^{c}}(y)\bigr)\,\upnu(x,\mathrm{d}y)<\infty\right\}\,,$$

$$\lim_{r\to\infty}\,\sup_{x\in{\mathbb{R}}^{n}}\,\int_{{\mathscr{B}}_{r}^{c}}\lvert y\rvert^{\theta}\,\upnu(x,\mathrm{d}y)\,=\,0$$

Taxonomy

Topics: Markov Chains and Monte Carlo Methods · Stochastic processes and financial applications · Stochastic processes and statistical mechanics

Full text

Subexponential upper and lower bounds in Wasserstein distance for Markov processes

Nikola Sandrić, Ari Arapostathis, and Guodong Pang

University of Zagreb, University of Texas at Austin, and Pennsylvania State University

Department of Mathematics, University of Zagreb, Bijenička cesta 30, 10000 Zagreb, Croatia

Department of Electrical and Computer Engineering, University of Texas at Austin, 2501 Speedway, EER 7.824, Austin, TX 78712

Department of Computational and Applied Mathematics, Rice University, Houston, TX 77005

Abstract

In this article, relying on Foster-Lyapunov drift conditions, we establish subexponential upper and lower bounds on the rate of convergence in the $\mathrm{L}^{p}$ -Wasserstein distance for a class of irreducible and aperiodic Markov processes. We further discuss these results in the context of Markov Lévy-type processes. In the lack of irreducibility and/or aperiodicity properties, we obtain exponential ergodicity in the $\mathrm{L}^{p}$ -Wasserstein distance for a class of Itô processes under an asymptotic flatness (uniform dissipativity) assumption. Lastly, applications of these results to specific processes are presented, including Langevin tempered diffusion processes, piecewise Ornstein–Uhlenbeck processes with jumps under constant and stationary Markov controls, and backward recurrence time chains, for which we provide a sharp characterization of the rate of convergence via matching upper and lower bounds.

MSC classification: 60J05; 60J25; 60H10; 60J75

Keywords: exponential and subexponential ergodicity, Wasserstein distance, Itô process, Foster–Lyapunov condition, asymptotic flatness (uniform dissipativity), Langevin diffusion process, Ornstein–Uhlenbeck process

1 Introduction

One of the classical directions in the analysis of Markov processes centers around their ergodic properties. In this article, we focus on both qualitative and quantitative aspects of this problem. Let $\mathbb{X}$ be a locally compact Polish space, i.e. a locally compact separable completely metrizable topological space. Denote the corresponding metric by $\mathsf{d}$ , and let ${\mathbb{T}}={\mathbb{R}}_{+}$ or $\mathbb{Z}_{+}$ be the time parameter set. We endow $(\mathbb{X},\mathsf{d})$ with its Borel $\sigma$ -algebra $\mathfrak{B}(\mathbb{X})$ . Further, let $\bigl{(}\Omega,\mathcal{F},\{\mathcal{F}_{t}\}_{t\in{\mathbb{T}}},\{\theta_{t}\}_{t\in{\mathbb{T}}},\{X(t)\}_{t\in{\mathbb{T}}},\{{\mathbb{P}}_{x}\}_{x\in\mathbb{X}}\bigr{)}$ , denoted by $\{X(t)\}_{t\in{\mathbb{T}}}$ in the sequel, be a time-homogeneous conservative strong Markov process with càdlàg sample paths (when ${\mathbb{T}}={\mathbb{R}}_{+}$ ) and state space $\bigl{(}\mathbb{X},\mathfrak{B}(\mathbb{X})\bigr{)}$ , in the sense of [10]. Here, $(\Omega,\mathcal{F},{\mathbb{P}}_{x})_{x\in\mathbb{X}}$ is a family of probability spaces and $\{X(t)\}_{t\in{\mathbb{T}}}$ satisfies ${\mathbb{P}}_{x}(X(0)=x)=1$ , $\{\mathcal{F}_{t}\}_{t\in{\mathbb{T}}}$ is a filtration on $(\Omega,\mathcal{F})$ (non-decreasing family of sub- $\sigma$ -algebras of $\mathcal{F}$ ) and $\{\theta_{t}\}_{t\in{\mathbb{T}}}$ is a family of shift operators on $\Omega$ satisfying $X(t)\circ\theta_{s}=X(t+s)$ for all $s,t\in{\mathbb{T}}$ . Recall, $\{X(t)\}_{t\in{\mathbb{T}}}$ is said to be conservative if ${\mathbb{P}}_{x}(X(t)\in\mathbb{X})=1$ for all $t\in{\mathbb{T}}$ and $x\in\mathbb{X}$ . 
In the present article, we present (sharp) sufficient conditions under which $\{X(t)\}_{t\in{\mathbb{T}}}$ admits a unique invariant probability measure $\uppi(\mathrm{d}x)$ , and which ensure that the marginals of $\{X(t)\}_{t\in{\mathbb{T}}}$ converge to $\uppi(\mathrm{d}x)$ , as $t\to\infty$ , in the $\mathrm{L}^{p}$ -Wasserstein distance at exponential and subexponential rates.

1.1 Summary of the results

Before stating the main results of this article, we introduce some notation we need in the sequel. Denote by $p(t,x,\mathrm{d}y)\coloneqq{\mathbb{P}}_{x}(X(t)\in\mathrm{d}y)$ for $t\in{\mathbb{T}}$ and $x\in\mathbb{X}$ , the transition kernel of $\{X(t)\}_{t\in{\mathbb{T}}}$ . We endow ${\mathbb{T}}$ with the standard (Euclidean Borel in the case when ${\mathbb{T}}={\mathbb{R}}_{+}$ , and discrete when ${\mathbb{T}}=\mathbb{Z}_{+}$ ) $\sigma$ -algebra. The process $\{X(t)\}_{t\in{\mathbb{T}}}$ is called

(i)

irreducible if there exists a $\sigma$ -finite measure $\upvarphi(\mathrm{d}x)$ on $\mathfrak{B}(\mathbb{X})$ such that whenever $\upvarphi(B)>0$ we have $\int_{{\mathbb{T}}}p(t,x,B)\,\uptau(\mathrm{d}t)>0$ for all $x\in\mathbb{X}$ , where $\uptau(\mathrm{d}t)$ stands for the Lebesgue measure on ${\mathbb{T}}$ when ${\mathbb{T}}={\mathbb{R}}_{+}$ , and the counting measure when ${\mathbb{T}}=\mathbb{Z}_{+}$ ;

(ii)

transient if it is irreducible, and there exist $\{b_{k}\}_{k\in{\mathbb{N}}}\subset[0,\infty)$ and a covering $\{B_{k}\}_{k\in{\mathbb{N}}}\subseteq\mathfrak{B}(\mathbb{X})$ of $\mathbb{X}$ , such that $\int_{{\mathbb{T}}}p(t,x,B_{k})\,\uptau(\mathrm{d}{t})\leq b_{k}$ for all $x\in\mathbb{X}$ and $k\in{\mathbb{N}}$ ;

(iii)

recurrent if it is irreducible, and $\upvarphi(B)>0$ implies that $\int_{{\mathbb{T}}}p(t,x,B)\,\uptau(\mathrm{d}{t})=\infty$ for all $x\in\mathbb{X}$ ;

(iv)

aperiodic if there exists $t_{0}>0$ such that $\{X_{kt_{0}}\}_{k\in\mathbb{Z}_{+}}$ is irreducible, in the case when ${\mathbb{T}}={\mathbb{R}}_{+}$ ; and there does not exist a partition $\{B_{1},\dots,B_{k}\}\subseteq\mathfrak{B}(\mathbb{X})$ with $k\geq 2$ of $\mathbb{X}$ such that $p(1,x,B_{i+1})=1$ for all $x\in B_{i}$ and all $1\leq i\leq k-1$ , and $p(1,x,B_{1})=1$ for all $x\in B_{k}$ , in the case when ${\mathbb{T}}=\mathbb{Z}_{+}$ .
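For a finite state space in discrete time, condition (iv) can be checked directly. The following is a minimal Python sketch that is not taken from the paper: the helper name `is_cyclic_partition`, the list-of-lists representation of the blocks, and the row-stochastic matrix `P` (with `P[x][y]` standing for $p(1,x,\{y\})$) are assumptions made for illustration. It tests whether a proposed partition $\{B_{1},\dots,B_{k}\}$ is cyclic in the sense above; the chain is aperiodic when no such partition with $k\geq 2$ exists.

```python
def is_cyclic_partition(P, blocks, tol=1e-12):
    """Return True if `blocks` is a cyclic partition for the one-step kernel P.

    P[x][y] stands for p(1, x, {y}) on the finite state space {0, ..., n-1};
    `blocks` is a list of lists of states (a hypothetical representation).
    """
    n = len(P)
    k = len(blocks)
    # The blocks must form a partition of the state space into k >= 2 nonempty parts.
    states = sorted(x for block in blocks for x in block)
    if k < 2 or states != list(range(n)) or any(len(b) == 0 for b in blocks):
        return False
    for i, block in enumerate(blocks):
        nxt = blocks[(i + 1) % k]  # B_{i+1}, with the successor of B_k read as B_1
        for x in block:
            # Require p(1, x, B_{i+1}) = 1 up to floating-point tolerance.
            if abs(sum(P[x][y] for y in nxt) - 1.0) > tol:
                return False
    return True


flip = [[0.0, 1.0], [1.0, 0.0]]  # deterministic alternation between two states
lazy = [[0.5, 0.5], [0.5, 0.5]]  # the chain may stay put at every step

flip_is_cyclic = is_cyclic_partition(flip, [[0], [1]])
lazy_is_cyclic = is_cyclic_partition(lazy, [[0], [1]])
```

The alternating chain admits the cyclic partition $\{\{0\},\{1\}\}$ and is therefore periodic, while the same partition fails for the lazy chain.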

Let us remark that if $\{X(t)\}_{t\in{\mathbb{T}}}$ is irreducible, then it is either transient or recurrent (see [79, Theorem 2.3]). A Borel measure $\uppi(\mathrm{d}x)$ on $\mathbb{X}$ is called invariant for $\{X(t)\}_{t\in{\mathbb{T}}}$ if $\int_{\mathbb{X}}p(t,x,\mathrm{d}y)\,\uppi(\mathrm{d}x)=\uppi(\mathrm{d}y)$ for all $t\in{\mathbb{T}}$ . It is well known that if $\{X(t)\}_{t\in{\mathbb{T}}}$ is recurrent, then it possesses a unique (up to constant multiples) invariant measure (see [79, Theorem 2.6]). If the invariant measure is finite, then it may be normalized to a probability measure. If $\{X(t)\}_{t\in{\mathbb{T}}}$ is recurrent with finite invariant measure, then it is called positive recurrent; otherwise it is called null recurrent. Note that a transient Markov process cannot have a finite invariant measure. A set $C\in\mathfrak{B}(\mathbb{X})$ is called petite for $\{X(t)\}_{t\in{\mathbb{T}}}$ if there exist a probability measure $\upchi(\mathrm{d}t)$ on ${\mathbb{T}}$ and a non-trivial Borel measure $\upnu_{\upchi}(\mathrm{d}x)$ on $\mathbb{X}$ , such that

$$\int_{{\mathbb{T}}}p(t,x,B)\,\upchi(\mathrm{d}t)\,\geq\,\upnu_{\upchi}(B)$$

for all $x\in C$ and $B\in\mathfrak{B}(\mathbb{X})$ . Recall that petite sets play the role of singletons for Markov processes on general state spaces (see [64, Chapter 5] for a detailed discussion). Denote by ${\mathcal{P}}(\mathbb{X})$ the class of all Borel probability measures on $\mathbb{X}$ , and for $f\in{\mathcal{B}}(\mathbb{X})$ (the space of real-valued Borel measurable functions on $\mathbb{X}$ ) let ${\mathcal{P}}_{f}(\mathbb{X})$ denote the class of all $\upmu\in{\mathcal{P}}(\mathbb{X})$ with the property that $\int_{\mathbb{X}}\lvert f(x)\rvert\,\upmu(\mathrm{d}{x})<\infty$ . When $f(x)=\bigl{(}\mathsf{d}(x_{0},x)\bigr{)}^{p}$ for some $p>0$ and $x_{0}\in\mathbb{X}$ , we denote this class by ${\mathcal{P}}_{p}(\mathbb{X})$ . We adopt the usual notation

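$$\upmu P_{t}(\mathrm{d}y)=\int_{\mathbb{X}}p(t,x,\mathrm{d}y)\,\upmu(\mathrm{d}x)\,,\qquad\text{and}\qquad\upmu(f)=\int_{\mathbb{X}}f(x)\,\upmu(\mathrm{d}x)$$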

for $t\in{\mathbb{T}}$ , $x\in\mathbb{X}$ , $\upmu\in{\mathcal{P}}(\mathbb{X})$ and $f\in{\mathcal{B}}(\mathbb{X})$ . Therefore, with $\updelta_{x}$ denoting the Dirac measure concentrated at $x\in\mathbb{X}$ , we have $\updelta_{x}P_{t}(\mathrm{d}y)=p(t,x,\mathrm{d}y)$ . Finally, recall that the $\mathrm{L}^{p}$ -Wasserstein distance on ${\mathcal{P}}_{p}(\mathbb{X})$ with $p\geq 1$ is defined by

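$${\mathscr{W}}_{p}(\upmu_{1},\upmu_{2})\,\coloneqq\,\inf_{\Pi\in\mathcal{C}(\upmu_{1},\upmu_{2})}\biggl(\int_{\mathbb{X}\times\mathbb{X}}\bigl(\mathsf{d}(x,y)\bigr)^{p}\,\Pi(\mathrm{d}x,\mathrm{d}y)\biggr)^{\nicefrac{1}{p}}\,,$$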

where $\mathcal{C}(\upmu_{1},\upmu_{2})$ is the family of couplings of $\upmu_{1}(\mathrm{d}x)$ and $\upmu_{2}(\mathrm{d}x)$ , i.e. $\Pi\in\mathcal{C}(\upmu_{1},\upmu_{2})$ if, and only if, $\Pi(\mathrm{d}x,\mathrm{d}y)$ is a probability measure on $\mathbb{X}\times\mathbb{X}$ having $\upmu_{1}(\mathrm{d}x)$ and $\upmu_{2}(\mathrm{d}x)$ as its marginals. It is well known that $\mathcal{P}_{p}(\mathbb{X})$ is a complete separable metric space under the metric ${\mathscr{W}}_{p}$ [82, Theorem 6.18]. The topology generated by ${\mathscr{W}}_{p}$ on ${\mathcal{P}}_{p}(\mathbb{X})$ is finer than the Prokhorov topology, i.e. the topology of weak convergence.
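As a concrete instance of this definition, take $\mathbb{X}={\mathbb{R}}$ with $\mathsf{d}(x,y)=\lvert x-y\rvert$ and two empirical measures with equally many, equally weighted atoms. In that setting the infimum over couplings is attained by pairing the sorted samples, a standard fact for $p\geq 1$ that is used here without proof. The sketch below assumes exactly this one-dimensional setting; the name `wasserstein_p_1d` is illustrative and does not come from the paper.

```python
def wasserstein_p_1d(xs, ys, p=1.0):
    """W_p between two empirical measures on the real line.

    Assumes both samples have the same size and uniform weights, so that the
    monotone (sorted) pairing is an optimal coupling for p >= 1.
    """
    if p < 1:
        raise ValueError("p must be at least 1")
    if len(xs) != len(ys) or len(xs) == 0:
        raise ValueError("need two non-empty samples of equal size")
    xs, ys = sorted(xs), sorted(ys)
    # Average transport cost |x - y|^p under the sorted pairing.
    cost = sum(abs(x - y) ** p for x, y in zip(xs, ys)) / len(xs)
    return cost ** (1.0 / p)


w1 = wasserstein_p_1d([0.0, 1.0], [2.0, 1.0], p=1.0)
w2 = wasserstein_p_1d([0.0, 0.0], [3.0, 4.0], p=2.0)
```

For general Polish spaces no such closed form is available, and the infimum has to be handled through couplings as in the definition.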

We now state the main results of this article.

Theorem 1.1.

Suppose that $\{X(t)\}_{t\in{\mathbb{T}}}$ is irreducible and aperiodic, and there exist a continuous ${\mathscr{V}}\colon\mathbb{X}\to[1,\infty)$ , a constant $b>0$ , a nondecreasing differentiable concave function $\phi\colon[1,\infty)\to(0,\infty)$ , and a (topologically) closed petite set $C\subseteq\mathbb{X}$ such that

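$${\mathbb{E}}_{x}\bigl[{\mathscr{V}}(X(t))\bigr]-{\mathscr{V}}(x)\,\leq\,b\int_{[0,t)}{\mathbb{E}}_{x}\bigl[\mathds{1}_{C}(X(s))\bigr]\,\uptau(\mathrm{d}s)-\int_{[0,t)}{\mathbb{E}}_{x}\bigl[\phi\circ{\mathscr{V}}(X(s))\bigr]\,\uptau(\mathrm{d}s)$$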

for all $(t,x)\in{\mathbb{T}}\times\mathbb{X}$ . Assume further that $\sup_{x\in C}{\mathscr{V}}(x)<\infty$ , and

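$$c\,\coloneqq\,\inf_{x\in\mathbb{X}}\,\dfrac{\phi\circ{\mathscr{V}}(x)}{\bigl(1+\mathsf{d}(x,x_{0})\bigr)^{\eta}}\,>\,0$$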

for some $\eta\geq 1$ and some (and therefore any) $x_{0}\in\mathbb{X}$ . Then $\{X(t)\}_{t\in{\mathbb{T}}}$ admits a unique invariant $\uppi\in\mathcal{P}_{\phi\circ{\mathscr{V}}}(\mathbb{X})$ . In addition, with $\Phi(t)\coloneqq\int_{1}^{t}\frac{\mathrm{d}{s}}{\phi(s)}$ and $r(t)\coloneqq\phi\circ\Phi^{-1}(t)$ , the following hold.

(i)

If $\displaystyle\lim_{t\to\infty}\phi^{\prime}(t)=0$ , then for some $\bar{c}>0$ we have

[TABLE]

(ii)

If $\displaystyle\lim_{t\to\infty}\phi^{\prime}(t)=0$ , then for any $p\in[1,\eta]$ there exists $\tilde{c}>0$ such that

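$$\left(1\vee\left(t^{\nicefrac{(\eta-p)}{p}}\wedge t^{\nicefrac{(1-p)}{p}}\right)\bigl(r(t)\bigr)^{\nicefrac{(\eta-1)}{p\eta}}\right)\,{\mathscr{W}}_{p}(\updelta_{x}P_{t},\uppi)\,\leq\,\tilde{c}\,\bigl({\mathscr{V}}(x)+\overline{m}_{\eta}\bigr)$$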

for all $(t,x)\in{\mathbb{T}}\times\mathbb{X}$ , where $\overline{m}_{\eta}=\uppi\bigl{(}\bigl{(}\mathsf{d}(x_{0},\cdot\,)\bigr{)}^{\eta}\bigr{)}$ .

(iii)

If $\phi(t)=\hat{c}\,t$ for some $\hat{c}>0$ , then there exist $\check{c}>0$ and $\gamma>0$ , such that

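$$\mathrm{e}^{\gamma t}\,{\mathscr{W}}_{1}\bigl(\updelta_{x}P_{t},\uppi\bigr)\,\leq\,\check{c}\,{\mathscr{V}}(x)\qquad\forall\,(t,x)\in{\mathbb{T}}\times\mathbb{X}\,.$$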

In addition, for any $p\in[1,\eta]$ there exists $\breve{c}>0$ such that

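$$\bigl(1\vee t^{\nicefrac{\eta}{p}-1}\bigr)\,{\mathscr{W}}_{p}(\updelta_{x}P_{t},\uppi)\,\leq\,\breve{c}\,\bigl({\mathscr{V}}(x)+\overline{m}_{\eta}\bigr)^{\nicefrac{1}{p}}\qquad\forall\,(t,x)\in{\mathbb{T}}\times\mathbb{X}\,.$$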
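To make the rate function $r(t)=\phi\circ\Phi^{-1}(t)$ of Theorem 1.1 concrete, consider the hypothetical choice $\phi(t)=t^{\alpha}$ with $\alpha\in(0,1)$, which is nondecreasing, differentiable and concave on $[1,\infty)$ and satisfies $\lim_{t\to\infty}\phi^{\prime}(t)=0$. A direct computation gives $\Phi(t)=\frac{t^{1-\alpha}-1}{1-\alpha}$, hence $\Phi^{-1}(u)=\bigl(1+(1-\alpha)u\bigr)^{\nicefrac{1}{(1-\alpha)}}$ and $r(t)=\bigl(1+(1-\alpha)t\bigr)^{\nicefrac{\alpha}{(1-\alpha)}}$, a polynomial rate. The Python sketch below (all function names are illustrative assumptions) cross-checks this closed form against a numerical inversion of $\Phi$.

```python
def Phi(t, alpha, steps=4000):
    """Midpoint-rule approximation of the integral of s**(-alpha) over [1, t]."""
    h = (t - 1.0) / steps
    return sum(h / (1.0 + (k + 0.5) * h) ** alpha for k in range(steps))


def r_closed_form(t, alpha):
    """r(t) = phi(Phi^{-1}(t)) for phi(s) = s**alpha with 0 < alpha < 1."""
    return (1.0 + (1.0 - alpha) * t) ** (alpha / (1.0 - alpha))


def r_numeric(t, alpha):
    """Invert Phi by bisection and then apply phi; used only as a sanity check."""
    lo, hi = 1.0, 2.0
    while Phi(hi, alpha) < t:  # bracket Phi^{-1}(t) from above
        hi *= 2.0
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if Phi(mid, alpha) < t:
            lo = mid
        else:
            hi = mid
    return (0.5 * (lo + hi)) ** alpha


closed = r_closed_form(5.0, 0.5)
numeric = r_numeric(5.0, 0.5)
```

For $\alpha=\nicefrac{1}{2}$ the closed form reduces to $r(t)=1+\nicefrac{t}{2}$, so the two values above should agree up to quadrature error.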

The results in Theorem 1.1 should be compared to equations (2.3) and (2.5) in [12, Theorems 2.1 and 2.4] (see also [28, Theorem 3 (ii)] and [53, Chapter 4]). The underlying metric $\mathsf{d}$ is assumed to be bounded in [12]. The starting point is a Foster-Lyapunov condition of the form in Eq. 1.1, and the irreducibility and aperiodicity assumptions are replaced by a closely related structural property: the metric $\mathsf{d}$ is contracting, and the sublevel sets of $(x,y)\mapsto{\mathscr{V}}(x)+{\mathscr{V}}(y)$ are $\mathsf{d}$ -small (see (3) and (4) in [12, Theorems 2.1 and 2.4]). Then an analogous estimate to Eq. 1.3 holds for the corresponding ${\mathscr{W}}_{1}$ -distance. Observe that when $\mathsf{d}$ is bounded, the relation in Eq. 1.1 trivially holds for any $\eta\geq 0$ . Provided $\{X(t)\}_{t\in{\mathbb{T}}}$ is irreducible and aperiodic, this gives an analogous result to the one obtained in [12, Theorems 2.1 and 2.4] (in ${\mathscr{W}}_{1}$ -distance) without assuming either contraction properties of $\mathsf{d}$ or $\mathsf{d}$ -smallness of the sublevel sets of $(x,y)\mapsto{\mathscr{V}}(x)+{\mathscr{V}}(y)$ . The proof of Theorem 1.1 relies on [24, Theorem 3.2] and [25, Theorem 2.8], where, under the assumptions of Theorem 1.1, the authors show ergodicity of $\{X(t)\}_{t\in{\mathbb{T}}}$ in the $f$ -norm with rate $\Psi_{1}\circ r(t)$ and $f(x)=\Psi_{2}\circ\phi\circ{\mathscr{V}}(x)\vee 1$ , for any pair $(\Psi_{1}^{-1},\Psi_{2}^{-1})$ of Young’s functions. Recall, for a signed Borel measure $\upmu(\mathrm{d}x)$ on $\mathbb{X}$ and a function $f\colon\mathbb{X}\to[1,\infty]$ the so-called $f$ -norm of $\upmu(\mathrm{d}x)$ is defined as

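$$\lVert\upmu\rVert_{f}\,\coloneqq\,\sup_{g\in{\mathcal{B}}(\mathbb{X}),\,\lvert g\rvert\leq f}\,\bigl\lvert\upmu(g)\bigr\rvert\,,$$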

generalizing the usual total variation norm $\lVert\upmu\rVert_{\mathrm{TV}}\coloneqq\sup_{g\in{\mathcal{B}}(\mathbb{X}),\,\lvert g\rvert\leq 1}\,\bigl{\lvert}\upmu(g)\bigr{\rvert}$ . We remark here that convergence in the $f$ -norm does not in general imply convergence in the ${\mathscr{W}}_{p}$ -distance, and vice versa (see Section 3 for examples of such Markov processes).
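On a finite state space the supremum in the $f$ -norm is attained at $g=f\cdot\operatorname{sign}(\upmu)$, so that $\lVert\upmu\rVert_{f}=\sum_{x}f(x)\,\lvert\upmu(\{x\})\rvert$ whenever $f$ is finite. The short Python sketch below illustrates this under the assumption of a finite state space; the dictionary representation of the signed measure and the name `f_norm` are hypothetical.

```python
def f_norm(mu, f):
    """f-norm of a signed measure on a finite set.

    mu: dict mapping each state to its signed mass (assumed representation);
    f:  function on states with values in [1, infinity).
    """
    total = 0.0
    for x, mass in mu.items():
        weight = f(x)
        if weight < 1:
            raise ValueError("f must take values in [1, infinity)")
        # The optimal test function is g = f * sign(mu), contributing f(x) * |mu({x})|.
        total += weight * abs(mass)
    return total


# Difference of two probability measures on {0, 1}.
diff = {0: 0.5, 1: -0.5}
tv = f_norm(diff, lambda x: 1.0)            # total variation norm (f = 1)
weighted = f_norm(diff, lambda x: 1.0 + x)  # f(x) = 1 + x
```

With the convention used here, the total variation norm of a difference of two probability measures takes values in $[0,2]$.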

In the following theorem we establish a lower bound for ${\mathscr{W}}_{p}$ -convergence, which matches the upper bounds obtained in Eqs. 1.3 and 1.5. For $\gamma\in C([0,1];\mathbb{X})$ (the space of continuous mappings from $[0,1]$ to $\mathbb{X}$ ) let

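$$\Lambda(\gamma)\,\coloneqq\,\sup_{k\in{\mathbb{N}}}\,\sup_{0=u_{0}<u_{1}<\dotsb<u_{k-1}<u_{k}=1}\Bigl(\mathsf{d}\bigl(\gamma(u_{0}),\gamma(u_{1})\bigr)+\cdots+\mathsf{d}\bigl(\gamma(u_{k-1}),\gamma(u_{k})\bigr)\Bigr)\,.$$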

The space $\mathbb{X}$ is called a length space if

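$$\mathsf{d}(x,y)\,=\,\inf_{\gamma\in C([0,1];\mathbb{X})}\bigl\{\Lambda(\gamma)\,\colon\gamma(0)=x\,,\ \gamma(1)=y\bigr\}\qquad\forall\,x,y\in\mathbb{X}\,.$$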

Theorem 1.2.

Assume that $\mathbb{X}$ is a length space, $\{X(t)\}_{t\in{\mathbb{T}}}$ satisfies Eq. 1.1, and there exist a Lipschitz continuous function $L\colon\mathbb{X}\to[0,\infty)$ and constants $\theta>\vartheta\geq 1$ and $c>0$ , such that

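$${\mathscr{V}}(x)\,\geq\,c\,\bigl(L(x)\bigr)^{\theta}\,,\quad\text{and}\quad\phi\circ{\mathscr{V}}(x)\,\geq\,c\,\bigl(L(x)\bigr)^{\vartheta}\qquad\forall\,x\in\mathbb{X}\,.$$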

In addition, suppose that $\{X(t)\}_{t\in{\mathbb{T}}}$ admits an invariant $\uppi\in{\mathcal{P}}(\mathbb{X})$ such that $\int_{\mathbb{X}}\bigl{(}L(x)\bigr{)}^{\vartheta+\varepsilon}\,\uppi(\mathrm{d}x)=\infty$ for some $\varepsilon\in(0,\theta-\vartheta)$ . Then, for each $p\in[1,\vartheta]$ , $\iota\in(0,\theta-\vartheta-\varepsilon)$ and $x\in\mathbb{X}$ , there exist a constant $\bar{c}>0$ and a diverging increasing sequence $\{t_{n}\}_{n\in{\mathbb{N}}}\subseteq{\mathbb{T}}$ , depending on these parameters, such that

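$${\mathscr{W}}_{p}(\updelta_{x}P_{t_{n}},\uppi)\,\geq\,\bar{c}\,\bigl(t_{n}+{\mathscr{V}}(x)\bigr)^{-\frac{\vartheta-p+\varepsilon+\iota}{(\theta-\vartheta-\varepsilon-\iota)p}}\qquad\forall\,n\in{\mathbb{N}}\,.$$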

Note that the parameters $\theta$ , $\vartheta$ , $\varepsilon$ , $p$ and $\iota$ are such that the exponent in the above expression is always strictly negative. Lower bounds for the convergence in the total variation norm are discussed in [38, Theorem 5.1 and Corollary 5.2]. Applications of Theorem 1.2 are discussed in Section 3.
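The sign of the exponent in the lower bound of Theorem 1.2, namely $-\frac{\vartheta-p+\varepsilon+\iota}{(\theta-\vartheta-\varepsilon-\iota)p}$, can be read off from the parameter constraints: $p\leq\vartheta$ and $\varepsilon,\iota>0$ make the numerator positive, while $\iota<\theta-\vartheta-\varepsilon$ makes the denominator positive. The following Python sketch simply evaluates this exponent after checking the constraints stated in the theorem; the function name and the sample parameter values are assumptions chosen for illustration.

```python
def lower_bound_exponent(theta, vartheta, eps, iota, p):
    """Exponent of (t_n + V(x)) in the lower bound of Theorem 1.2."""
    if not (theta > vartheta >= 1):
        raise ValueError("need theta > vartheta >= 1")
    if not (0 < eps < theta - vartheta):
        raise ValueError("need eps in (0, theta - vartheta)")
    if not (0 < iota < theta - vartheta - eps):
        raise ValueError("need iota in (0, theta - vartheta - eps)")
    if not (1 <= p <= vartheta):
        raise ValueError("need p in [1, vartheta]")
    numerator = vartheta - p + eps + iota              # positive since p <= vartheta
    denominator = (theta - vartheta - eps - iota) * p  # positive by the bound on iota
    return -numerator / denominator


exponent_p1 = lower_bound_exponent(4.0, 2.0, 0.5, 0.5, 1.0)
exponent_p2 = lower_bound_exponent(4.0, 2.0, 0.5, 0.5, 2.0)
```

In this sample, moving from $p=1$ to $p=2$ changes the exponent from $-2$ to $-\nicefrac{1}{2}$.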

1.2 Ergodicity of a class of Lévy-type processes

Here, we discuss ergodic properties of a class of Markov processes on the Euclidean space ${\mathbb{R}}^{n}$ (endowed with the standard Euclidean metric) generated by a (Lévy-type) operator $\mathcal{L}\colon\mathcal{D}(\mathcal{L})\subseteq{\mathcal{B}}({\mathbb{R}}^{n})\to{\mathcal{B}}({\mathbb{R}}^{n})$ given by

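$$\mathcal{L}f(x)\,=\,\bigl\langle b(x),\nabla f(x)\bigr\rangle+\frac{1}{2}\operatorname{Tr}\bigl(a(x)\nabla^{2}f(x)\bigr)+\int_{{\mathbb{R}}^{n}}\mathfrak{d}_{1}f(x;y)\,\upnu(x,\mathrm{d}y)\,,\qquad x\in{\mathbb{R}}^{n}\,.$$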

Here, $b=(b_{i})_{i=1,\dots,n}\colon{\mathbb{R}}^{n}\to{\mathbb{R}}^{n}$ is Borel measurable, $a=(a_{ij})_{1\leq i,j\leq n}\colon{\mathbb{R}}^{n}\to{\mathbb{R}}^{n\times n}$ is a symmetric non-negative definite $n\times n$ matrix-valued Borel measurable function, $\upnu(x,\mathrm{d}y)$ is a nonnegative Borel kernel on ${\mathbb{R}}^{n}\times\mathfrak{B}({\mathbb{R}}^{n})$ , called the Lévy kernel, satisfying

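$$\upnu(x,\{0\})\,=\,0\,,\quad\text{and}\quad\int_{{\mathbb{R}}^{n}}\bigl(1\wedge\lvert y\rvert^{2}\bigr)\,\upnu(x,\mathrm{d}y)\,<\,\infty\qquad\forall\,x\in{\mathbb{R}}^{n}\,,$$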

and

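$$\mathfrak{d}_{1}f(x;y)\,\coloneqq\,f(x+y)-f(x)-\mathds{1}_{{\mathscr{B}}}(y)\,\langle y,\nabla f(x)\rangle\,,\qquad x,y\in{\mathbb{R}}^{n}\,,\ f\in C^{1}({\mathbb{R}}^{n})\,.$$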

The symbol $\mathcal{D}(\mathcal{L})$ stands for the domain of $\mathcal{L}$ , i.e. the set of functions $f\in{\mathcal{B}}({\mathbb{R}}^{n})$ for which Eq. 1.10 is well defined, $\langle\cdot,\cdot\rangle$ and $\lvert\cdot\rvert$ denote the standard inner product and the corresponding Euclidean norm on ${\mathbb{R}}^{n}$ , $\operatorname{Tr}M$ stands for the trace of a square matrix $M$ , and $\nabla^{2}f(x)$ denotes the Hessian of $f\in C^{2}({\mathbb{R}}^{n})$ . An open (resp. closed) ball of radius $r>0$ centered at $x$ is denoted by ${\mathscr{B}}_{r}(x)$ (resp. $\overline{\mathscr{B}}_{r}(x)$ ). If $x=0$ , we write ${\mathscr{B}}_{r}$ (resp. $\overline{\mathscr{B}}_{r}$ ), and the unit open (resp. closed) ball centered at $0$ is denoted by ${\mathscr{B}}$ (resp. $\overline{\mathscr{B}}$ ). Observe that $C_{b}^{2}({\mathbb{R}}^{n})\subseteq\mathcal{D}(\mathcal{L})$ , where $C_{b}^{k}({\mathbb{R}}^{n})$ , $k\geq 0$ , denotes the space of $k$ times differentiable functions such that all derivatives up to order $k$ are bounded. We also denote by $\lVert M\rVert\coloneqq\bigl{(}\operatorname{Tr}MM^{\prime}\bigr{)}^{\nicefrac{{1}}{{2}}}$ the Hilbert–Schmidt norm of a matrix $M$ , where $M^{\prime}$ stands for the transpose of $M$ .

We introduce the following assumption:

(MP)

There exists a conservative strong Markov process ${\{X(t)\}_{t\geq 0}}$ with càdlàg sample paths such that

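$$M_{f}(t)\,\coloneqq\,f\bigl(X(t)\bigr)-f\bigl(X(0)\bigr)-\int_{0}^{t}\mathcal{L}f\bigl(X(s)\bigr)\,\mathrm{d}s\,,\qquad t\geq 0\,,$$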

is a ${\mathbb{P}}_{x}$ -martingale (with respect to $\{\mathcal{F}_{t}\}_{t\geq 0}$ ) for any $f\in C_{c}^{\infty}({\mathbb{R}}^{n})$ (the space of smooth functions with compact support).

Define

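$$q(x,\xi)\,\coloneqq\,-i\langle\xi,b(x)\rangle+\frac{1}{2}\langle\xi,a(x)\xi\rangle+\int_{{\mathbb{R}}^{n}}\bigl(1-\mathrm{e}^{i\langle\xi,y\rangle}+i\langle\xi,y\rangle\mathds{1}_{{\mathscr{B}}}(y)\bigr)\,\upnu(x,\mathrm{d}y)\,,\qquad x,\xi\in{\mathbb{R}}^{n}\,,$$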

and observe that

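$$\mathcal{L}f(x)\,=\,-\int_{{\mathbb{R}}^{n}}\mathrm{e}^{i\langle\xi,x\rangle}\,q(x,\xi)\,\hat{f}(\xi)\,\mathrm{d}\xi$$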

for all $x\in{\mathbb{R}}^{n}$ and $f\in C_{c}^{\infty}({\mathbb{R}}^{n}),$ where $\hat{f}(\xi)\coloneqq(2\pi)^{-n}\int_{{\mathbb{R}}^{n}}\mathrm{e}^{-i\langle\xi,x\rangle}f(x)\,\mathrm{d}x$ denotes the Fourier transform of $f(x)$ . In other words, $\mathcal{L}$ is a pseudo-differential operator with symbol $q(x,\xi)$ . According to [52, Theorem 1.1], (MP) is satisfied if

(LB)

The functions $b(x)$ , $a(x)$ , and $x\mapsto\int_{{\mathbb{R}}^{n}}\bigl{(}1\wedge|y|^{2}\bigr{)}\,\upnu(x,\mathrm{d}y)$ are locally bounded.

(SG)

$x\mapsto q(x,\xi)$ is continuous for all $\xi\in{\mathbb{R}}^{n}$ , and $q(x,\xi)$ is locally uniformly continuous at $\xi=0$ , i.e.

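$$\lim_{\rho\to\infty}\,\sup_{x\in{\mathscr{B}}_{\rho}}\,\sup_{\xi\in{\mathscr{B}}_{\nicefrac{1}{\rho}}}\,\bigl\lvert q(x,\xi)\bigr\rvert\,=\,0\,.$$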

Observe that the second condition in (SG) essentially means that the coefficients $b(x)$ , $a(x)$ , and $\upnu(x,\mathrm{d}y)$ have sublinear growth. Namely, it is satisfied if

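$$\lim_{\rho\to\infty}\,\Biggl(\frac{\sup_{x\in{\mathscr{B}}_{\rho}}\lvert b(x)\rvert}{\rho}+\frac{\sup_{x\in{\mathscr{B}}_{\rho}}\lvert a(x)\rvert}{\rho^{2}}+\sup_{x\in{\mathscr{B}}_{\rho}}\,\sup_{\xi\in{\mathscr{B}}_{\nicefrac{1}{\rho}}}\,\int_{{\mathscr{B}}^{c}}\bigl(1-\mathrm{e}^{i\langle\xi,y\rangle}\bigr)\,\upnu(x,\mathrm{d}y)\Biggr)\,=\,0\,.$$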

In order to allow linear growth of the coefficients, we replace (LB) and (SG) by

(LG)

$\mathcal{L}\bigl{(}C_{c}^{\infty}({\mathbb{R}}^{n})\bigr{)}\subseteq C_{\infty}({\mathbb{R}}^{n})$ , $x\mapsto q(x,\xi)$ is continuous for all $\xi\in{\mathbb{R}}^{n}$ , and

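$$\limsup_{\lvert x\rvert\to\infty}\,\sup_{\xi\in{\mathscr{B}}_{\nicefrac{1}{\lvert x\rvert}}}\,\bigl\lvert q(x,\xi)\bigr\rvert\,<\,\infty$$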

(see [50, Corollary 3.2]).

Here, $C_{\infty}({\mathbb{R}}^{n})$ stands for the space of continuous functions vanishing at infinity. Clearly, the last condition in (LG) follows from

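$$\limsup_{\lvert x\rvert\to\infty}\,\Biggl(\frac{\lvert b(x)\rvert}{\lvert x\rvert}+\frac{\lVert a(x)\rVert}{\lvert x\rvert^{2}}+\frac{\int_{{\mathscr{B}}}\lvert y\rvert^{2}\,\upnu(x,\mathrm{d}y)}{\lvert x\rvert^{2}}+\upnu(x,{\mathscr{B}}^{c})\Biggr)\,<\,\infty\,.$$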

Let us also remark that due to [51, Theorem A1] the map $x\mapsto q(x,\xi)$ is continuous for all $\xi\in{\mathbb{R}}^{n}$ if $b(x)$ and $a(x)$ are continuous, and for any $r>0$ , $x\in{\mathbb{R}}^{n}$ and $f\in C_{c}({\mathbb{R}}^{n}\setminus\{0\})$ ,

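$$\lim_{\rho\to\infty}\,\sup_{y\in{\mathscr{B}}_{r}}\,\upnu(y,{\mathscr{B}}_{\rho}^{c})\,=\,0\,,\qquad\lim_{\rho\to 0}\,\sup_{y\in{\mathscr{B}}_{r}}\,\int_{{\mathscr{B}}_{\rho}}\lvert z\rvert^{2}\,\upnu(y,\mathrm{d}z)\,=\,0\,,$$

$$\lim_{y\to x}\,\int_{{\mathbb{R}}^{n}}f(z)\,\upnu(y,\mathrm{d}z)\,=\,\int_{{\mathbb{R}}^{n}}f(z)\,\upnu(x,\mathrm{d}z)\,.$$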

Furthermore, under the continuity of $x\mapsto q(x,\xi)$ (for all $\xi\in{\mathbb{R}}^{n}$ ) in the same reference it has been shown that $\mathcal{L}\bigl{(}C_{c}^{\infty}({\mathbb{R}}^{n})\bigr{)}\subseteq C_{b}({\mathbb{R}}^{n})$ . In addition, if

$$\lim_{\lvert x\rvert\to\infty}\,\upnu\bigl(x,{\mathscr{B}}_{r}(-x)\bigr)\,=\,0\qquad\forall\,r>0\,,$$

we easily see that $\mathcal{L}\bigl{(}C_{c}^{\infty}({\mathbb{R}}^{n})\bigr{)}\subseteq C_{\infty}({\mathbb{R}}^{n})$ .
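The relation between $\mathcal{L}$ and its symbol can be checked on a simple example. Applying $\mathcal{L}$ to $e_{\xi}(y)=\mathrm{e}^{i\langle\xi,y\rangle}$ formally gives $\mathcal{L}e_{\xi}(x)=-q(x,\xi)\,e_{\xi}(x)$. The Python sketch below verifies this numerically in dimension one under assumptions that are not taken from the paper: no jump part ( $\upnu\equiv 0$ ), a linear drift $b(x)=-x$ and a constant diffusion coefficient $a(x)=1$. Derivatives are approximated by central differences, and all function names are illustrative.

```python
import cmath


def symbol_q(x, xi, b, a):
    """Symbol q(x, xi) = -i*xi*b(x) + a(x)*xi^2/2 in one dimension with no jumps."""
    return -1j * xi * b(x) + 0.5 * a(x) * xi ** 2


def symbol_from_generator(x, xi, b, a, h=1e-4):
    """Approximate -e^{-i xi x} (L e_xi)(x), where L f = b f' + (a/2) f''."""
    e = lambda y: cmath.exp(1j * xi * y)
    d1 = (e(x + h) - e(x - h)) / (2 * h)            # central difference for f'
    d2 = (e(x + h) - 2 * e(x) + e(x - h)) / h ** 2  # central difference for f''
    Lf = b(x) * d1 + 0.5 * a(x) * d2
    return -Lf / e(x)


b = lambda x: -x   # assumed linear (Ornstein-Uhlenbeck type) drift
a = lambda x: 1.0  # assumed constant diffusion coefficient

exact = symbol_q(0.7, 1.3, b, a)
approx = symbol_from_generator(0.7, 1.3, b, a)
```

Here `exact` equals $0.845+0.91\,i$, and `approx` agrees with it up to discretization and rounding error.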

Definition 1.1.

Let $\mathcal{M}_{+}$ denote the class of positive definite matrices in ${\mathbb{R}}^{n\times n}$ . For $Q\in\mathcal{M}_{+}$ , let $\lvert x\rvert_{Q}\coloneqq\langle x,Qx\rangle^{\nicefrac{{1}}{{2}}}$ for $x\in{\mathbb{R}}^{n}$ , and let $\chi_{Q}\in C^{\infty}({\mathbb{R}}^{n})$ be some nonnegative, symmetric convex function such that $\chi_{Q}(x)=\lvert x\rvert_{Q}$ for $x\in{\mathscr{B}}^{c}$ . For $Q\in\mathcal{M}_{+}$ and $\zeta>0$ , we define

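$${\mathscr{V}}_{Q,\zeta}(x)\,\coloneqq\,\bigl(\chi_{Q}(x)\bigr)^{\zeta}\,,\quad\text{and}\quad\widetilde{\mathscr{V}}_{Q,\zeta}(x)\,\coloneqq\,\mathrm{e}^{\zeta\chi_{Q}(x)}\,,\qquad x\in{\mathbb{R}}^{n}\,.$$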

Further, let

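$$\Theta_{\upnu}\,\coloneqq\,\left\{\theta\geq 0\,\colon\sup_{x\in{\mathbb{R}}^{n}}\int_{{\mathbb{R}}^{n}}\bigl(\lvert y\rvert^{2}\,\mathds{1}_{{\mathscr{B}}}(y)+\lvert y\rvert^{\theta}\,\mathds{1}_{{\mathscr{B}}^{c}}(y)\bigr)\,\upnu(x,\mathrm{d}y)<\infty\right\}\,,$$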

and when $\Theta_{\upnu}\neq\emptyset$ , let $\theta_{\upnu}\coloneqq\sup\Theta_{\upnu}$ .

We now discuss ergodic properties of the Lévy-type process ${\{X(t)\}_{t\geq 0}}$ .

Theorem 1.3.

Assume (LB) and (MP), and suppose that ${\{X(t)\}_{t\geq 0}}$ is irreducible and aperiodic, and that every compact set is petite for ${\{X(t)\}_{t\geq 0}}$ . Then the following hold.

(i)

If $\theta_{\upnu}>0$ ,

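$$\lim_{r\to\infty}\,\sup_{x\in{\mathbb{R}}^{n}}\,\int_{{\mathscr{B}}_{r}^{c}}\lvert y\rvert^{\theta}\,\upnu(x,\mathrm{d}y)\,=\,0$$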

for some $\theta\in(0,\theta_{\upnu}]\cap\Theta_{\upnu}$ , and there exist $Q\in\mathcal{M}_{+}$ and $\vartheta\in[0\vee(2-\theta),2)$ such that

[TABLE]

then ${\{X(t)\}_{t\geq 0}}$ admits a unique invariant $\uppi\in\mathcal{P}_{\theta-2+\vartheta}({\mathbb{R}}^{n})$ . In addition, if $\theta-3+\vartheta\geq 0$ , then Theorem 1.1 (i) and (ii) hold with ${\mathscr{V}}(x)={\mathscr{V}}_{Q,\theta}(x)+1$ , $\phi(t)=t^{\nicefrac{{(\theta-2+\vartheta)}}{{\theta}}}$ and $\eta=\theta-2+\vartheta$ .

(ii)

If $\theta_{\upnu}>0$ , Eq. 1.13 holds for some $\theta\in(0,\theta_{\upnu}]\cap\Theta_{\upnu}$ , and there exists $Q\in\mathcal{M}_{+}$ such that

[TABLE]

then ${\{X(t)\}_{t\geq 0}}$ admits a unique invariant $\uppi\in\mathcal{P}_{\theta}({\mathbb{R}}^{n})$ . In addition, if $\theta\geq 1$ , then the conclusion of Theorem 1.1 (iii) holds with ${\mathscr{V}}(x)={\mathscr{V}}_{Q,\theta}(x)+1$ and $\eta=\theta$ .

(iii)

Suppose that $a(x)$ is bounded, and there exist $\theta>0$ and $Q\in{\mathcal{M}}_{+}$ , such that

[TABLE]

and

[TABLE]

Then the conclusion of Theorem 1.1 (iii) holds with ${\mathscr{V}}(x)=\widetilde{\mathscr{V}}_{Q,\zeta}(x)$ for any $\zeta>0$ sufficiently small and any $\eta\geq 1$ .

Irreducibility and aperiodicity are crucial structural properties of the underlying process in Theorems 1.1, 1.2 and 1.3. Roughly speaking, they ensure that the process does not show singular behavior in its motion, and together with the Foster-Lyapunov condition in Eq. 1.1 (which ensures controllability of the $\phi\circ\Phi^{-1}$ -modulated moment of return-times to the petite set $C$ , see [24, Theorem 4.1]) they lead to the ergodic properties stated.

Under an asymptotic flatness (uniform dissipativity) property (see Eq. 1.16), we use a completely different approach to this problem, the so-called synchronous coupling method (see [14, Example 2.16] for details), to obtain ergodic properties for a class of Itô processes which are not necessarily irreducible and aperiodic. Recall that an Itô process is a solution to a stochastic differential equation (SDE) of the following form

[TABLE]

where $b\colon{\mathbb{R}}^{n}\to{\mathbb{R}}^{n}$ , $\sigma\colon{\mathbb{R}}^{n}\to{\mathbb{R}}^{n\times n}$ and $k\colon{\mathbb{R}}^{n}\times{\mathbb{R}}\to{\mathbb{R}}^{n}$ are Borel measurable, ${\{B(t)\}_{t\geq 0}}$ is a standard $n$ -dimensional Brownian motion, and $\nu_{p}(\mathrm{d}v,\mathrm{d}s)$ is a Poisson random measure on $\mathfrak{B}({\mathbb{R}})\otimes\mathfrak{B}\bigl{(}[0,\infty)\bigr{)}$ , with intensity measure $\nu(\mathrm{d}v)\,\mathrm{d}s$ (a $\sigma$ -finite measure on $\mathfrak{B}({\mathbb{R}})\otimes\mathfrak{B}({\mathbb{R}})$ ). According to [13, Theorem 3.33], every Itô process is a semimartingale Hunt process. In particular, it is a conservative strong Markov process with càdlàg sample paths. Conversely, again by [13, Theorem 3.33], for every $n$ -dimensional semimartingale Hunt process ${\{X(t)\}_{t\geq 0}}$ , and every $\sigma$ -finite nonfinite and nonatomic measure $\nu(\mathrm{d}v)$ on $\mathfrak{B}({\mathbb{R}})$ , there exist $b(x)$ , $\sigma(x)$ , $k(x,v)$ , ${\{B(t)\}_{t\geq 0}}$ , and $\nu_{p}(\mathrm{d}v,\mathrm{d}s)$ as above (possibly defined on an enlargement of the initial stochastic basis), such that ${\{X(t)\}_{t\geq 0}}$ satisfies Eq. 1.15. By setting

[TABLE]

and

[TABLE]

Eq. 1.15 reads as

[TABLE]

Set $a(x)\coloneqq\sigma(x)\sigma(x)^{\prime}$ , and let $\mathcal{L}$ be as in Eq. 1.10. According to [40, Theorem II.2.42] (with $h(x)=x\mathds{1}_{{\mathscr{B}}}(x)$ ), for any $f\in C_{b}^{2}({\mathbb{R}}^{n})$ , the process ${\{M_{f}(t)\}_{t\geq 0}}$ , defined as in Eq. 1.11, is a ${\mathbb{P}}_{x}$ -local martingale for every $x\in{\mathbb{R}}^{n}$ . In addition, if (LB) holds true, then ${\{M_{f}(t)\}_{t\geq 0}}$ is a ${\mathbb{P}}_{x}$ -local martingale for every $f\in C_{c}^{\infty}({\mathbb{R}}^{n})$ and every $x\in{\mathbb{R}}^{n}$ , i.e. (MP) is satisfied.

For $x,z\in{\mathbb{R}}^{n}$ define

[TABLE]

If $b(x)\equiv b$ (resp. $\sigma(x)\equiv\sigma$ , or $\upnu(x,\mathrm{d}y)\equiv\upnu(\mathrm{d}y)$ ), then of course $\varDelta_{z}b(x)$ (resp. $\varDelta_{z}\sigma(x)$ , or $\varDelta_{z}\upnu(x,\mathrm{d}y)$ ) is equal to zero.

Theorem 1.4.

Assume that $b(x)$ and $a(x)$ are locally bounded and satisfy the linear growth condition in Eq. 1.12, and that $\upnu(x,\mathrm{d}y)$ is such that $2\in\Theta_{\upnu}$ . If for some $p\in[2,\theta_{\upnu}]\cap\Theta_{\upnu}$ there exist $Q\in{\mathcal{M}}_{+}$ , and a $\sigma$ -finite nonfinite and nonatomic measure $\nu(\mathrm{d}v)$ on $\mathfrak{B}({\mathbb{R}})$ such that Eq. 1.15 admits a unique strong solution ${\{X(t)\}_{t\geq 0}}$ , and

[TABLE]

for some $c(p)>0$ and all $x,z\in{\mathbb{R}}^{n}$ , where $k\colon{\mathbb{R}}^{n}\times{\mathbb{R}}\to{\mathbb{R}}^{n}$ is given in Eq. 1.15, then

[TABLE]

for all $t\geq 0$ and $x,y\in{\mathbb{R}}^{n}$ , where $\overline{\lambda}_{Q}$ ( $\underline{\lambda}_{Q}$ ) stands for the largest (smallest) eigenvalue of $Q$ . Furthermore, ${\{X(t)\}_{t\geq 0}}$ admits a unique invariant $\uppi\in{\mathcal{P}}_{p}({\mathbb{R}}^{n})$ , and

[TABLE]

for all $t\geq 0$ and $\upmu\in{\mathcal{P}}_{p}({\mathbb{R}}^{n})$ .

In addition, if $\sigma(x)\equiv\sigma$ , a constant, $\upnu(x,\mathrm{d}y)\equiv\upnu(\mathrm{d}y)$ , $1\in\Theta_{\upnu}$ , and Eq. 1.16 holds for some $p\in[1,\theta_{\upnu}]\cap\Theta_{\upnu}$ , then Eqs. 1.17 and 1.18 remain valid.
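The synchronous coupling idea behind Theorem 1.4 can be illustrated numerically. The following is a minimal sketch under stated assumptions: we take the one-dimensional linear SDE $\mathrm{d}X=-\kappa X\,\mathrm{d}t+\sigma\,\mathrm{d}B$ (an illustrative example, not taken from the article), discretize it by the Euler-Maruyama scheme, and drive two copies started at different points with the same Brownian increments. The function name and all parameter values are hypothetical.

```python
import math
import random

def coupled_euler_paths(x, y, kappa, sigma, h, n_steps, seed=0):
    """Run two Euler-Maruyama chains from x and y with shared noise."""
    rng = random.Random(seed)
    for _ in range(n_steps):
        dB = math.sqrt(h) * rng.gauss(0.0, 1.0)  # one increment, used by both copies
        x = x - kappa * x * h + sigma * dB
        y = y - kappa * y * h + sigma * dB
    return x, y

x0, y0 = 2.0, -1.0                               # illustrative starting points
kappa, sigma, h, n_steps = 1.5, 0.7, 0.01, 500   # illustrative parameters
xT, yT = coupled_euler_paths(x0, y0, kappa, sigma, h, n_steps)

gap = abs(xT - yT)
# With additive noise the increments cancel in the difference, which therefore
# shrinks by the deterministic factor (1 - kappa * h) at every step.
predicted = (1.0 - kappa * h) ** n_steps * abs(x0 - y0)
```

Since the pair of chains is one particular coupling of the two marginal laws, `gap` is an upper bound for their Wasserstein distance of any order, and it decays geometrically in the number of steps. This mirrors, in a toy setting, the exponential contraction asserted in the theorem.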

We remark that ergodic properties of a Markov process with respect to the ${\mathscr{W}}_{p}$ -distance are invariant under Bochner’s random time-change method. Recall that a subordinator ${\{S(t)\}_{t\geq 0}}$ is a nondecreasing Lévy process on $\left[0,\infty\right)$ with Laplace transform $\mathbb{E}\bigl{[}\mathrm{e}^{-uS(t)}\bigr{]}=\mathrm{e}^{-t\psi(u)}$ , $u,t\geq 0$ . The characteristic (Laplace) exponent $\psi\colon(0,\infty)\to(0,\infty)$ is a Bernstein function, i.e. it is of class $C^{\infty}$ and $(-1)^{n-1}\psi^{(n)}(u)\geq 0$ for all $n\in{\mathbb{N}}$ . It is well known that every Bernstein function admits a unique (Lévy-Khintchine) representation

[TABLE]

where $b_{S}\geq 0$ is the drift parameter and $\upnu_{S}(\mathrm{d}y)$ is a Lévy measure, i.e. a Borel measure on $\mathfrak{B}\bigl{(}(0,\infty)\bigr{)}$ satisfying $\int_{(0,\infty)}(1\wedge y)\,\upnu_{S}(\mathrm{d}y)<\infty$ . For additional reading on subordinators and Bernstein functions we refer the reader to the monograph [74]. Suppose ${\{X(t)\}_{t\geq 0}}$ is a Markov process on $\bigl{(}\mathbb{X},\mathfrak{B}(\mathbb{X})\bigr{)}$ with transition kernel $p(t,x,\mathrm{d}y)$ , and let ${\{S(t)\}_{t\geq 0}}$ be a subordinator with characteristic exponent $\psi(u)$ , independent of ${\{X(t)\}_{t\geq 0}}$ . The process $X^{\psi}(t)\coloneqq X\bigl{(}S(t)\bigr{)}$ , $t\geq 0$ , obtained from ${\{X(t)\}_{t\geq 0}}$ by a random time change through ${\{S(t)\}_{t\geq 0}}$ , is referred to as the subordinate process of ${\{X(t)\}_{t\geq 0}}$ with subordinator ${\{S(t)\}_{t\geq 0}}$ in the sense of Bochner. It is easy to see that ${\{X^{\psi}(t)\}_{t\geq 0}}$ is again a Markov process with transition kernel

[TABLE]

where $\upmu_{t}(\cdot)=\mathbb{P}(S(t)\in\cdot)$ . It is also elementary to check that if $\uppi(\mathrm{d}x)$ is an invariant measure for ${\{X(t)\}_{t\geq 0}}$ , then it is also invariant for the subordinate process ${\{X^{\psi}(t)\}_{t\geq 0}}$ .
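To make the time change concrete, the following is a minimal sketch assuming the simplest nontrivial subordinator, a Poisson process with jump rate `rate` (an illustrative choice, not taken from the article). In that case $S(t)$ is Poisson distributed with mean `rate * t`, and the Laplace exponent is $\psi(u)=\texttt{rate}\,(1-\mathrm{e}^{-u})$ . The helper names are hypothetical. The sketch averages a function of the random time $S(t)$ against the law $\upmu_{t}$ and checks the Laplace transform identity $\mathbb{E}[\mathrm{e}^{-uS(t)}]=\mathrm{e}^{-t\psi(u)}$ .

```python
import math

def poisson_pmf(k, mean):
    """P(N = k) for a Poisson random variable with the given mean > 0."""
    return math.exp(-mean + k * math.log(mean) - math.lgamma(k + 1))

def subordinate_expectation(g, t, rate, n_terms=200):
    """E[g(S(t))] for a Poisson subordinator S, via a truncated series."""
    mean = rate * t
    return sum(poisson_pmf(k, mean) * g(k) for k in range(n_terms))

def psi(u, rate):
    """Laplace exponent of the Poisson subordinator: rate * (1 - e^{-u})."""
    return rate * (1.0 - math.exp(-u))

rate, t, u = 2.0, 1.5, 0.8   # illustrative parameters
lhs = subordinate_expectation(lambda s: math.exp(-u * s), t, rate)
rhs = math.exp(-t * psi(u, rate))
```

Replacing the test function by $s\mapsto{\mathbb{E}}_{x}[f(X(s))]$ in `subordinate_expectation` would give the action of the subordinate semigroup on $f$ , in line with the standard mixture form of the subordinate transition kernel.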

Proposition 1.1.

Assume that ${\{X(t)\}_{t\geq 0}}$ admits an invariant $\uppi\in\mathcal{P}(\mathbb{X})$ such that ${\mathscr{W}}_{p}(\updelta_{x}P_{t},\uppi)\leq c(x)\,r(t)$ for some $p\geq 1$ , and all $t\geq 0$ and $x\in\mathbb{X}$ , where $r\colon[0,\infty)\to[1,\infty)$ is Borel measurable, and $c\colon\mathbb{X}\to[0,\infty)$ . Then,

[TABLE]

where $r_{\psi}(t)\coloneqq\Bigl{(}\mathbb{E}\Bigl{[}\bigl{(}r(S(t))\bigr{)}^{p}\Bigr{]}\Bigr{)}^{\nicefrac{{1}}{{p}}}$ .

Ergodic properties of Markov processes under subordination in the $f$ -norm are discussed in [20, 21, 22].

1.3 Literature review

Our work contributes to the understanding of the ergodic properties of Markov processes. Most of the existing literature focuses on characterizing the exponential or subexponential ergodicity under the $f$ -norm, and in particular the total variation norm, see [3, 19, 24, 25, 26, 32, 33, 35, 58, 64, 65, 66, 78] and the references therein. However, there have been some recent developments in understanding ergodic properties of Markov processes (both continuous and discrete time) under the Wasserstein distances; see [12, 28, 34, 53, 57, 56, 29, 30, 85, 59, 60]. As already mentioned, exponential and subexponential convergence rates in the ${\mathscr{W}}_{1}$ -distance for general Markov processes that are (possibly) not irreducible or aperiodic are established in [12, 28, 53], under the Foster-Lyapunov condition in Eq. 1.1, contractivity of the underlying metric, and smallness of sublevel sets of the corresponding Lyapunov function. Using the coupling approach, the authors in [29, 30, 59] studied exponential ergodicity with respect to a class of Wasserstein distances for SDEs driven by an additive Brownian noise term and a drift term satisfying an asymptotic flatness property at infinity. Under the same assumption on the drift term, these results have been extended in [60, 85] to allow for more general additive Lévy noises. Subexponential ergodicity with respect to the ${\mathscr{W}}_{p}$ -distance for stochastic differential equations driven by an additive Lévy noise term, with a drift term satisfying an asymptotic flatness property at zero, has been studied in [56]. By combining the Foster-Lyapunov method with the coupling approach, exponential ergodicity with respect to a class of $f$ -norms and Wasserstein distances (given in terms of the underlying Lyapunov function) is established in [57] for a class of McKean-Vlasov SDEs with Lévy noise. Lastly, exponential ergodicity with respect to the ${\mathscr{W}}_{1}$ -distance for one-dimensional positive-valued stochastic differential equations with jumps and a drift term satisfying an asymptotic flatness property has been studied in [34].

Our results on both exponential and subexponential ergodicity under the ${\mathscr{W}}_{p}$ -distance contribute to this active research topic. Of particular interest is the result obtained in Theorem 1.2 which seems to be completely new in the literature, and which, in some cases, allows one to conclude that the obtained upper bound on the rate of convergence is sharp.

As we have already remarked, irreducibility and aperiodicity are crucial structural properties of the underlying process used in Theorems 1.1, 1.2 and 1.3. There is a vast literature on these, and related questions such as the strong Feller property and heat kernel estimates of Markov processes. In particular, we refer the readers to [8, 15, 16, 17, 18, 36, 42, 43, 45, 46, 47, 48, 55, 67, 71, 77] for the case of a class of Markov Lévy-type processes with bounded coefficients, and to [7, 9, 39, 44, 56, 62, 63, 68, 69, 72, 76, 86] for the case of a class of Itô processes.

Recall that the Foster-Lyapunov condition in Eq. 1.1 implies that for any $\varepsilon>0$ the $\phi\circ\Phi^{-1}$ -modulated moment of the $\varepsilon$ -shifted hitting time $\tau_{C}^{\varepsilon}\coloneqq\inf\{t\geq\varepsilon\colon X(t)\in C\}$ of $C$ by ${\{X(t)\}_{t\geq 0}}$ (with respect to $\mathbb{P}_{x}$ ) is finite and controlled by ${\mathscr{V}}(x)$ (see [24, Theorem 4.1]). However, this property in general does not immediately imply ergodicity of ${\{X(t)\}_{t\geq 0}}$ . Namely, we also need to ensure that a similar property holds for any other “reasonable” set. If ${\{X(t)\}_{t\geq 0}}$ is irreducible with irreducibility measure $\upvarphi(\mathrm{d}x)$ , then indeed for any $\varepsilon>0$ the $\phi\circ\Phi^{-1}$ -modulated moment of $\tau_{B}^{\varepsilon}$ , for any $B\in\mathfrak{B}(\mathbb{X})$ with $\upvarphi(B)>0$ , is again finite and controlled by ${\mathscr{V}}(x)$ (see [24, the discussion after Theorem 4.1]). However, ${\{X(t)\}_{t\geq 0}}$ can also show certain cyclic behavior which destroys ergodicity (see [65, Section 5] and [64, Chapter 5]). By assuming aperiodicity, which excludes this type of behavior, (sub)exponential ergodicity in the ${\mathscr{W}}_{p}$ -distance of ${\{X(t)\}_{t\geq 0}}$ follows as discussed in Theorem 1.1, and in the $f$ -norm as discussed in [33, Theorem 1].

1.4 Organization of the article

In Section 2, we give the proofs of Theorems 1.1, 1.2, 1.3 and 1.4 and Proposition 1.1, together with some auxiliary lemmas. Applications of the main results to several classes of Markov processes, including Langevin tempered diffusion processes, Ornstein-Uhlenbeck processes with jumps, piecewise Ornstein-Uhlenbeck processes with jumps under constant and stationary Markov controls, state-space models, and backward recurrence time chains, are contained in Section 3.

2 Proofs of the main results

We start with the proof of Theorem 1.1.

Proof of Theorem 1.1.

We consider the case when ${\mathbb{T}}={\mathbb{R}}_{+}$ only. The case when ${\mathbb{T}}=\mathbb{Z}_{+}$ proceeds in an analogous way, by employing the results from [25, Theorem 2.8] and [64, Theorem 15.0.2].

First, under the assumptions of the theorem, it has been shown in [24, Proposition 3.1] and [66, Theorem 4.2] that ${\{X(t)\}_{t\geq 0}}$ admits a unique invariant $\uppi\in{\mathcal{P}}_{\phi\circ{\mathscr{V}}}(\mathbb{X})$ . This, together with Eq. 1.2, implies that $\uppi\in{\mathcal{P}}_{\eta}(\mathbb{X})$ . We continue now with the proof of part (i). By the Kantorovich-Rubinstein theorem, we have

[TABLE]

where the supremum is taken over all Lipschitz continuous functions $f\colon\mathbb{X}\to{\mathbb{R}}$ with Lipschitz constant $\operatorname{Lip}(f)\leq 1$ . We apply [24, Theorem 3.2],

[TABLE]

Note that if $f\colon\mathbb{X}\to{\mathbb{R}}$ is such that $\operatorname{Lip}(f)\leq 1$ and $f(x_{0})=0$ , then $\lvert f(x)\rvert\leq\mathsf{d}(x,x_{0})\leq\Psi_{2}\circ f_{*}(x)$ . Thus

[TABLE]

(recall the definition of the $f$ -norm in Eq. 1.8). Now, from [24, (3.5) and (3.6)] we have

[TABLE]

for some $\bar{c}>0$ , and all $t\geq 0$ and $x,y\in\mathbb{X}$ , which proves Eqs. 1.3 and 1.4, respectively.

We next prove part (ii). Applying Eq. 1.3 and [24, (3.5)] with $\Psi_{1}(z)=1$ , and $\Psi_{2}(z)=z$ , we obtain ${\mathbb{E}}_{x}\left[\mathsf{d}(X(t),x_{0})^{\eta}\right]\leq\overline{m}_{\eta}+\breve{c}\,{\mathscr{V}}(x)$ , for some $\breve{c}>0$ , and all $t\geq 0$ and $x\in\mathbb{X}$ . Hence

[TABLE]

Further, for $t\geq 0$ , $z\in\mathbb{X}$ , and $\Pi\in\mathcal{C}(\updelta_{z}P_{t},\uppi)$ , we have

[TABLE]

Using Eqs. 2.1 and 2.2, and the bound $\int_{{\mathscr{B}}_{t}^{c}(x_{0})}\bigl{(}\mathsf{d}(x,x_{0})\bigr{)}^{p}\,\uppi(\mathrm{d}x)\leq t^{p-\eta}\,\overline{m}_{\eta}$ , we have

[TABLE]

and combining this with Eq. 1.3 we obtain

[TABLE]

for all $t\geq 0$ and $x\in\mathbb{X}$ , from which Eq. 1.5 follows with $\tilde{c}=2\max\{1,\bar{c},\breve{c}\}^{\nicefrac{{1}}{{p}}}$ .
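The tail bound on $\uppi$ used in this step follows from an elementary comparison of powers. We spell it out for $t>0$ , assuming (as the notation suggests) that $\overline{m}_{\eta}$ denotes the $\eta$ -th moment $\int_{\mathbb{X}}\mathsf{d}(x,x_{0})^{\eta}\,\uppi(\mathrm{d}x)$ , that ${\mathscr{B}}_{t}(x_{0})$ is the ball of radius $t$ around $x_{0}$ , and that $p\leq\eta$ . Since $\mathsf{d}(x,x_{0})\geq t$ on ${\mathscr{B}}_{t}^{c}(x_{0})$ and $p-\eta\leq 0$ , we have $\mathsf{d}(x,x_{0})^{p-\eta}\leq t^{p-\eta}$ there, and hence

```latex
\int_{\mathscr{B}_{t}^{c}(x_{0})} \mathsf{d}(x,x_{0})^{p}\,\uppi(\mathrm{d}x)
  = \int_{\mathscr{B}_{t}^{c}(x_{0})} \mathsf{d}(x,x_{0})^{\eta}\,
      \mathsf{d}(x,x_{0})^{p-\eta}\,\uppi(\mathrm{d}x)
  \le t^{p-\eta} \int_{\mathbb{X}} \mathsf{d}(x,x_{0})^{\eta}\,\uppi(\mathrm{d}x)
  = t^{p-\eta}\,\overline{m}_{\eta} .
```

The same computation applies to any probability measure with a finite $\eta$ -th moment in place of $\uppi$ .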

Moving on to the proof of part (iii), note that according to [65, Proposition 6.1], [66, Theorem 4.2], and [26, Theorem 5.2], there exist constants $\mathring{c}>0$ and $\gamma>0$ , such that

[TABLE]

Equation 1.6 now follows from the Kantorovich-Rubinstein theorem and Eq. 1.2. Let $p\in[1,\eta]$ . First, from Eq. 2.3 we obtain ${\mathbb{E}}_{x}\left[\mathsf{d}(X(t),x_{0})^{\eta}\right]\leq\overline{m}_{\eta}+\dot{c}\,{\mathscr{V}}(x)$ , for some $\dot{c}>0$ , and all $t\geq 0$ and $x\in\mathbb{X}$ , which again implies Eq. 2.1. By Eqs. 2.1 and 2.2, we have

[TABLE]

and combining this with Eq. 1.6 we obtain

[TABLE]

from which Eq. 1.7 follows again with $\breve{c}=2\max\{1,\check{c},\dot{c}\}^{\nicefrac{{1}}{{p}}}$ . This completes the proof. ∎

We proceed with the proof of Theorem 1.2.

Proof of Theorem 1.2.

We again consider the case when ${\mathbb{T}}={\mathbb{R}}_{+}$ only. The case when ${\mathbb{T}}=\mathbb{Z}_{+}$ proceeds in a similar manner.

Fix some $x_{0}\in\mathbb{X}$ , $p\in[1,\vartheta]$ and $\iota\in(0,\theta-\vartheta-\varepsilon)$ . For $s>0$ , define $f_{s}\colon\mathbb{X}\to[0,\infty)$ by

[TABLE]

We have

[TABLE]

Since, by assumption, $\int_{\mathbb{X}}\bigl{(}L(x)\bigr{)}^{\vartheta+\varepsilon}\,\uppi(\mathrm{d}{x})=\infty$ , there exists an increasing diverging sequence $\{s_{n}\}_{n\in{\mathbb{N}}}\subset[0,\infty)$ such that

[TABLE]

Note also that $\bigl{(}f_{s}(x)\bigr{)}^{p}\leq 2^{\theta-p}s^{p-\theta}\,\bigl{(}L(x)\bigr{)}^{\theta}\leq\frac{2^{\theta-p}}{c}\,s^{p-\theta}\,{\mathscr{V}}(x)$ for all $s>0$ and $x\in\mathbb{X}$ . This follows from the facts that $f_{s}(x)=0$ for $s>0$ and $x\in\mathbb{X}$ such that $L(x)\leq s/2$ ,

[TABLE]

and $\theta>p\geq 1$ . Thus, by the Foster-Lyapunov condition in Eq. 1.1 (see [66, Theorem 1.1]), we obtain

[TABLE]

Select a sequence $\{t_{n}\}_{n\in{\mathbb{N}}}\subset[0,\infty)$ such that

[TABLE]

Combining Eqs. 2.4, 2.5, 2.6 and 2.7 above we have

[TABLE]

The result then follows by [82, Proposition 7.29], which asserts that

[TABLE]

for all $\upmu_{1},\upmu_{2}\in{\mathcal{P}}_{p}(\mathbb{X})$ and Lipschitz $f:\mathbb{X}\to{\mathbb{R}}$ with Lipschitz constant $\mathrm{Lip}\bigl{(}f\bigr{)}.$ ∎
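A bound of this type, in which the difference of the integrals of a Lipschitz function is controlled by its Lipschitz constant times the Wasserstein distance, can be checked numerically on the real line. The following is a minimal sketch under stated assumptions: we use two empirical measures with equally many atoms, for which the sorted pairing is an optimal coupling for every $p\geq 1$ . The Gaussian samples, the test function, and the helper names are illustrative and do not come from the article.

```python
import random

def wasserstein_p_1d(xs, ys, p):
    """W_p between two empirical measures on the real line with equally many atoms.

    In one dimension the monotone (sorted) pairing is optimal for every p >= 1.
    """
    assert len(xs) == len(ys)
    pairs = zip(sorted(xs), sorted(ys))
    return (sum(abs(a - b) ** p for a, b in pairs) / len(xs)) ** (1.0 / p)

def mean(values):
    return sum(values) / len(values)

rng = random.Random(1)
xs = [rng.gauss(0.0, 1.0) for _ in range(1000)]   # illustrative sample
ys = [rng.gauss(0.5, 2.0) for _ in range(1000)]   # illustrative sample

f = abs  # a test function with Lipschitz constant 1
gap = abs(mean([f(x) for x in xs]) - mean([f(y) for y in ys]))
w1 = wasserstein_p_1d(xs, ys, 1)
w2 = wasserstein_p_1d(xs, ys, 2)
```

Here `gap <= w1 <= w2`, so any lower bound on the difference of integrals of a suitable Lipschitz function translates into a lower bound on the ${\mathscr{W}}_{p}$ -distance. This is the mechanism by which the truncated functions $f_{s}$ are used in the proof above.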

For the proof of Theorem 1.3 we need two auxiliary results given in Lemmas 2.1 and 2.2 below. First, recall that $\{X(t)\}_{t\geq 0}$ is said to be conservative if ${\mathbb{P}}_{x}(X(t)\in{\mathbb{R}}^{n})=1$ for all $t\geq 0$ and $x\in{\mathbb{R}}^{n}$ , and note that this is equivalent to

[TABLE]

where $\tau_{k}\coloneqq\inf\{t\geq 0\colon X(t)\in{\mathscr{B}}^{c}_{k}\}$ for $k\in{\mathbb{N}}$ (here it is also essential that $\{X(t)\}_{t\geq 0}$ has càdlàg sample paths). Namely, for $t\geq 0$ and $x\in{\mathbb{R}}^{n}$ it holds that

[TABLE]

Lemma 2.1.

Assume (LB) and (MP). Then for any $x\in{\mathbb{R}}^{n}$ and any nonnegative $f\in C^{\infty}({\mathbb{R}}^{n})$ such that the map $y\mapsto\int_{{\mathscr{B}}^{c}}f(y+z)\,\upnu(y,\mathrm{d}z)$ is locally bounded, ${\{M_{f}(t)\}_{t\geq 0}}$ is a ${\mathbb{P}}_{x}$ -local martingale (with respect to $\{\mathcal{F}_{t}\}_{t\geq 0}$ ).

Proof.

For $k\in{\mathbb{N}}$ , let $\chi_{k}\in C_{c}^{\infty}({\mathbb{R}}^{n})$ be such that $\mathds{1}_{{\mathscr{B}}_{k}}(x)\leq\chi_{k}(x)\leq\mathds{1}_{{\mathscr{B}}_{k+1}}(x)$ and $\chi_{k}(x)\leq\chi_{k+1}(x)$ for $x\in{\mathbb{R}}^{n}$ . Then, for any $x\in{\mathbb{R}}^{n}$ , $k,j\in{\mathbb{N}}$ and $s,t\geq 0$ , $s\leq t$ , [31, Theorem 2.2.13] implies that

[TABLE]

Next, by employing the monotone and dominated convergence theorems, we easily see that

[TABLE]

and

[TABLE]

Hence, for each $x\in{\mathbb{R}}^{n}$ , $t\geq 0$ and $j\in{\mathbb{N}}$ , $M_{f}(t\wedge\tau_{j})$ is integrable. Also,

[TABLE]

for all $x\in{\mathbb{R}}^{n}$ , $t\geq s\geq 0$ , and $j\in{\mathbb{N}}$ . The assertion now follows from the conservativeness of ${\{X(t)\}_{t\geq 0}}$ . ∎

For $f\in C^{1}({\mathbb{R}}^{n})$ we let

[TABLE]

whenever the integrals are well defined.

Lemma 2.2.

Suppose that $\theta_{\upnu}>0$ , and that Eq. 1.13 holds for some $\theta\in(0,\theta_{\upnu}]\cap\Theta_{\upnu}$ . Then, we have the following:

(i)

If $\theta\in(0,1)$ , and $f\in C^{2}({\mathbb{R}}^{n})$ satisfies

[TABLE]

then $\mathfrak{J}_{1,\upnu}[f](x)$ vanishes at infinity.

(ii)

If $\theta\geq 1$ , and $f\in C^{2}({\mathbb{R}}^{n})$ satisfies

[TABLE]

then $\mathfrak{J}_{\upnu}[f](x)$ vanishes at infinity when $\theta\in[1,2)$ , and the map $x\mapsto(1+\lvert x\rvert)^{2-\theta}\,\mathfrak{J}_{\upnu}[f](x)$ is bounded when $\theta\geq 2$ .

(iii)

If Eq. 1.14 holds for some $\theta>0$ , then there exist $c>0$ and $r=r(\zeta)>0$ , such that for any $\zeta\in\bigl{(}0,\frac{1}{2}\theta\lVert Q\rVert^{-\nicefrac{{1}}{{2}}}\bigr{)}$ we have

[TABLE]

Proof.

The proof of parts (i) and (ii) follows as a straightforward adaptation of [7, Lemma 5.1] by setting

[TABLE]

To prove part (iii), we use the identity

[TABLE]

Consider the set

[TABLE]

On this set we have the bound

[TABLE]

for some $\bar{c}\geq 1$ . Since $\zeta\lVert Q\rVert^{\nicefrac{{1}}{{2}}}<\theta$ , and $\lvert y\rvert_{Q}\geq\lvert ty\rvert_{Q}\geq\frac{1}{2}\lvert x\rvert_{Q}$ on the set $A_{x}$ , there exists $\rho=\rho(\zeta)\geq 1$ such that

[TABLE]

for all $x\in{\mathscr{B}}^{c}_{2\rho}$ and $(t,y)\in A_{x}$ . Hence, using Eqs. 2.10 and 2.11 and Fubini’s theorem, we have

[TABLE]

for all $x\in{\mathscr{B}}^{c}_{2\rho}$ . Next, since $\lvert x+ty\rvert_{Q}>\frac{1}{2}\lvert x\rvert_{Q}$ on the set $A^{c}_{x}$ , we have a bound of the form

[TABLE]

for all $x\in{\mathscr{B}}^{c}$ and $(t,y)\in A_{x}^{c}$ , where, without loss of generality, we use the same constant $\bar{c}$ as in Eq. 2.10. Since $\theta>2\zeta\sqrt{\lVert Q\rVert}$ , it is clear that there exists $\tilde{c}>0$ , independent of $\zeta$ , such that

[TABLE]

Thus, by Eqs. 1.14, 2.13 and 2.14, there exists $\hat{c}>0$ such that

[TABLE]

The estimate in Eq. 2.8 follows from Eqs. 1.14, 2.9, 2.12 and 2.15. This completes the proof. ∎

We next prove Theorem 1.3.

Proof of Theorem 1.3.

In cases (i) and (ii), we take ${\mathscr{V}}(x)={\mathscr{V}}_{Q,\theta}(x)+1$ , while in case (iii) we use ${\mathscr{V}}(x)=\widetilde{\mathscr{V}}_{Q,\zeta}(x)$ with $\zeta>0$ sufficiently small. Then, in view of Lemma 2.2 it is straightforward to see that there exist constants $\bar{c}>0$ , $\tilde{c}>0$ , and $r>0$ , such that

[TABLE]

in case (i), and

[TABLE]

in cases (ii) and (iii). Observe that the above relations, together with [66, Theorem 2.1] and Lemma 2.1, imply that ${\{X(t)\}_{t\geq 0}}$ is conservative. Finally, according to Lemma 2.1 and [24, Theorem 3.4] the process ${\{X(t)\}_{t\geq 0}}$ satisfies Eq. 1.1 with $\phi(t)=t^{\nicefrac{{(\theta-2+\vartheta)}}{{\theta}}}$ in case (i), and $\phi(t)=t$ in cases (ii) and (iii) (for some $b>0$ and closed petite set $C$ ). ∎

The proof of Theorem 1.4 is based on the following lemma.

Lemma 2.3.

Let ${\{X(t)\}_{t\geq 0}}$ be an Itô process with locally bounded coefficients $b(x)$ and $a(x)$ and satisfying the linear growth condition in Eq. 1.12, and $\upnu(x,\mathrm{d}y)$ such that $\theta_{\upnu}>0$ . Then, for any $\theta\in[0,\theta_{\upnu}]\cap\Theta_{\upnu}$ , there exists a constant $c>0$ such that

[TABLE]

Proof.

Let $\varphi\in C^{2}({\mathbb{R}}^{n})$ be such that $\varphi(x)\geq 0$ and $\varphi(x)\leq|x|^{\theta}$ for $x\in{\mathbb{R}}^{n}$ , and $\varphi(x)=|x|^{\theta}$ for $x\in{\mathscr{B}}^{c}$ . Further, for $k\in{\mathbb{N}}$ , let $\varphi_{k}\in C^{2}_{b}({\mathbb{R}}^{n})$ be such that $\varphi_{k}(x)\geq 0$ , $\varphi_{k}(x)=\varphi|_{{\mathscr{B}}_{k+1}}(x)$ , and $\varphi_{k}(x)\nearrow\varphi(x)$ , as $k\to\infty$ , for every $x\in{\mathbb{R}}^{n}$ . Then, according to Itô’s formula and the conservativeness of ${\{X(t)\}_{t\geq 0}}$ we have

[TABLE]

for all $k\in{\mathbb{N}}$ , $t\geq 0$ , and $x\in{\mathbb{R}}^{n}$ , where the constants $c_{k}>0$ depend on $\theta$ , $b(x)$ , $a(x)$ , and the quantities

[TABLE]

for $r>0$ large enough. Clearly, the functions $\varphi_{k}(x)$ can be chosen such that $c\coloneqq\sup_{k\in{\mathbb{N}}}c_{k}<\infty$ . Now, since the function $t\mapsto{\mathbb{E}}_{x}\bigl{[}\varphi_{k}\bigl{(}X(t\wedge\tau_{k})\bigr{)}\bigr{]}$ is bounded and càdlàg, Gronwall’s lemma implies that

[TABLE]

for all $k\in{\mathbb{N}}$ , $t\geq 0$ , and $x\in{\mathbb{R}}^{n}$ . By letting $k\to\infty$ , Fatou’s lemma and the conservativeness of ${\{X(t)\}_{t\geq 0}}$ imply that

[TABLE]

Finally, we have that

[TABLE]

This completes the proof. ∎

We next prove Theorem 1.4.

Proof of Theorem 1.4.

For $p\in[2,\theta_{\upnu}]\cap\Theta_{\upnu}$ , define ${\mathscr{V}}_{p}(x)\coloneqq\lvert x\rvert_{Q}^{p}$ , $x\in{\mathbb{R}}^{n}$ , and

[TABLE]

Calculating $\tilde{{\mathcal{L}}}{\mathscr{V}}_{p}(x;z)$ , using Eq. 1.16, we obtain

[TABLE]

for all $x,z\in{\mathbb{R}}^{n}$ . Next, for $x,z\in{\mathbb{R}}^{n}$ , let $\tau\coloneqq\inf\{t\geq 0\colon\,X^{x+z}(t)=X^{x}(t)\}$ (possibly $+\infty$ ), where ${\{X^{x}(t)\}_{t\geq 0}}$ denotes the solution to Eq. 1.15 with $X^{x}(0)=x$ for $x\in{\mathbb{R}}^{n}$ . By Itô’s formula and the conservativeness of ${\{X(t)\}_{t\geq 0}}$ we obtain

[TABLE]

for all $t\geq 0$ and $k\in{\mathbb{N}}$ , since, for $t\geq\tau$ , $X^{x+z}(t)=X^{x}(t)$ a.s. by the pathwise uniqueness of the solution to Eq. 1.15. From this and Lemma 2.3 we conclude that the function $t\mapsto{\mathbb{E}}\bigl{[}{\mathscr{V}}_{p}\bigl{(}X^{x+z}(t\wedge\tau\wedge\tau_{k})-X^{x}(t\wedge\tau\wedge\tau_{k})\bigr{)}\bigr{]}$ is differentiable a.e. on $(0,\infty)$ . Note that $|\tilde{{\mathcal{L}}}{\mathscr{V}}_{p}\bigl{(}x;z\bigr{)}|\leq c|z|^{p}$ for some $c>0$ and all $x,z\in{\mathbb{R}}^{n}$ . We conclude now that

[TABLE]

for all $k\in{\mathbb{N}}$ . Thus by Gronwall’s lemma, it follows that

[TABLE]

and Fatou’s lemma implies that

[TABLE]

for all $t\geq 0$ and $x,z\in{\mathbb{R}}^{n}$ . Next, from the bound $\underline{\lambda}_{Q}\lvert z\rvert^{2}\leq\lvert z\rvert^{2}_{Q}\leq\overline{\lambda}_{Q}\lvert z\rvert^{2}$ we obtain

[TABLE]

for all $t\geq 0$ and $x,z\in{\mathbb{R}}^{n}$ , thus establishing Eq. 1.17.

Finally, in order to establish Eq. 1.18, we follow the idea from [59, Proof of Corollary 1.8] or [49, Proof of Theorem 2.1]. Observe first that, according to Lemma 2.3, for any $\upmu\in{\mathcal{P}}_{p}({\mathbb{R}}^{n})$ , $\upmu P_{t}\in{\mathcal{P}}_{p}({\mathbb{R}}^{n})$ for all $t\geq 0$ . Next, let $\upmu_{1},\upmu_{2}\in{\mathcal{P}}_{p}({\mathbb{R}}^{n})$ be arbitrary. According to Eq. 1.17, we have

[TABLE]

Fix $t_{0}\geq 0$ such that

[TABLE]

Then, the mapping $\upmu\mapsto\upmu P_{t_{0}}$ is a contraction on ${\mathcal{P}}_{p}({\mathbb{R}}^{n})$ . Thus, since $({\mathcal{P}}_{p}({\mathbb{R}}^{n}),{\mathscr{W}}_{p})$ is a complete metric space, the Banach fixed point theorem entails that there exists a unique $\uppi_{0}\in{\mathcal{P}}_{p}({\mathbb{R}}^{n})$ such that $\uppi_{0}P_{t_{0}}(\mathrm{d}x)=\uppi_{0}(\mathrm{d}x)$ . By defining $\uppi(\mathrm{d}x)\coloneqq t_{0}^{-1}\int_{0}^{t_{0}}\uppi_{0}P_{s}(\mathrm{d}x)\,\mathrm{d}s$ , we can easily see that $\uppi P_{t}(\mathrm{d}x)=\uppi(\mathrm{d}x)$ for all $t\geq 0$ , i.e. $\uppi(\mathrm{d}x)$ is an invariant probability measure for ${\{X(t)\}_{t\geq 0}}$ . By employing Lemma 2.3 again, we also see that $\uppi\in\mathcal{P}_{p}({\mathbb{R}}^{n})$ . Finally, for any $\upmu\in\mathcal{P}_{p}({\mathbb{R}}^{n})$ we have

[TABLE]

which also proves uniqueness of $\uppi(\mathrm{d}x)$ .
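The fixed point argument can be made concrete in a case where everything is explicit. The following is a minimal sketch, assuming the classical one-dimensional Ornstein-Uhlenbeck process $\mathrm{d}X=-X\,\mathrm{d}t+\mathrm{d}B$ with Gaussian initial laws (an illustrative example, not taken from the article). In this case $\upmu P_{t}$ is again Gaussian, and the ${\mathscr{W}}_{2}$ -distance between two Gaussian laws on the real line has the closed form $\sqrt{(m_{1}-m_{2})^{2}+(s_{1}-s_{2})^{2}}$ in terms of their means and standard deviations. The function names are hypothetical.

```python
import math

def ou_push(mean, std, t):
    """Law of X(t) for dX = -X dt + dB when X(0) ~ N(mean, std^2)."""
    decay = math.exp(-t)
    var = std * std * decay * decay + 0.5 * (1.0 - decay * decay)
    return mean * decay, math.sqrt(var)

def w2_gauss_1d(m1, s1, m2, s2):
    """W_2 distance between N(m1, s1^2) and N(m2, s2^2) on the real line."""
    return math.hypot(m1 - m2, s1 - s2)

t0 = 0.5                                  # illustrative time step
mu1, mu2 = (3.0, 2.0), (-1.0, 0.1)        # illustrative (mean, std) pairs
before = w2_gauss_1d(*mu1, *mu2)
after = w2_gauss_1d(*ou_push(*mu1, t0), *ou_push(*mu2, t0))

# Iterating the contraction from any start approaches the fixed point N(0, 1/2).
law = mu1
for _ in range(60):
    law = ou_push(*law, t0)
```

In this example `after <= exp(-t0) * before`, so $\upmu\mapsto\upmu P_{t_{0}}$ is a strict contraction for every $t_{0}>0$ , and the iterates converge to the invariant law, which has mean zero and variance $\nicefrac{{1}}{{2}}$ .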

To prove the second assertion, we adapt the proof of [4, Lemma 7.3.4], where an analogous result is shown for $p=1$ . Define

[TABLE]

and observe that in this case $\tilde{{\mathcal{L}}}$ reduces to

[TABLE]

Calculating $\tilde{{\mathcal{L}}}V_{\varepsilon,p}(x;z)$ , using Eq. 1.16, we obtain

[TABLE]

As before, by Itô’s formula and the conservativeness of ${\{X(t)\}_{t\geq 0}}$ , combined with the fact that the Lévy noise does not depend on the state, we obtain

[TABLE]

for all $t\geq 0$ and $k\in{\mathbb{N}}$ , and

[TABLE]

for all $k\in{\mathbb{N}}$ . Thus by Gronwall’s and Fatou’s lemmas it follows that

[TABLE]

for all $t\geq 0$ and $x,z\in{\mathbb{R}}^{n}$ . Taking limits as $\varepsilon\to 0$ , and using monotone convergence, the assertion follows. ∎

In what follows we give an alternative proof of Theorem 1.4 in the case when $\sigma(x)\equiv\sigma$ and $\upnu(x,\mathrm{d}y)\equiv\upnu(\mathrm{d}y)$ . Let $\bar{X}(t)\coloneqq Q^{\nicefrac{{1}}{{2}}}X(t)$ for $t\geq 0$ . Clearly, ${\{\bar{X}(t)\}_{t\geq 0}}$ is again an Itô process which satisfies

[TABLE]

where ${\{L(t)\}_{t\geq 0}}$ is an $n$ -dimensional pure-jump and zero-drift Lévy process determined by $\upnu(\mathrm{d}y)$ . The corresponding transition probability satisfies

[TABLE]

Thus, we have

[TABLE]

Now, in [11] it has been shown that Eq. 2.16 implies that

[TABLE]

for all $t\geq 0$ and $x,y\in{\mathbb{R}}^{n}$ . Finally we get

[TABLE]

for all $t\geq 0$ and $x,y\in{\mathbb{R}}^{n}$ , which is Eq. 1.17.

Lastly, we prove Proposition 1.1.

Proof of Proposition 1.1.

According to [82, Theorem 4.1], for each $s\in[0,\infty)$ there exists $\Pi_{s}\in\mathcal{C}(\updelta_{x}P_{s},\uppi)$ such that $\bigl{(}{\mathscr{W}}_{p}(\updelta_{x}P_{s},\uppi)\bigr{)}^{p}=\int_{\mathbb{X}\times\mathbb{X}}\bigl{(}\mathsf{d}(y,z)\bigr{)}^{p}\,\Pi_{s}(\mathrm{d}y,\mathrm{d}z)$ . Now, we have that

[TABLE]

which completes the proof. ∎

3 Examples

In this section, we consider applications of the main results to several classes of Markov processes, including Langevin tempered diffusion processes, Ornstein-Uhlenbeck processes with jumps, piecewise Ornstein-Uhlenbeck processes with jumps under constant and stationary Markov controls, state-space models and backward recurrence time chains. Further examples can be found in [24, 25, 32, 33, 78].

3.1 Langevin tempered diffusion processes

We first consider a class of Langevin tempered diffusion processes. Let $\alpha\in(0,1/n)$ , and let $\pi\in C^{2}({\mathbb{R}}^{n})$ be strictly positive, $\pi(x)=c\,\lvert x\rvert^{-\nicefrac{{1}}{{\alpha}}}$ for some $c>0$ and all $x\in{\mathscr{B}}^{c}$ , and $\int_{{\mathbb{R}}^{n}}\pi(x)\,\mathrm{d}x=1$ . Further, for $\beta\in[0,(1+\alpha(2-n))/2]$ and $x\in{\mathbb{R}}^{n}$ , let

[TABLE]

and

[TABLE]

Then, in [33, Proposition 15] it has been shown that the SDE

[TABLE]

admits a weak solution $(\Omega,\mathcal{F},\{\mathcal{F}_{t}\}_{t\geq 0},{\{B(t)\}_{t\geq 0}},{\{X(t)\}_{t\geq 0}},{\mathbb{P}})$ , which is a conservative strong Markov process with continuous sample paths. Moreover, it is irreducible, aperiodic, every compact set is petite, and $\uppi(\mathrm{d}x)\coloneqq\pi(x)\mathrm{d}x$ is its unique invariant probability measure. Here, $(\Omega,\mathcal{F},\{\mathcal{F}_{t}\}_{t\geq 0},{\{B(t)\}_{t\geq 0}},{\mathbb{P}})$ is a standard $n$ -dimensional Brownian motion. Note also that according to Itô’s formula ${\{X(t)\}_{t\geq 0}}$ satisfies (MP) with

[TABLE]

Proposition 3.1.

(i)

If $\beta\in\bigl{[}\alpha,\frac{1}{2}(1+\alpha(1-n))\bigr{)}$ , then the assertions of Theorem 1.1 (iii) hold with ${\mathscr{V}}(x)=1+{\mathscr{V}}_{\mathbb{I}_{n},\frac{\gamma}{\alpha}}(x)$ and $\eta=\frac{\gamma}{\alpha}$ for any $\gamma\in[\alpha,1+\alpha(2-n)-2\beta)$ .

(ii)

If $\alpha\in(0,1/(n+1))$ and $\beta\in[0,\alpha)$ , then the assertions of Theorem 1.1 (i) and (ii) hold with

[TABLE]

for any $\gamma\in[3\alpha-2\beta,1+\alpha(2-n)-2\beta)$ .

(iii)

Under the assumptions of (ii), $\uppi\in{\mathcal{P}}_{\alpha^{-1}(1-\alpha n)-\iota}({\mathbb{R}}^{n})$ for $\iota\in\bigl{(}0,\alpha^{-1}(1-\alpha(n+1))\bigr{)}$ . Let $\rho\in(0,(1-(n+1)\alpha)\wedge 2(\alpha-\beta))$ and $\varepsilon\in\bigl{[}\alpha^{-1}\rho,2\alpha^{-1}(\alpha-\beta)\bigr{)}$ be fixed. Then, for every $p\in\bigl{[}1,\alpha^{-1}(1-n\alpha-\rho)\bigr{]}$ and $\iota\in(0,2\alpha^{-1}(\alpha-\beta)-\varepsilon)$ there exist a positive constant $\bar{c}$ and a diverging increasing sequence $\{t_{n}\}_{n\in{\mathbb{N}}}\subset[0,\infty)$ , depending on the above parameters, such that Eq. 1.9 in Theorem 1.2 holds with ${\mathscr{V}}(x)$ as above, $\theta=\alpha^{-1}(1+\alpha(2-n)-2\beta-\rho)$ , and $\vartheta=\alpha^{-1}(1-n\alpha-\rho)$ .

Proof.

(i)

In [33, Theorem 16 (i)] it has been shown that for $\beta\in\bigl{[}\alpha,\frac{1}{2}(1+\alpha(1-n))\bigr{)}$ and $\gamma\in(0,1+\alpha(2-n)-2\beta)$ the Foster-Lyapunov condition in Eq. 1.1 holds with ${\mathscr{V}}(x)$ as above, $\phi(t)=t$ and $C=\bar{{\mathscr{B}}}_{r}$ for some $r>0$ large enough. Also, the relation in Eq. 1.2 easily follows from the form of ${\mathscr{V}}(x)$ and $\phi(t)$ , and the choice of $\eta$ .

(ii)

In [33, Theorem 16 (ii)] it has been shown that for $\alpha\in(0,1/n)$ , $\beta\in[0,\alpha)$ and $\gamma\in(2(\alpha-\beta),1+\alpha(2-n)-2\beta)$ , the Foster-Lyapunov condition in Eq. 1.1 holds with ${\mathscr{V}}(x)$ and $\phi(t)$ as above and $C=\bar{{\mathscr{B}}}_{r}$ for some $r>0$ large enough. The relation in Eq. 1.2 can again be easily verified due to the form of ${\mathscr{V}}(x)$ and $\phi(t)$ , and the choice of $\eta$ .

(iii)

Since $\vartheta+\varepsilon-\nicefrac{{1}}{{\alpha}}+n-1\geq-1$ , we have $\int_{{\mathbb{R}}^{n}}\lvert x\rvert^{\vartheta+\varepsilon}\,\uppi(\mathrm{d}x)\,=\,\infty$ . The assertion now follows from Theorem 1.2 by taking $L(x)=\lvert x\rvert$ .

This completes the proof. ∎

*Remark 3.1**.*

Observe that the rates obtained in Proposition 3.1 (ii) and (iii) match. Also, in Proposition 3.1 (ii) we assume that $\alpha\in\bigl{(}0,(n+1)^{-1}\bigr{)}$ . Namely, for $\alpha\in\bigl{[}(n+1)^{-1},n^{-1}\bigr{)}$ it holds that $\int_{{\mathbb{R}}^{n}}\lvert x\rvert\,\uppi(\mathrm{d}x)=\infty$ , and hence convergence in the ${\mathscr{W}}_{p}$ -distance cannot hold. On the other hand, in this case, [33, Theorem 16 (ii)] shows subexponential convergence in the $f$ -norm. In the following subsections we give examples of Markov processes which are ergodic in the ${\mathscr{W}}_{p}$ -distance but not in the $f$ -norm. For additional results on ergodic properties of Langevin tempered diffusion processes with respect to the $f$ -norm see [24] and [33].

3.2 Ornstein-Uhlenbeck processes with jumps

We next consider a class of Itô processes with linear drift. Let $H$ be an $n\times n$ matrix, and let ${\{L(t)\}_{t\geq 0}}$ be an $n$ -dimensional Lévy process determined by a Lévy triplet $\bigl{(}b_{L},a_{L},\upnu_{L}(\mathrm{d}y)\bigr{)}$ . It is well known that the SDE

[TABLE]

admits a unique conservative strong solution ${\{X(t)\}_{t\geq 0}}$ which is a strong Markov process with càdlàg sample paths (see e.g. [2, Theorem 3.1 and Proposition 4.2]). In particular, ${\{X(t)\}_{t\geq 0}}$ is an Itô process satisfying (MP) with $b(x)=b_{L}+Hx$ , $a(x)=a_{L}$ , and $\upnu(x,\mathrm{d}y)=\upnu_{L}(\mathrm{d}y)$ . This process is known as an Ornstein-Uhlenbeck process with jumps. In the case when ${\{L(t)\}_{t\geq 0}}$ is a standard Brownian motion, ${\{X(t)\}_{t\geq 0}}$ is the classical Ornstein-Uhlenbeck process. If $H$ is a Hurwitz matrix (a square matrix all of whose eigenvalues have strictly negative real parts), it has been shown in [73, Theorems 4.1 and 4.2] that ${\{X(t)\}_{t\geq 0}}$ admits a unique invariant $\uppi\in{\mathcal{P}}({\mathbb{R}}^{n})$ if, and only if,

[TABLE]

Moreover, if this is the case, then $\lim_{t\to\infty}\updelta_{x}P_{t}\bigl{(}f\bigr{)}\,=\,\uppi\bigl{(}f\bigr{)}$ for all $x\in{\mathbb{R}}^{n}$ and $f\in C_{b}({\mathbb{R}}^{n})$ , i.e. for any $x\in{\mathbb{R}}^{n}$ , the transition kernel $p(t,x,\mathrm{d}y)$ converges weakly, as $t\to\infty$ , to $\uppi(\mathrm{d}y)$ . However, this is not enough for ${\mathscr{W}}_{p}$ -convergence of $p(t,x,\mathrm{d}y)$ to $\uppi(\mathrm{d}y)$ (see [82, Theorem 6.9]). Assume additionally that $1\in\Theta_{\upnu}$ , and let $p\in[1,\theta_{\upnu}]\cap\Theta_{\upnu}$ . Since $H$ is Hurwitz, there exists $Q\in{\mathcal{M}}_{+}$ such that $-(QH+H^{\prime}Q)\in{\mathcal{M}}_{+}$ (see [27, Lemma 2.2]). The left-hand side of Eq. 1.16 then reads

[TABLE]

Now, by setting

[TABLE]

the assertions of Theorem 1.4 follow. We remark here that this result does not necessarily imply ergodicity of ${\{X(t)\}_{t\geq 0}}$ in the $f$ -norm. Indeed, let $n=1$ , and take $L(t)\equiv 0$ for $t\geq 0$ . Then it is easy to see that $X(t)=x\,\mathrm{e}^{Ht}$ for $t\geq 0$ . Thus, $\uppi(\mathrm{d}x)=\updelta_{0}(\mathrm{d}x)$ , and $\updelta_{x}P_{t}$ converges to $\uppi(\mathrm{d}x)$ , as $t\to\infty$ , in ${\mathscr{W}}_{p}$ -distance for any $p\geq 1$ . However, this convergence cannot hold in the $f$ -norm, since for $x\neq 0$ the measures $\updelta_{x}P_{t}=\updelta_{x\mathrm{e}^{Ht}}$ and $\updelta_{0}$ are mutually singular for every $t\geq 0$ .
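The counterexample above can be illustrated numerically. The following is a minimal sketch, not part of the original argument: the helper names `ou_state`, `wasserstein_between_diracs` and `tv_between_diracs`, as well as the values $H=-1$ and $x=3$ , are assumptions chosen for illustration. It uses the facts that the ${\mathscr{W}}_{p}$ -distance between two Dirac measures $\updelta_{a}$ and $\updelta_{b}$ equals $\lvert a-b\rvert$ for every $p\geq 1$ , while their total variation norm (here with the convention $\sup_{\lvert g\rvert\leq 1}\lvert\mu(g)\rvert$ , which is an assumption about normalization) equals $2$ whenever $a\neq b$ .

```python
import math

def ou_state(x: float, H: float, t: float) -> float:
    """Deterministic solution X(t) = x * exp(H t) of dX = H X dt (no noise)."""
    return x * math.exp(H * t)

def wasserstein_between_diracs(a: float, b: float) -> float:
    """W_p(delta_a, delta_b) = |a - b| for every p >= 1 (the coupling is unique)."""
    return abs(a - b)

def tv_between_diracs(a: float, b: float) -> float:
    """Total variation norm of delta_a - delta_b, convention sup over |g| <= 1."""
    return 0.0 if a == b else 2.0

H = -1.0  # assumed value: a Hurwitz "matrix" in dimension n = 1
x = 3.0   # assumed initial state
for t in (0.0, 1.0, 5.0, 20.0):
    xt = ou_state(x, H, t)
    # The Wasserstein distance to delta_0 decays like |x| e^{Ht};
    # the total variation norm stays at its maximal value.
    print(t, wasserstein_between_diracs(xt, 0.0), tv_between_diracs(xt, 0.0))
```

In this sketch the Wasserstein column decays exponentially in $t$ , whereas the total variation column does not move, which is the dichotomy described in the text.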

If $\theta_{\upnu}>1$ , and ${\{X(t)\}_{t\geq 0}}$ satisfies the assumptions in [7, Theorem 3.1] (which ensure that ${\{X(t)\}_{t\geq 0}}$ is irreducible and aperiodic, and that the support of the corresponding irreducibility measure has nonempty interior), then according to [2, Proposition 4.3] and [79, Theorems 5.1 and 7.1] (which imply that every compact set is petite for ${\{X(t)\}_{t\geq 0}}$ ) the conclusions of Theorem 1.3 (ii) hold true for any $\theta\in(1,\theta_{\upnu}]\cap\Theta_{\upnu}$ . If $\theta_{\upnu}>0$ , then under the same assumptions as above, [26, Theorem 5.2], [65, Proposition 6.1], and [66, Theorem 4.2] (and [2, Proposition 4.3], and [79, Theorems 5.1 and 7.1]) imply that for any $\theta\in(0,\theta_{\upnu}]\cap\Theta_{\upnu}$ the process ${\{X(t)\}_{t\geq 0}}$ is exponentially ergodic in the $f$ -norm with $f(x)={\mathscr{V}}_{Q,\theta}(x)+1$ . However, this does not necessarily imply ergodicity of ${\{X(t)\}_{t\geq 0}}$ in the ${\mathscr{W}}_{p}$ -distance. To see this take again $n=1$ , and let ${\{L(t)\}_{t\geq 0}}$ be a one-dimensional symmetric $\alpha$ -stable Lévy process with $\alpha\in(0,1)$ and symbol (characteristic exponent) $q(\xi)=\lvert\xi\rvert^{\alpha}$ . Thus, $1\notin\Theta_{\upnu}$ , and $\theta_{\upnu}=\alpha$ . We claim that $\uppi\notin{\mathcal{P}}_{1}({\mathbb{R}})$ . Assume this is not the case. Then,

[TABLE]

In particular, for every $t>0$ it holds that $\int_{{\mathbb{R}}}\lvert y\rvert\,p(t,x,\mathrm{d}y)<\infty$ , $\uppi$ -a.e. On the other hand, according to [73, Theorem 3.1], we have

[TABLE]

for all $t\geq 0$ , $x\in{\mathbb{R}}$ and $f\in{\mathcal{B}}_{b}({\mathbb{R}}),$ where $\upmu_{t}(\mathrm{d}y)$ is a probability measure on ${\mathbb{R}}$ with characteristic function

[TABLE]

and ${\mathcal{B}}_{b}({\mathbb{R}})$ denotes the space of bounded functions in ${\mathcal{B}}({\mathbb{R}})$ . Hence, $\upmu_{t}(\mathrm{d}y)$ is the law of a symmetric $\alpha$ -stable random variable. Now, the monotone convergence theorem implies that

[TABLE]

which is impossible.

Let us mention that ergodic properties of Ornstein-Uhlenbeck processes with jumps in the $f$ -norm, and in particular in the total variation norm, have been considered in [41, 61, 75, 83, 84].

3.3 Piecewise Ornstein-Uhlenbeck processes with jumps

We extend the results from the previous subsection to a class of Itô processes with a piecewise linear drift. Consider an $n$ -dimensional SDE of the form

[TABLE]

where

(i)

the function $\bar{b}\colon{\mathbb{R}}^{n}\to{\mathbb{R}}^{n}$ is given by

[TABLE]

where $l\in{\mathbb{R}}^{n}$ , $v\in{\mathbb{R}}^{n}$ has nonnegative components and satisfies $\langle e,v\rangle=1$ with $e=(1,\dotsc,1)^{\prime}\in{\mathbb{R}}^{n}$ , $M\in{\mathbb{R}}^{n\times n}$ is a nonsingular M-matrix such that the vector $e^{\prime}M$ has nonnegative components, and $\varGamma=\operatorname*{diag}(\gamma_{1},\dotsc,\gamma_{n})$ with $\gamma_{i}\geq 0$ for $i=1,\dotsc,n$ ;

(ii)

${\{B(t)\}_{t\geq 0}}$ is a standard $m$ -dimensional Brownian motion, and the covariance function $\sigma\colon{\mathbb{R}}^{n}\to{\mathbb{R}}^{n\times m}$ is locally Lipschitz continuous and satisfies, for some $c>0$ ,

[TABLE]

(iii)

${\{L(t)\}_{t\geq 0}}$ is an $n$ -dimensional pure-jump Lévy process specified by a drift $b_{L}\in{\mathbb{R}}^{n}$ and Lévy measure $\upnu_{L}(\mathrm{d}y)$ .

Recall that an $n\times n$ matrix $M$ is called an M-matrix if it can be expressed as $M=\mu\mathbb{I}_{n}-N$ for some $\mu>0$ and some nonnegative $n\times n$ matrix $N$ with the property that $\rho(N)\leq\mu$ , where $\mathbb{I}_{n}$ and $\rho(N)$ denote the $n\times n$ identity matrix and the spectral radius of $N$ , respectively. Clearly, the matrix $M$ is nonsingular if $\rho(N)<\mu$ . It is well known that the SDE in Eq. 3.1 admits a unique conservative strong solution ${\{X(t)\}_{t\geq 0}}$ which is a strong Markov process with càdlàg sample paths (see e.g. [2, Theorem 3.1 and Proposition 4.2]). In particular, ${\{X(t)\}_{t\geq 0}}$ is an Itô process satisfying (MP) with $b(x)=b_{L}+\bar{b}(x)$ , $a(x)=\sigma(x)\sigma(x)^{\prime}$ , and $\upnu(x,\mathrm{d}y)=\upnu_{L}(\mathrm{d}y)$ . This process is often called a piecewise Ornstein-Uhlenbeck process with jumps. It arises as a limit of the suitably scaled queueing processes of multiclass many-server queueing networks with heavy-tailed (bursty) arrivals and/or asymptotically negligible service interruptions. In these models, if the scheduling policy is based on a static priority assignment on the queues, then the vector $v$ in the limiting diffusion Eq. 3.1 corresponds to a constant control. The process ${\{X(t)\}_{t\geq 0}}$ also arises in many-server queues with phase-type service times, where the constant vector $v$ corresponds to the probability distribution of the phases. For a multiclass queueing network with independent heavy-tailed arrivals, the process ${\{L(t)\}_{t\geq 0}}$ is an anisotropic Lévy process consisting of independent one-dimensional symmetric $\alpha$ -stable components. Under service interruptions, ${\{L(t)\}_{t\geq 0}}$ is either a compound Poisson process, or an anisotropic Lévy process described above together with a compound Poisson component. More details on these queueing models can be found in [7, Section 4].
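The M-matrix condition recalled at the beginning of this paragraph can be tested numerically. The sketch below is an illustration under stated assumptions, not a construction from the paper: it relies on the standard characterization that a matrix with nonpositive off-diagonal entries (so that it can be written as $\mu\mathbb{I}_{n}-N$ with $N$ nonnegative) is a nonsingular M-matrix if, and only if, all of its leading principal minors are positive. The function names and the two test matrices are hypothetical.

```python
def det(a):
    """Determinant by Gaussian elimination with partial pivoting."""
    n = len(a)
    m = [row[:] for row in a]
    d = 1.0
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(m[r][c]))
        if abs(m[piv][c]) < 1e-14:
            return 0.0
        if piv != c:
            m[c], m[piv] = m[piv], m[c]
            d = -d
        d *= m[c][c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for k in range(c, n):
                m[r][k] -= f * m[c][k]
    return d

def is_nonsingular_m_matrix(M):
    """Off-diagonal entries <= 0 and all leading principal minors > 0."""
    n = len(M)
    if any(M[i][j] > 0 for i in range(n) for j in range(n) if i != j):
        return False
    return all(det([row[:k] for row in M[:k]]) > 0 for k in range(1, n + 1))

def column_sums_nonnegative(M):
    """Checks that the vector e'M has nonnegative components."""
    n = len(M)
    return all(sum(M[i][j] for i in range(n)) >= 0 for j in range(n))

M_good = [[2.0, -1.0], [-1.0, 2.0]]  # hypothetical example: 2*I - N with rho(N) = 1 < 2
M_bad = [[1.0, -2.0], [-2.0, 1.0]]   # hypothetical example: 1*I - N with rho(N) = 2 > 1
print(is_nonsingular_m_matrix(M_good), column_sums_nonnegative(M_good))
print(is_nonsingular_m_matrix(M_bad))
```

The second helper checks the additional requirement, used in the model above, that $e^{\prime}M$ has nonnegative components.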

We first discuss the case when $\varGamma v=0$ . This corresponds to the case when the control gives lowest priority to queues whose abandonment rate is zero. When $1\in\Theta_{\upnu}$ , we define

[TABLE]

Proposition 3.2.

In addition to the assumptions of [7, Theorem 3.1] (which ensure that ${\{X(t)\}_{t\geq 0}}$ is irreducible and aperiodic with irreducibility measure having support with nonempty interior), suppose that $\varGamma v=0$ , $2\in\Theta_{\upnu}$ , and $\bigl{\langle}e,M^{-1}\tilde{l}\bigr{\rangle}<0$ .

(i)

If

[TABLE]

then there exists $Q\in{\mathcal{M}}_{+}$ such that the assertions of Theorem 1.3 (i) hold true with $\vartheta=1$ .

(ii)

If $a(x)$ is bounded, and $\int_{{\mathscr{B}}^{c}}\mathrm{e}^{\theta\lvert y\rvert}\,\upnu_{L}(\mathrm{d}y)<\infty$ for some $\theta>0$ , then there exists $Q\in{\mathcal{M}}_{+}$ such that the assertions of Theorem 1.3 (iii) hold.

Proof.

(i)

In [7, Theorem 3.2 (i)] it has been shown that there exist $Q\in{\mathcal{M}}_{+}$ , $\bar{c}=\bar{c}(\theta)>0$ , and $\tilde{c}=\tilde{c}(\theta)>0$ , such that for any $\theta\in[1,\theta_{\upnu}]\cap\Theta_{\upnu}$ , we have

[TABLE]

It is easy to see that the above relation implies that there exist $r>0$ , $\hat{c}>0$ , and $\breve{c}>0$ , such that

[TABLE]

with ${\mathscr{V}}(x)={\mathscr{V}}_{Q,\theta}(x)+1$ . The assertion now follows from Theorem 1.3 (i), together with [2, Proposition 4.3], [24, Theorem 3.4], and [79, Theorems 5.1 and 7.1].

(ii)

Let $\tilde{b}(x)\,\coloneqq\,\bar{b}(x)+\tilde{l}-l$ . As shown in the proof of [7, Theorem 3.2 (ii)], there exist $Q\in{\mathcal{M}}_{+}$ , $\bar{c}=\bar{c}(\zeta)>0$ and $\tilde{c}=\tilde{c}(\zeta)>0$ , such that for any $\zeta\in\bigl{(}0,\theta\lVert Q\rVert^{-\nicefrac{{1}}{{2}}}\bigr{)}$ ,

[TABLE]

This together with Lemma 2.2 (iii) imply that, for any $\zeta>0$ sufficiently small, there exist $\hat{c}=\hat{c}(\zeta)>0$ and $\check{c}=\check{c}(\zeta)>0$ , such that

[TABLE]

Again, it is straightforward to see that the above relation implies that there exist $r>0$ , $\breve{c}>0$ and $\mathring{c}>0$ , such that

[TABLE]

The assertion now follows from Theorem 1.3 (iii), and the results from [2, 24, 79] cited in part (i).∎

*Remark 3.2**.*

It has been shown in [7, Theorem 3.3 (b) and Lemma 5.7] that the assumptions $1\in\Theta_{\upnu}$ and $\langle e,M^{-1}\tilde{l}\rangle<0$ are both necessary for the existence of an invariant probability measure of ${\{X(t)\}_{t\geq 0}}$ . Using this, we can exhibit an example where we have ergodicity with respect to the $f$ -norm but not with respect to ${\mathscr{W}}_{p}$ -distance. Suppose that $\varGamma v=0$ , $\langle e,M^{-1}\tilde{l}\rangle<0$ , $a(x)$ satisfies Eq. 3.3, and ${\{L(t)\}_{t\geq 0}}$ is a rotationally invariant $\alpha$ -stable process with $\alpha\in(1,2)$ . Then [7, Theorem 3.1 (i)] shows that ${\{X(t)\}_{t\geq 0}}$ admits a unique invariant $\uppi\in{\mathcal{P}}_{\alpha-1-\iota}({\mathbb{R}}^{n})$ for $\iota\in(0,\alpha-1)$ , and

[TABLE]

for all $x\in{\mathbb{R}}^{n}$ and $\iota\in(0,\alpha-1)$ . Here, $\lVert\cdot\rVert_{\mathrm{TV}}$ stands for the total variation norm, i.e. the $f$ -norm with $f(x)\equiv 1$ . However, $\int_{{\mathbb{R}}^{n}}\lvert x\rvert\,\uppi(\mathrm{d}x)=\infty$ by [7, Theorem 3.4 (b)], so we cannot have convergence in ${\mathscr{W}}_{1}$ -distance.

We next exhibit a lower bound on the polynomial rate of convergence in Proposition 3.2 (i), which is analogous to [7, Theorem 3.4]. We let

[TABLE]

Note that, in general, $\tilde{\theta}_{\upnu}\geq\theta_{\upnu}$ . In [7] it is assumed that ${\{L(t)\}_{t\geq 0}}$ is a compound Poisson process with drift $b_{L}$ , and Lévy measure $\upnu_{L}(\mathrm{d}y)$ which is supported on a half-line of the form $\{\zeta w\colon\zeta\in[0,\infty)\}$ with $\langle e,M^{-1}w\rangle>0$ , and $a(x)$ satisfies Eq. 3.3. This implies that $\tilde{\theta}_{\upnu}=\theta_{\upnu}$ , and subsequently, this equality is used in the proof of [7, Lemma 5.7 (b)] to establish that, provided $\varGamma v=0$ , $\int_{{\mathbb{R}}^{n}}\bigl{(}\langle e,M^{-1}x\rangle^{+}\bigr{)}^{p-1}\,\uppi(\mathrm{d}x)<\infty$ implies $p\in\Theta_{\upnu}$ for $p>1$ . In the proof of the following proposition we use the fact that the conclusions of [7, Lemma 5.7 (b)] hold under the weaker assumption that $\tilde{\theta}_{\upnu}=\theta_{\upnu}$ .

Proposition 3.3.

In addition to the assumptions of [7, Theorem 3.1], assume that $\varGamma v=0$ , $\bigl{\langle}e,M^{-1}\tilde{l}\bigr{\rangle}<0$ , and $\tilde{\theta}_{\upnu}=\theta_{\upnu}\in(2,\infty)$ . Then, due to Proposition 3.2 (i), ${\{X(t)\}_{t\geq 0}}$ admits a unique invariant $\uppi\in{\mathcal{P}}_{\theta_{\upnu}-1-\iota}({\mathbb{R}}^{n})$ , $\iota\in(0,\theta_{\upnu}-1)$ . Next, fix $\rho\in(0,(\theta_{\upnu}-2)\wedge 1)$ and $\varepsilon\in(\rho,1)$ . Then, for any $p\in[1,\theta_{\upnu}-\rho-1]$ and $\iota\in(0,1-\varepsilon)$ there exist $\bar{c}>0$ and a diverging increasing sequence $\{t_{n}\}_{n\in{\mathbb{N}}}\subset[0,\infty)$ , depending on these parameters, such that Eq. 1.9 holds with $\theta=\theta_{\upnu}-\rho$ , $\vartheta=\theta-1$ , and ${\mathscr{V}}(x)={\mathscr{V}}_{Q,\theta}(x)+1$ , where $Q\in{\mathcal{M}}_{+}$ is given in Proposition 3.2 (i).

Proof.

Observe first that $\vartheta+\varepsilon>\theta_{\upnu}-1$ . Thus, according to [7, Lemma 5.7 (b)], we have

[TABLE]

The assertion now follows from the proof of Proposition 3.2 (i) (together with [2, Proposition 4.3], [24, Theorem 3.4], and [79, Theorems 5.1 and 7.1]), and Theorem 1.2 by setting $L(x)=\langle e,M^{-1}x\rangle^{+}$ and $\phi(t)=t^{\nicefrac{{(\theta-1)}}{{\theta}}}$ . ∎

We now discuss the case when $\varGamma v\neq 0$ . For $x\in{\mathbb{R}}^{n}$ , we write $x\geq 0$ ( $x\gneqq 0$ ) to indicate that all components of $x$ are nonnegative (nonnegative and at least one is strictly positive). Also, for $x,y\in{\mathbb{R}}^{n}$ we write $x\geq y$ if, and only if, $x-y\geq 0$ .

Proposition 3.4.

In addition to the assumptions of [7, Theorem 3.1], suppose that $\theta_{\upnu}>0$ ,

[TABLE]

and that one of the following holds:

(i)

$Mv\geq\varGamma v\gneqq 0$ ;

(ii)

$M=\operatorname*{diag}(m_{1},\dotsc,m_{n})$ with $m_{i}>0$ , $i=1,\dotsc,n$ , and $\varGamma v\neq 0$ .

Then there exists $Q\in{\mathcal{M}}_{+}$ such that the assertions of Theorem 1.3 (ii) hold true.

Proof.

In [7, Theorem 3.5] it has been shown that there exist $Q\in{\mathcal{M}}_{+}$ , $\bar{c}=\bar{c}(\theta)>0$ , and $\tilde{c}=\tilde{c}(\theta)>0$ , such that for any $\theta\in(0,\theta_{\upnu}]\cap\Theta_{\upnu}$ , we have

[TABLE]

As in Proposition 3.2, it is easy to see that the above relation implies that there exist $r>0$ , $\hat{c}>0$ and $\breve{c}>0$ , such that

[TABLE]

with ${\mathscr{V}}(x)={\mathscr{V}}_{Q,\theta}(x)+1$ . The assertion now follows from Theorem 1.3 (ii), together with the results from [2, 24, 79] cited in the proof of Proposition 3.3. ∎

In the case when $\varGamma v\neq 0$ (under (i) or (ii) in Proposition 3.4) the dynamics are contractive in the ${\mathscr{W}}_{p}$ -distance. This is shown by establishing an asymptotic flatness (uniform dissipativity) property for ${\{X(t)\}_{t\geq 0}}$ . As a consequence, we assert exponential ergodicity of ${\{X(t)\}_{t\geq 0}}$ with respect to ${\mathscr{W}}_{p}$ , without assuming irreducibility and aperiodicity, i.e. we allow the SDE in Eq. 3.1 to be degenerate.

Proposition 3.5.

Suppose that $2\in\Theta_{\upnu}$ , $\sigma(x)$ is Lipschitz continuous, and either (i) or (ii) in Proposition 3.4 holds. Then there exists $Q\in\mathcal{M}_{+}$ such that the matrices

[TABLE]

are in $\mathcal{M}_{+}$ . Let $\underline{\kappa}$ denote the smallest eigenvalue of the positive definite matrices in Eq. 3.6, and let $\overline{\lambda}_{Q}$ and $\underline{\lambda}_{Q}$ denote the largest and smallest eigenvalues of $Q$ , respectively. For $p\geq 1$ , let

[TABLE]

where $\operatorname{Lip}(\sqrt{Q}\,\sigma)$ is the Lipschitz constant of $\sqrt{Q}\,\sigma(x)$ with respect to the Hilbert-Schmidt norm, and suppose that $c(p)>0$ for some $p\in[2,\theta_{\upnu}]\cap\Theta_{\upnu}$ . Then the assertions of Theorem 1.4 hold true. If $\sigma(x)\equiv\sigma$ and $1\in\Theta_{\upnu}$ , the assertions of Theorem 1.4 hold true for any $p\in[1,\theta_{\upnu}]\cap\Theta_{\upnu}$ .

Proof.

Existence of the matrix $Q$ has been proven in [7, Theorem 3.5]. We prove that Eq. 1.16 holds with $c(p)$ defined above. First, clearly,

[TABLE]

for all $x,y\in{\mathbb{R}}^{n}$ . We next discuss the term $\bigl{\langle}\varDelta_{y-x}\tilde{b}(x),Q(y-x)\bigr{\rangle}$ for $x,y\in{\mathbb{R}}^{n}$ . Clearly, $\varDelta_{y-x}\tilde{b}(x)=\varDelta_{y-x}\bar{b}(x)$ for $x,y\in{\mathbb{R}}^{n}$ . With $\hat{v}=-M^{-1}(Mv-\varGamma v)$ , we have $\bar{b}(x)=l-M(x+\langle e,x\rangle^{+}\,\hat{v})$ . If both $x$ and $y$ are on the same half-space, i.e. $\langle e,x\rangle\geq 0$ and $\langle e,y\rangle\geq 0$ , or the opposite, then

[TABLE]

So suppose, without loss of generality, that $\langle e,x\rangle\geq 0$ and $\langle e,y\rangle\leq 0$ . Then we have

[TABLE]

We distinguish two cases.

(i)

$\bigl{\langle}y-x,QM\hat{v}e^{\prime}x\bigr{\rangle}\leq 0$ . Then of course subtracting Eq. 3.8a from Eq. 3.8b, we obtain

[TABLE]

(ii)

$\bigl{\langle}y-x,QM\hat{v}e^{\prime}x\bigr{\rangle}>0$ . Since $\langle e,x\rangle\geq 0$ , we must have $\bigl{\langle}y-x,QM\hat{v}\bigr{\rangle}>0$ . This in turn implies, since $\langle e,y\rangle\leq 0$ , that

[TABLE]

Adding Eqs. 3.8a and 3.9 and subtracting Eq. 3.8b from the sum, we obtain

[TABLE]

Finally, combining Eqs. 3.7 and 3.10, we obtain

[TABLE]

thus completing the proof. ∎

The hypothesis in Proposition 3.5 that $c(p)>0$ is, of course, always true if $\sigma(x)\equiv\sigma$ , in which case we have $c(p)=p\frac{\underline{\kappa}}{2\overline{\lambda}_{Q}}$ . This is the scenario for multiclass queueing models with service interruptions described in [7, Section 4.2].

Some examples of degenerate SDEs of the form Eq. 3.1 for which Proposition 3.5 is applicable are the following.

(i)

${\{L(t)\}_{t\geq 0}}$ is given by $L(t)=R\tilde{L}(t)$ for $t\geq 0$ , where $R\in\mathbb{R}^{n\times r}$ has rank smaller than $\min\{n,r\}$ , and $\{\tilde{L}(t)\}_{t\geq 0}$ is an $r$ -dimensional Lévy process. As a special case $\{\tilde{L}(t)\}_{t\geq 0}$ may be composed of mutually independent $\alpha$ -stable processes. This is the case in the queueing example described below.

(ii)

${\{L(t)\}_{t\geq 0}}$ is a degenerate subordinate Brownian motion, as studied in [87].

The following is an example of a degenerate SDE that arises in applications for which Proposition 3.5 is applicable. Consider a two-class $GI/M/k+M$ queue with class-1 jobs having a Poisson arrival process, and class-2 jobs having a heavy-tailed renewal arrival process. Service and patience times are exponentially distributed with rates $m_{i}$ and $\gamma_{i}$ for $i=1,2$ , respectively. Assume that the arrival, service and abandonment processes are mutually independent, and that the number of servers is $k$ . Consider a sequence of such models indexed by $k$ , operating in the critically loaded asymptotic modified Halfin-Whitt regime as $k\to\infty$ . Let ${\{A^{k}_{i}(t)\}_{t\geq 0}}$ denote the arrival process for class $i=1,2$ , with arrival rates $\lambda^{k}_{i}$ . Assume that $m_{i}$ and $\gamma_{i}$ for $i=1,2$ are independent of $k$ , and that $\frac{\lambda^{k}_{i}}{k}\to\lambda_{i}>0$ as $k\to\infty$ , for $i=1,2$ . The arrival process ${\{A^{k}_{1}(t)\}_{t\geq 0}}$ satisfies a functional central limit theorem (FCLT) with a Brownian motion limit ${\{\hat{A}_{1}(t)\}_{t\geq 0}}={\{\sqrt{\lambda_{1}}B_{1}(t)\}_{t\geq 0}}$ , where ${\{B_{1}(t)\}_{t\geq 0}}$ is a standard Brownian motion, i.e.

[TABLE]

Here, $\xRightarrow{\mathrm{J}_{1}}$ denotes the convergence in the space $D=D([0,\infty),{\mathbb{R}})$ of càdlàg functions endowed with the Skorokhod $\mathrm{J}_{1}$ topology. We assume that the arrival process ${\{A^{k}_{2}(t)\}_{t\geq 0}}$ satisfies an FCLT with a symmetric $\alpha$ -stable Lévy process ${\{\hat{A}_{2}(t)\}_{t\geq 0}}$ , $\alpha\in(1,2)$ , in the limit, i.e.

[TABLE]

Here, $\xRightarrow{\mathrm{M}_{1}}$ denotes the convergence in the space $D$ with the $\mathrm{M}_{1}$ topology. Let $\rho^{k}_{i}=\frac{\lambda^{k}_{i}}{km_{i}}$ and $\rho_{i}=\frac{\lambda_{i}}{m_{i}}$ for $i=1,2$ . The modified Halfin-Whitt regime requires that the parameters satisfy

[TABLE]

In addition, we assume that $k^{-\nicefrac{{1}}{{\alpha}}}(\lambda^{k}_{i}-k\lambda_{i})\to l_{i}$ as $k\to\infty$ for $i=1,2$ . Next, let ${\{X^{k}_{i}(t)\}_{t\geq 0}}$ denote the number of class- $i$ jobs in the system. Define the scaled processes $\hat{X}^{k}_{i}(t)=k^{-\nicefrac{{1}}{{\alpha}}}(X^{k}_{i}(t)-k\rho_{i}t)$ for $t\geq 0$ . Let ${\{U^{k}_{i}(t)\}_{t\geq 0}}$ be the scheduling control process, representing allocations of service capacity to class $i$ . Let $\hat{X}^{k}(t)=\bigl{(}\hat{X}^{k}_{1}(t),\hat{X}^{k}_{2}(t)\bigr{)}^{\prime}$ and $U^{k}(t)=\bigl{(}U^{k}_{1}(t),U^{k}_{2}(t)\bigr{)}^{\prime}$ for $t\geq 0$ . We consider work conserving and preemptive scheduling policies resulting in constant controls at the limit, i.e. ${\{U^{k}(t)\}_{t\geq 0}}\xRightarrow[k\to\infty]{\mathrm{J}_{1}}{\{V(t)\}_{t\geq 0}}$ , where $V(t)=v$ for $t\geq 0$ with $v\in{\mathbb{R}}^{2}$ being a probability vector. Then, as in [7, Theorem 4.1], it can be shown that ${\{\hat{X}^{k}(t)\}_{t\geq 0}}\xRightarrow[k\to\infty]{\mathrm{M}_{1}}{\{X(t)\}_{t\geq 0}}$ , where the limit process ${\{X(t)\}_{t\geq 0}}$ is a solution to the following two-dimensional degenerate $\alpha$ -stable driven SDE:

[TABLE]

which is Eq. 3.1 with $l=(l_{1},l_{2})^{\prime}$ , $M=\operatorname*{diag}(m_{1},m_{2})$ , $\varGamma=\operatorname*{diag}(\gamma_{1},\gamma_{2})$ , $\sigma(x)=(0,0)^{\prime}$ , and $L(t)=(0,\hat{A}_{2}(t))^{\prime}$ for $t\geq 0$ . Observe that the process ${\{X(t)\}_{t\geq 0}}$ does not fall into any of the four categories in [7, Theorem 3.1]. In fact, one can consider multiple classes of jobs with all heavy-tailed arrival processes that have different scaling parameters $\alpha_{i}$ ’s for $i=1,\dots,\bar{k}$ , in their corresponding FCLTs. The centered queueing process should be scaled as $k^{-\nicefrac{{1}}{{\alpha}}}$ , where $\alpha\coloneqq\min_{i=1,\dots,\bar{k}}\{\alpha_{i}\}$ , and the limit process has the components ${\{X_{i}(t)\}_{t\geq 0}}$ driven by independent $\alpha$ -stable processes if the arrival process of class $i$ has the parameter $\alpha_{i}$ equal to the minimum $\alpha$ , and the other components are degenerate without stochastic driving terms.

We remark here that without assuming irreducibility and aperiodicity, establishing subgeometric ergodicity in the case $\varGamma v=0$ is difficult. Consider the following example. Let $n=1$ , $\sigma(x)\equiv 0$ , $L(t)\equiv 0$ for $t\geq 0$ , and

[TABLE]

Clearly, $\bar{b}(x)$ satisfies all the assumptions in [7], and

[TABLE]

A straightforward calculation shows that

[TABLE]

Let

[TABLE]

Then it is easy to see that the conditions (1)–(3) in [12, Theorem 2.4] hold. However, condition (4) does not hold. Namely, for arbitrary $t_{0}>0$ let $x,y>t_{0}$ . Then, $\mathsf{d}\bigl{(}X^{x}(t),X^{y}(t)\bigr{)}=\mathsf{d}(x,y)$ for all $t_{0}\leq t\leq x\wedge y$ .

Let us mention that ergodic properties of piecewise Ornstein-Uhlenbeck processes with jumps in the total variation norm have been considered in [7, 23, 70].

3.4 Piecewise Ornstein-Uhlenbeck processes with jumps under stationary Markov controls

In Subsection 3.3 we consider a model with a constant control, i.e. with the vector $v\in\Delta\coloneqq\{u\in{\mathbb{R}}^{n}\colon u\geq 0,\ \langle e,u\rangle=1\}$ being constant and fixed. If the scheduling policy (control) is a function of the state of the system, then $v(x)$ in the limiting SDE Eq. 3.1 is, in general, a Borel measurable map from ${\mathbb{R}}^{n}$ to $\Delta$ . We call such a $v(x)$ a stationary Markov control and denote the set of such controls by $\mathfrak{U}_{\mathrm{SM}}$ . If $L_{t}\equiv 0$ for $t\geq 0$ , or it is a compound Poisson process, it follows from the results in [37] that, under any $v\in\mathfrak{U}_{\mathrm{SM}}$ , Eq. 3.1 admits a unique conservative strong solution which is a strong Markov process with càdlàg sample paths. In the general case, we consider the subclass of stationary Markov controls for which

[TABLE]

is locally Lipschitz continuous. We let $\widetilde{\mathfrak{U}}_{\mathrm{sm}}$ denote the class of such controls. Clearly, for any $v\in\widetilde{\mathfrak{U}}_{\mathrm{sm}}$ , the drift $\bar{b}_{v}(x)$ has at most linear growth. Other parameters are as in Subsection 3.3. Again, the SDE of the form Eq. 3.1, with $\bar{b}(x)$ replaced by $\bar{b}_{v}(x)$ , admits a unique conservative strong solution ${\{X(t)\}_{t\geq 0}}$ which is a strong Markov process with càdlàg sample paths. Also, it is an Itô process satisfying (MP) with $b(x)=b_{L}+\bar{b}_{v}(x)$ , $a(x)=\sigma(x)\sigma(x)^{\prime}$ , and $\upnu(x,\mathrm{d}y)=\upnu_{L}(\mathrm{d}y)$ .

Recently, in [6] the authors have studied ergodic properties with respect to the total variation norm of this model with ${\{L(t)\}_{t\geq 0}}$ being either (or a combination of) a rotationally invariant $\alpha$ -stable Lévy process, an anisotropic Lévy process consisting of independent one-dimensional symmetric $\alpha$ -stable components, or a compound Poisson process. Observe that in this situation we cannot follow the procedure from the constant control case. Namely, the matrices $Q\in{\mathcal{M}}_{+}$ used in constructing the appropriate Lyapunov functions ${\mathscr{V}}(x)$ depend on $v$ .

Proposition 3.6.

Grant the assumptions of [7, Theorem 3.1], and suppose that $M=\operatorname*{diag}(m_{1},\dotsc,m_{n})$ , with $m_{i}>0$ for $i=1,\dots,n.$

(i)

Assume that the diagonal components of $\varGamma$ are strictly positive, $a(x)$ satisfies Eq. 3.5, and ${\{L(t)\}_{t\geq 0}}$ is either a rotationally invariant $\alpha$ -stable Lévy process, an anisotropic Lévy process consisting of independent one-dimensional symmetric $\alpha$ -stable components (in both cases we assume that $\alpha\in(1,2)$ ), or a compound Poisson process satisfying $1\in\Theta_{\upnu}$ . We allow ${\{L(t)\}_{t\geq 0}}$ to have a drift. Then, for any $v\in\widetilde{\mathfrak{U}}_{\mathrm{sm}}$ and $\theta\in[1,\theta_{\upnu}]\cap\Theta_{\upnu}$ , the assertions of Theorem 1.1 (iii) hold true with $\eta=\theta$ , and ${\mathscr{V}}(x)=\bigl{(}\bar{\mathscr{V}}(x)\bigr{)}^{\theta}+1$ , where $\bar{\mathscr{V}}\in C^{2}({\mathbb{R}}^{n})$ (given explicitly in [6, Definition 1]) is bounded from below away from zero, is Lipschitz continuous, and satisfies

[TABLE]

(ii)

Assume $\bigl{\langle}e,M^{-1}\tilde{l}\bigr{\rangle}<0$ , where $\tilde{l}$ is given in Eq. 3.2, $a(x)$ satisfies Eq. 3.3, and ${\{L(t)\}_{t\geq 0}}$ is a pure-jump Lévy process (possibly with drift) satisfying $2\in\Theta_{\upnu}$ . Then, for any $v\in\widetilde{\mathfrak{U}}_{\mathrm{sm}}$ and $\theta\in[2,\theta_{\upnu}]\cap\Theta_{\upnu}$ , the assertions of Theorem 1.1 (i) and (ii) hold true with $\phi(t)=t^{\nicefrac{{(\theta-1)}}{{\theta}}}$ , $\eta=\theta-1$ , and ${\mathscr{V}}(x)$ as in (i).

(iii)

In addition to the assumptions in (ii) assume that $\tilde{\theta}_{\upnu}=\theta_{\upnu}\in(2,\infty)$ , where $\tilde{\theta}_{\upnu}$ is given in Eq. 3.4. Then, due to (ii), for any $v\in\widetilde{\mathfrak{U}}_{\mathrm{sm}}$ , ${\{X(t)\}_{t\geq 0}}$ admits a unique invariant $\uppi_{v}\in{\mathcal{P}}_{\theta_{\upnu}-1-\iota}({\mathbb{R}}^{n})$ for $\iota\in(0,\theta_{\upnu}-1)$ . Next, fix $\rho\in(0,(\theta_{\upnu}-2)\wedge 1)$ and $\varepsilon\in(\rho,1)$ . Then, for any $v\in\widetilde{\mathfrak{U}}_{\mathrm{sm}}$ such that $\varGamma v(x)=0$ a.e., $p\in[1,\theta_{\upnu}-\rho-1]$ and $\iota\in(0,1-\varepsilon)$ , there exist $\bar{c}>0$ and a diverging increasing sequence $\{t_{n}\}_{n\in{\mathbb{N}}}\subset[0,\infty)$ , depending on these parameters, such that Eq. 1.9 holds for the corresponding $\uppi_{v}(\mathrm{d}x)$ with $\theta=\theta_{\upnu}-\rho$ , $\vartheta=\theta-1$ , and ${\mathscr{V}}(x)$ as above.

Proof.

(i)

Observe first that in the case when ${\{L(t)\}_{t\geq 0}}$ is a rotationally invariant $\alpha$ -stable Lévy process or an anisotropic Lévy process consisting of independent one-dimensional symmetric $\alpha$ -stable components, $\Theta_{\upnu}=[0,\alpha)$ . In [6, Theorem 3 and the discussion after Theorem 5] it has been shown that for any $v\in\widetilde{\mathfrak{U}}_{\mathrm{sm}}$ and $\theta\in[1,\theta_{\upnu}]\cap\Theta_{\upnu}$ there exist $\bar{c}=\bar{c}(\theta,v)>0$ and $\tilde{c}=\tilde{c}(\theta,v)>0$ , such that

[TABLE]

It is easy to see that the above relation implies that there exist $r>0$ , $\hat{c}>0$ , and $\breve{c}>0$ , such that

[TABLE]

The assertion then follows from Theorem 1.1 (iii), together with [2, Proposition 4.3], [24, Theorem 3.4], and [79, Theorems 5.1 and 7.1].

(ii)

In Theorem 5 and the discussion following the proof of this theorem in [6] it has been shown that for any $v\in\widetilde{\mathfrak{U}}_{\mathrm{sm}}$ and $\theta\in(1,\theta_{\upnu}]\cap\Theta_{\upnu}$ there exist $r=r(\theta,v)>0$ , $\bar{c}=\bar{c}(\theta,v)>0$ , and $\tilde{c}=\tilde{c}(\theta,v)>0$ , such that

[TABLE]

It is easy to see that the above relation implies that there exist $\hat{r}>0$ , $\check{c}>0$ , and $\breve{c}>0$ , such that

[TABLE]

with ${\mathscr{V}}(x)$ given as above. The assertion now follows from Theorem 1.1 (i) and (ii), together with the results from [2, 24, 79] cited in part (i).

(iii)

Clearly, $\vartheta+\varepsilon>\theta_{\upnu}-1$ . Thus, according to [7, Lemma 5.7 (b)],

[TABLE]

The assertion now follows from (the proof of) (ii) (together with the results from [2, 24, 79] cited in part (i)), and Theorem 1.2 by setting $L(x)=\bar{\mathscr{V}}(x)$ and $\phi(t)=t^{\nicefrac{{(\theta-1)}}{{\theta}}}$ .∎

As discussed in Subsection 3.3, the hypothesis that $\tilde{\theta}_{\upnu}=\theta_{\upnu}$ is true if ${\{L(t)\}_{t\geq 0}}$ is a compound Poisson process (possibly with drift) with Lévy measure $\upnu_{L}(\mathrm{d}y)$ supported on a half-line of the form $\{tw\colon t\in[0,\infty)\}$ with $\langle e,M^{-1}w\rangle>0$ .

Ergodic properties in the $f$ -norm of piecewise Ornstein-Uhlenbeck processes with jumps under stationary Markov controls have been considered in [5, 6].

3.5 State-space models

Let $F\colon{\mathbb{R}}^{n}\to{\mathbb{R}}^{n}$ be continuous, and such that $\lvert F(x)\rvert\leq c\lvert x\rvert$ for some $c>0$ and all $x\in{\mathbb{R}}^{n}.$ Further, let $X(0)$ be an ${\mathbb{R}}^{n}$ -valued random variable, and let $\{W(k)\}_{k\in{\mathbb{N}}}$ be a sequence of i.i.d. ${\mathbb{R}}^{n}$ -valued random variables independent of $X(0)$ . Assume that the common distribution of $\{W(k)\}_{k\in{\mathbb{N}}}$ has a nontrivial absolutely continuous component which is bounded away from zero in a neighborhood of the origin. Then the Markov process defined by

[TABLE]

is irreducible, aperiodic, and all compact sets are petite (see [78, Proposition 5.2]). Further, assume that there exist constants $l\in{\mathbb{N}}$ , $l\geq 2$ , $\varepsilon\in(0,1)$ , and $\bar{c},r>0$ , such that

[TABLE]

Proposition 3.7.

Under the above assumptions, the assertions of Theorem 1.1 (i) and (ii) hold with ${\mathscr{V}}(x)=\lvert x\rvert^{l}$ , $\phi(t)=t^{\nicefrac{{(l-1)}}{{l}}}$ , and $\eta=l-1$ .

Proof.

In [78, Proposition 5.2] it has been proved that the Foster-Lyapunov condition in Eq. 1.1 holds with ${\mathscr{V}}(x)$ and $\phi(t)$ as above, and $C={\mathscr{B}}_{\bar{r}}$ for some $\bar{r}>0$ . The result now follows from Theorem 1.1 (i) and (ii). ∎
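As a quick check of the exponents in Proposition 3.7, composing the stated $\phi(t)$ with ${\mathscr{V}}(x)$ gives a pure power of $\lvert x\rvert$. The computation below uses only the two functions named in the proposition; reading the result as the source of the value $\eta=l-1$ is an interpretation, not a statement taken from Theorem 1.1.

```latex
\phi\bigl({\mathscr{V}}(x)\bigr)
  \,=\, \bigl(\lvert x\rvert^{l}\bigr)^{\nicefrac{(l-1)}{l}}
  \,=\, \lvert x\rvert^{\,l-1},
  \qquad x\in{\mathbb{R}}^{n}.
```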

Ergodic properties of state-space models in the $f$ -norm have been studied in [78, 32].

3.6 Backward recurrence time chain

Let $\{p_{i}\}_{i\geq 0}\subset(0,\infty)$ be such that $p_{0}=1$ , $p_{i}<1$ for $i\in{\mathbb{N}}$ , and $\prod_{j=0}^{i}p_{j}\to 0$ , as $i\to\infty$ . Let $\{X(k)\}_{k\geq 0}$ be a Markov process on $\{0,1,\dotsc\}$ defined by the transition kernel $p(i,i+1)=1-p(i,0)\coloneqq p_{i}$ for $i\geq 0$ . The process $\{X(k)\}_{k\geq 0}$ is irreducible and aperiodic, and it admits a unique invariant $\uppi\in{\mathcal{P}}(\{0,1,\dotsc\})$ if, and only if,

[TABLE]

In this case, $\uppi(0)=\uppi(1)=(2+c)^{-1}$ , and $\uppi(i)=(2+c)^{-1}\prod_{j=0}^{i-1}p_{j}$ for $i\geq 2$ .
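The product formula for $\uppi$ can be checked numerically on a truncated state space. The following is a minimal sketch, not part of the original argument: the sequence $p_{i}=i/(i+2)$, the truncation level `n`, and the rule that the last state resets to $0$ are all illustrative assumptions. It also takes $c$ to be the normalizing sum $\sum_{i\geq 2}\prod_{j=1}^{i-1}p_{j}$, an assumption consistent with the formula for $\uppi$ above; for this particular sequence that sum equals $1$.

```python
def invariant_law(p):
    """Normalized weights w(0) = 1, w(i) = p[0] * ... * p[i-1]."""
    weights = [1.0]
    for p_j in p[:-1]:
        weights.append(weights[-1] * p_j)
    total = sum(weights)
    return [w / total for w in weights]


def one_step(mu, p):
    """Push the law mu through one transition of the truncated chain."""
    n = len(p)
    out = [0.0] * n
    for i in range(n - 1):
        out[i + 1] += mu[i] * p[i]          # move up with probability p_i
        out[0] += mu[i] * (1.0 - p[i])      # reset to 0 otherwise
    out[0] += mu[n - 1]                     # truncation assumption: last state resets
    return out


# Hypothetical example sequence: p_0 = 1 and p_i = i / (i + 2) for i >= 1.
n = 1000
p = [1.0] + [i / (i + 2.0) for i in range(1, n)]

pi = invariant_law(p)
# Largest coordinate-wise change after one step; zero up to rounding if pi is invariant.
residual = max(abs(a - b) for a, b in zip(one_step(pi, p), pi))
```

Here `residual` vanishes up to rounding, `pi[0]` equals `pi[1]` because $p_{0}=1$, and `pi[0]` approaches $1/3=(2+c)^{-1}$ with $c=1$ as the truncation level grows.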

Proposition 3.8.

(i)

If there exist $i_{0}\in{\mathbb{N}}$ and $\alpha>1$ , such that $p_{i}=1-\frac{1+\alpha}{i}$ for $i\geq i_{0}$ , then the assertions of Theorem 1.1 (i) and (ii) hold with

[TABLE]

(ii)

Under the assumptions in (i), $\uppi\in{\mathcal{P}}_{\alpha-\iota}(\{0,1,\dotsc\})$ for $\iota\in(0,\alpha)$ . Next, fix $\rho\in(0,(\alpha-1)\wedge 1)$ and $\varepsilon\in[\rho,1)$ . Then, for every $p\in[1,\alpha-\rho]$ and $\iota\in(0,1-\varepsilon)$ there exist a positive constant $c$ and a diverging increasing sequence $\{t_{n}\}_{n\in{\mathbb{N}}}\subset[0,\infty)$ , depending on these parameters, such that Eq. 1.9 holds with ${\mathscr{V}}(i)$ as above, $\theta=1+\alpha-\rho$ , and $\vartheta=\alpha-\rho$ .

Proof.

(i)

In [25, Section 3] it has been shown that the Foster-Lyapunov condition in Eq. 1.1 holds with a Lyapunov function $\bar{\mathscr{V}}(i)$ which asymptotically behaves like ${\mathscr{V}}(i)$ , $\phi(t)$ as above, and $C$ being a finite set for any $\alpha>0$ and $\beta\in(0,1)$ . Taking into account Eq. 1.2, the assertion follows.

(ii)

From the assumptions on the sequence $\{p_{i}\}_{i\geq 0}$ we see that $\lim_{i\to\infty}i^{1+\alpha}\,\uppi(i)>0$ . Now, since $\vartheta+\varepsilon-1-\alpha\geq-1$ , we have $\sum_{i=0}^{\infty}i^{\vartheta+\varepsilon}\,\uppi(i)\,=\,\infty$ . The assertion now follows from Theorem 1.2 by taking $L(i)=i$ .∎
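The divergence used in part (ii) can be spelled out in one line. This is a sketch that assumes only the tail bound $\uppi(i)\geq\kappa\,i^{-1-\alpha}$ for $i\geq i_{1}$, which follows from the positive limit above; the names $\kappa>0$ and $i_{1}\in{\mathbb{N}}$ are introduced here for illustration. It uses $\vartheta=\alpha-\rho$ and $\varepsilon\geq\rho$.

```latex
\vartheta+\varepsilon-1-\alpha
  \,=\, (\alpha-\rho)+\varepsilon-1-\alpha
  \,=\, \varepsilon-\rho-1 \,\geq\, -1
\quad\Longrightarrow\quad
\sum_{i\geq i_{1}} i^{\vartheta+\varepsilon}\,\uppi(i)
  \,\geq\, \kappa\sum_{i\geq i_{1}} i^{\varepsilon-\rho-1}
  \,\geq\, \kappa\sum_{i\geq i_{1}} \frac{1}{i}
  \,=\, \infty .
```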

4 Concluding Remarks

We remark on some other approaches in the study of exponential or subexponential ergodicity of Markov processes. By analyzing polynomial moments of hitting times of compact sets directly, polynomial ergodicity results are established in [80, Theorem 6] for a class of irreducible (with respect to the Lebesgue measure) and aperiodic diffusion processes. In a follow-up work, by using analogous techniques, the same author established polynomial ergodicity of a class of diffusion processes without directly assuming irreducibility and aperiodicity of the process, but employing instead a so-called (local) Dobrushin condition (also known as Markov-Dobrushin condition) [81, Theorem 6]. This approach is based on a Foster-Lyapunov condition of the form Eq. 1.1, and instead of assuming irreducibility and aperiodicity of ${\{X(t)\}_{t\geq 0}}$ , it is assumed that (i) ${\mathscr{V}}(x)$ has precompact sub-level sets, and (ii) for every $\delta>0$ there exists $t_{\delta}\in{\mathbb{T}}\setminus\{0\}$ such that

[TABLE]

(see [53, Chapter 3]). Observe that this condition actually means that for each $(x,y)$ satisfying ${\mathscr{V}}(x)+{\mathscr{V}}(y)\leq\delta$ the probability measures $p(t_{\delta},x,\mathrm{d}z)$ and $p(t_{\delta},y,\mathrm{d}z)$ are not mutually singular. Intuitively, the Dobrushin condition encodes irreducibility and aperiodicity of ${\{X(t)\}_{t\geq 0}}$ , and petiteness of sub-level sets of ${\mathscr{V}}(x)$ . Based on these assumptions, and using an appropriate Markov coupling of ${\{X(t)\}_{t\geq 0}}$ , it follows that the $\Phi^{-1}$ -modulated moment of the corresponding coupling time is finite and controlled by ${\mathscr{V}}(x)+{\mathscr{V}}(y)$ . This then implies (sub)geometric ergodicity of ${\{X(t)\}_{t\geq 0}}$ in the total variation norm (see [38, Theorem 4.1] or [53, Chapter 3]).

We remark that irreducibility and aperiodicity (together with Eq. 1.1) imply that the Dobrushin condition holds on the Cartesian product of any petite set with itself. Namely, according to [65, Proposition 6.1], for any petite set $C$ there exists $t_{C}\in{\mathbb{T}}\setminus\{0\}$ such that for the measure $\upchi(\mathrm{d}t)$ (in the definition of petiteness) the Dirac measure in $t_{C}$ can be taken (together with some non-trivial measure $\upnu_{\upchi}(\mathrm{d}x)$ ). Thus, $p(t_{C},x,B)\geq\upnu_{\upchi}(B)$ for any $x\in C$ and $B\in\mathfrak{B}(\mathbb{X})$ , which implies that

[TABLE]
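The step from the minorization $p(t_{C},x,B)\geq\upnu_{\upchi}(B)$ to a Dobrushin-type bound can be illustrated on a finite state space. The sketch below is only an illustration: the 4-state kernel `P` and the set `C` are hypothetical, and total variation is normalized as half the $\ell^{1}$ distance, which may differ from the convention used in Eq. 4.1. With this normalization, two rows that both dominate a common measure of mass $m$ are at distance at most $1-m$, hence not mutually singular whenever $m>0$.

```python
def minorizing_measure(P, C):
    """Largest measure nu with P[x][j] >= nu[j] for every x in C."""
    return [min(P[x][j] for x in C) for j in range(len(P[0]))]


def tv_distance(mu, nu):
    """Total variation as half the l1 distance (one common convention)."""
    return 0.5 * sum(abs(a - b) for a, b in zip(mu, nu))


# Hypothetical one-step kernel on {0, 1, 2, 3}; each row sums to 1.
P = [
    [0.5, 0.3, 0.2, 0.0],
    [0.2, 0.5, 0.2, 0.1],
    [0.3, 0.2, 0.4, 0.1],
    [0.0, 0.0, 0.5, 0.5],
]
C = [0, 1, 2]  # plays the role of the petite set

nu = minorizing_measure(P, C)
mass = sum(nu)  # total mass of the common component
worst_tv = max(tv_distance(P[x], P[y]) for x in C for y in C)
```

For this kernel the common component has mass `mass`, and `worst_tv` stays below `1 - mass`, so any two rows indexed by `C` overlap.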

If, in addition, ${\{X(t)\}_{t\geq 0}}$ is $C_{b}$ -Feller (i.e. $x\mapsto\int_{\mathbb{X}}f(y)\,p(t,x,\mathrm{d}y)$ is continuous and bounded for any $t\in{\mathbb{T}}$ and any continuous and bounded function $f(x)$ ), and the support of the corresponding irreducibility measure has nonempty interior, then every compact set is petite (see [79, Theorems 5.1 and 7.1]) and thus Eq. 4.1 holds for any bounded set $C$ . This shows that, at least in this particular situation, the approach based on the Dobrushin condition is more general than the approach based on irreducibility and aperiodicity. Situations where it has a clear advantage are discussed in [54, 1]. In [54], the author considers a Markov process obtained as a solution to a Lévy-driven SDE with highly irregular coefficients and noise term, while in [1] the authors consider a diffusion process with highly irregular (discontinuous) drift function and uniformly elliptic diffusion coefficient. In these concrete situations it is not clear whether one can obtain irreducibility and aperiodicity of the processes, whereas the authors obtain Eq. 4.1 for any compact set $C$ (see [54, Theorem 1.3] and [1, Lemma 3]). For additional results on ergodic properties of Markov processes based on the Dobrushin condition we refer the reader to [38, 53].

Acknowledgements

We thank the anonymous referee for the helpful comments that have led to significant improvements of the results in the article. This research was supported in part by the Army Research Office through grant W911NF-17-1-001, and in part by the National Science Foundation through grants DMS-1715210, CMMI-1635410, and DMS-1715875. Financial support through the Alexander von Humboldt Foundation (No. HRV 1151902 HFST-E) and the Croatian Science Foundation under the project 8958 (for N. Sandrić) is gratefully acknowledged.

Bibliography


[1] Abourashchi, N. and Veretennikov, A. Yu. (2010). On stochastic averaging and mixing. Theory Stoch. Process. 16 111–129. MR2779833
[2] Albeverio, S., Brzeźniak, Z. and Wu, J.-L. (2010). Existence of global solutions and invariant measures for stochastic differential equations driven by Poisson type noise with non-Lipschitz coefficients. J. Math. Anal. Appl. 371 309–322. doi:10.1016/j.jmaa.2010.05.039. MR2661009
[3] Andrieu, C., Fort, G. and Vihola, M. (2015). Quantitative convergence rates for subgeometric Markov chains. J. Appl. Probab. 52 391–404. doi:10.1239/jap/1437658605. MR3372082
[4] Arapostathis, A., Borkar, V. S. and Ghosh, M. K. (2012). Ergodic control of diffusion processes. Encyclopedia of Mathematics and its Applications 143. Cambridge University Press, Cambridge. MR2884272
[5] Arapostathis, A., Hmedi, H. and Pang, G. (2020). On uniform exponential ergodicity of Markovian multiclass many-server queues in the Halfin-Whitt regime. Math. Oper. Res. (to appear).
[6] Arapostathis, A., Hmedi, H., Pang, G. and Sandrić, N. (2019). Uniform polynomial rates of convergence for a class of Lévy-driven controlled SDEs arising in multiclass many-server queues. In Modeling, stochastic control, optimization, and applications. IMA Vol. Math. Appl. 164
[7] Arapostathis, A., Pang, G. and Sandrić, N. (2019). Ergodicity of a Lévy-driven SDE arising from multiclass many-server queues. Ann. Appl. Probab. 29 1070–1126. doi:10.1214/18-AAP1430. MR3910024
[8] Arisawa, M. (2009). Homogenization of a class of integro-differential equations with Lévy operators. Comm. Partial Differential Equations 34 617–624. doi:10.1080/03605300902963518. MR2560294
