Lyapunov criteria for uniform convergence of conditional distributions of absorbed Markov processes
Nicolas Champagnat, Denis Villemonais

TL;DR
This paper develops Lyapunov criteria to analyze the uniform convergence of conditional distributions in absorbed Markov processes, including Lotka-Volterra and Feller diffusions, advancing understanding of their quasi-stationary behavior.
Contribution
It introduces novel non-linear Lyapunov criteria involving two functions, applicable to a broad class of Markov processes with absorption.
Findings
Criteria apply to Lotka-Volterra birth and death processes
Criteria extend to Feller diffusions with interactions
Results establish conditions for uniform convergence of conditional distributions
Abstract
We study the quasi-stationary behavior of multidimensional processes absorbed when one of the coordinates vanishes. Our results cover competitive or weakly cooperative Lotka-Volterra birth and death processes and Feller diffusions with competitive Lotka-Volterra interaction. To this aim, we develop original non-linear Lyapunov criteria involving two Lyapunov functions, which apply to general Markov processes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
11footnotetext: Université de Lorraine, IECL, UMR 7502, Campus Scientifique, B.P. 70239, Vandœuvre-lès-Nancy Cedex, F-54506, France22footnotetext: CNRS, IECL, UMR 7502, Vandœuvre-lès-Nancy, F-54506, France33footnotetext: Inria, TOSCA team, Villers-lès-Nancy, F-54600, France.
E-mail: [email protected], [email protected]
Lyapunov criteria for uniform convergence of conditional distributions of absorbed Markov processes
Nicolas Champagnat1,2,3, Denis Villemonais1,2,3
Abstract
We study the uniform convergence to quasi-stationarity of multidimensional processes absorbed when one of the coordinates vanishes. Our results cover competitive or weakly cooperative Lotka-Volterra birth and death processes and Feller diffusions with competitive Lotka-Volterra interaction. To this aim, we develop an original non-linear Lyapunov criterion involving two functions, which applies to general Markov processes.
Keywords: stochastic Lotka-Volterra systems; multitype population dynamics; multidimensional birth and death process; multidimensional Feller diffusions; process absorbed on the boundary; quasi-stationary distribution; uniform exponential mixing property; Lyapunov function
2010 Mathematics Subject Classification. Primary: 60J27; 37A25; 60B10. Secondary: 92D25; 92D40.
1 Introduction
We consider a Markov process evolving in a state space , where and is absorbing. A quasi-stationary distribution for is a probability measure on such that
[TABLE]
where is the first hitting time of and, for any probability measure on , is the law of with initial distribution .
Our goal is to provide a computational method, taking the form of a nonlinear Lyapunov type condition (sometimes also referred to as drift condition) ensuring the existence and uniqueness of a quasi-stationary distribution and the uniform convergence in total variation of the law of given when to this quasi-stationary distribution, which means that there exist two constants such that
[TABLE]
for all initial distribution on , where is the usual total variation distance on the set of finite, signed measures on , defined by . We apply this result to two standard models in ecology and evolution, called Lotka-Volterra (or logistic) birth-death or diffusion processes [27, 6, 7], which have attracted a lot of attention in the past and for which the question of uniform convergence toward a quasi-stationary distribution remains largely open in the multi-dimensional case.
Practical (linear) Lyapunov type criteria for convergence to quasi-stationary distributions were also developed in [12]. However, these results usually only entail non-uniform convergence with respect to the initial condition. In the applications we consider here, the results of Sections 4 and 5 in [12] would ensure the existence of two positive functions and on such that
[TABLE]
for all initial distribution on . In the cases considered below, the function may be taken bounded, but is usually not bounded away from zero close to the boundary of , which leads to a non-uniform convergence result.
As may be expected, the stronger convergence result (1.1) requires a finer control of the behavior of the process near the boundary, uniformly in . As a consequence, our Lyapunov criteria are more involved than those of [12] and the techniques used here differ: in the present article, we use the control of the derivative of the continuous-time semi-group of the process to localize the conditioned process when it starts close to the boundary (see Proposition 2.3), while [12] makes use of the time-discretisation of the semi-group for this purpose. Additional arguments are also required to control the behavior of the conditioned process close to infinity. Note however that, in the end, our approach relies on the discrete time criterion of [9] and it is tempting to think that one may use a combination of the present approach and of the discrete time criterion of [12] in order to prove (1.2) with . Although this would lead to more complicated criteria than the one presented below, such an approach could also apply to processes that do not come down from infinity, such as Orstein-Uhlenbeck processes, with uniform convergence among initial distributions in bounded subsets. We leave this question for future research.
The uniform convergence (1.1) is also transferred to other properties of the process. Indeed, it provides uniformity in the time to the so-called mortality/extinction plateau (see [30]), uniform convergence of the process conditioned to late survival to the so-called -process, uniform exponential ergodicity of the -process (see [13]), and uniform convergence of to an eigenfunction for the semigroup, for some positive constant , when (see [9, Theorem 2.5]).
One of the main tools of our proofs is [9], where we showed that the uniform exponential convergence of conditional distributions in total variation to a unique quasi-stationary distribution is equivalent to the following conditions: there exists a probability measure on such that
- (A1)
there exist such that for all ,
[TABLE]
- (A2)
there exists such that for all and ,
[TABLE]
Although it has the merit of generality, this criterion appears to be hard to check in practice [13, 5, 14, 11, 16, 23]. In particular, it lacks computational methods for verification. This is one of the purposes of the new criterion we present. It involves two bounded nonnegative functions and such that when converges to the boundary of or to , satisfying
[TABLE]
for some bounded subset of and
[TABLE]
for some and some constants , where denotes (an extension of) the infinitesimal generator of the Markov process .
We apply this criterion to Lotka-Volterra birth and death processes, and to competitive Lotka-Volterra Feller diffusion processes. The quasi-stationary behavior of (extensions of) these two models have received a lot of attention in the one-dimensional case [34, 33, 25, 29, 3, 14, 15, 22]. We focus here on the multidimensional case, where the processes evolve on the state spaces for birth and death processes (with ) and for diffusion processes, with , and where absorption corresponds to the extinction of a single population. This means that and (where ) for multidimensional birth and death processes and and for multidimensional diffusions. Non-uniform exponential convergence to quasi-stationary distributions for such processes can be obtained using [12], [35] or [20].
Remark 1*.*
The case where absorption corresponds to the extinction of the whole population, i.e. , can be handled combining the results of the present paper and those known in the one-dimensional case [13, 14] following the methods of [4, Thm. 1.1]. This case was also considered in [16].
A Lotka-Volterra birth and death process in dimension is a Markov process on with transition rates from to in given by
[TABLE]
where where the 1 is at the -th coordinate. Note that the set is absorbing for the process. We make the usual convention that
[TABLE]
From the biological point of view, the constant is the birth rate per individual of type , the constant is the death rate per individual of type , is the rate of death of an individual of type from competition with an individual of type , and is the rate of birth of an individual of type from cooperation with (or predation of) an individual of type . In general, a Lotka-Volterra process could be explosive if some of the are positive, but the assumptions of the next theorem ensure that it is not the case and that the process is almost surely absorbed in finite time.
Theorem 1.1**.**
Consider a competitive Lotka-Volterra birth and death process in as above. Assume that the matrix defines a positive operator on in the sense that, for all , . Then the process has a unique quasi-stationary distribution and there exist constants such that, for all probability measures on ,
[TABLE]
An important difficulty here is the fact that the absorption rate (i.e. the rate of jump from a state in to a state in ) is not bounded. Birth and death processes with bounded absorption rates are much easier to study, cf. e.g. [9]. The existence of a quasi-stationary distribution for this kind of multi-dimensional birth and death processes can also be obtained using the theory of -positive matrices, as exposed in [19], but without the uniform exponential convergence (1.1).
A competitive Lotka-Volterra Feller diffusion process in dimension is a Markov process on , where , is a solution of the stochastic differential equation
[TABLE]
where are independent standard Brownian motions. The Brownian terms and the linear drift terms correspond to classical Feller diffusions, and the quadratic drift terms correspond to Lotka-Volterra interactions between coordinates of the process. The variances per individual are positive numbers, and the growth rates per individual can be any real number, for all . The parameters are assumed nonnegative for all , which corresponds to competitive Lotka-Volterra interaction. It is well known that, in this case, there is global strong existence and pathwise uniqueness for the SDE (1.3), and that it is almost surely absorbed in finite time in if for all (see [4] and Section 5).
Theorem 1.2**.**
Consider a competitive Lotka-Volterra Feller diffusion in as above. Assume that for all . Then the process has a unique quasi-stationary distribution and there exist constants such that, for all probability measures on ,
[TABLE]
This results was previously known in dimension 2 when the constants and satisfy , which implies that the process (after some transformations) is a Kolmogorov diffusion (i.e. of the form for some Brownian motion and some function , see [4]). Our result is valid in any dimension and has no restriction on the coefficients. One can also expect to extend our result to cooperative cases (e.g. with and as in [4]) by using our abstract Lyapunov criterion with functions combining those used to prove Theorems 1.1 and 1.2. Another motivations of our study comes from [1], where the coming down from infinity of Lotka-Volterra Feller diffusions is studied. It appears that such processes may go extinct far from compact sets for very large initial conditions. Theorem 1.2 proves that this does not prevent the conditioned process to come back to compact sets fast.
Our general non-linear Lyapunov criterion, Theorem 2.4, is stated in Section 2 and proved in Section 3. Sections 4 and 5 are devoted to the study of (extensions of) competitive Lotka-Volterra birth and death processes and competitive Lotka-Volterra Feller diffusions and to the proofs of Theorems 1.1 and 1.2.
Since the publication of the first version of this preprint on Arxiv, the criteria presented here have been applied to other models, including diffusion models of deadlocks in distributed algorithms [8] and stochastic reaction networks [21], with a weakened condition for birth and death processes.
2 A general Lyapunov criterion for uniform exponential convergence of conditional distributions
Our general framework is inspired from [31].
2.1 Definitions and notations
We consider a càdlàg (right continuous with left limits) time-homogeneous strong Markov process with state space , which is assumed to be a metric space with distance function , equipped with its Borel -fields and such that is measurable and . In what follows, is a fixed arbitrary point in and we define the open sets
[TABLE]
for all .
We assume that , that is an absorbing set for the process and we introduce
[TABLE]
We assume that a.s. and that the process is regularly absorbed, in the sense that
[TABLE]
and that, for all and , .
We also assume that, for any closed set , the entry time in defined by , are -stopping times.111One can easily adapt our proofs to cases where entry times in other sets (e.g. open) are strong Markov times for the process. In particular, since is closed, and thus are -stopping times.
We shall make use of the following weakened notion of generator for , inspired from [31] and which extends the usual weak infinitesimal generator [18].
Definition 2.1**.**
We say that a measurable function belongs to the domain of the weakened generator of if there exists a measurable function such that, for all , and ,
[TABLE]
and we define on . We also define for all .
Due to the local form of the above Dynkin formula, it is much easier to check that belongs to than to the usual domain of the weak infinitesimal generator.
We also define the set of admissible functions to which our Lyapunov criterion applies. We will say that a measurable function on or is locally bounded if it is bounded on for all .
Definition 2.2**.**
We say that a couple of functions and measurable from to is an admissible couple of functions if
(i)
* and are bounded, identically [math] on , positive on , and and are locally bounded on . *
(ii)
We have the convergences
[TABLE]
and
[TABLE]
(iii)
* and belong to the domain of the weakened generator of , is bounded from above and is bounded from below.*
The main point of this definition of admissible functions is the following result, whose technical proof is postponed to Section 3.
Proposition 2.3**.**
Assume is a couple of admissible functions. Then, for all and ,
[TABLE]
and
[TABLE]
2.2 A non-linear Lyapunov criterion
In the following assumption, we need a pair of functions, while usual drift conditions of Foster-Lyapunov criteria only use one (see for instance [31]). Roughly speaking, the function is used to control return time in compact sets from neighborhood of the boundary, while the second one, , is used to control the absorption rate. One of the main difficulty when checking the following assumption, is that needs also to be related to the absorption probability, via the inequality (2.7).
Assumption 1**.**
There exist constants , such that
[TABLE]
and there exist constants and such that
[TABLE]
The main role of the first part in the above assumption is to bound the derivative computed in Proposition 2.3, which will allow us to show that the quantity is uniformly bounded in for large enough. The second part of this assumption is needed to check that the boundedness of implies that, conditionally on , with high probability for large. This implies that the problem can be localized in this set , so that, in order to check Conditions (A1–A2), it is enough to assume the following local versions of (A1–A2), which are much easier to check.
Assumption 2**.**
There exists a probability measure on such that, for all , there exist and satisfying
[TABLE]
In addition, for all , there exists a constant such that, for all ,
[TABLE]
The Lyapunov criterion of Assumption 1 will be useful to check that the conditioned process comes back quickly in bounded subsets of from any neighborhood of the boundary. However, it does not imply this property for initial distribution in a neighborhood of infinity. This is the purpose of the next assumption. Since it only concerns the unconditioned process, it may be proved using usual drift conditions or probabilistic arguments, as we illustrate in the two examples of Sections 4 and 5.
Assumption 3**.**
For all , there exists such that
[TABLE]
where .
We can now state our main general result. Its proof is given in Section 3.
Theorem 2.4**.**
Assume that the process is regularly absorbed and that there exists a couple of admissible functions satisfying Assumption 1. Assume also that Assumptions 2 and 3 are satisfied. Then the process admits a unique quasi-stationary distribution and there exist constants such that for all probability measure on ,
[TABLE]
3 Proof of the results of Section 2
We first give the proof of Proposition 2.3 in Subsection 3.1 and the proof of Theorem 2.4 in Subsection 3.2. The proofs of two technical lemmas are given in Subsections 3.3 and 3.4.
3.1 Proof of Proposition 2.3
In what follows, we use the classical definition of the integral of a signed function with respect to a positive measure by (where and denote respectively the positive and negative parts of ), which is well defined as soon as at least one of the two terms is finite. Classical results (as Lebesgue’s theorem, Fatou’s lemma and Fubini’s theorem) still hold in this case.
Using the Definition 2.1 of the weakened infinitesimal generator, we have for all
[TABLE]
Note that
[TABLE]
Using (2.1), the Assumption (2.4) and that is uniformly bounded, Lebesgue’s theorem implies that
[TABLE]
Now Fatou’s lemma applied to the right-hand side of (3.1) (using that is bounded from above) gives
[TABLE]
where we have used Fubini’s theorem for the last inequality. Since and is bounded from above, we deduce that and that . Therefore, we can actually apply Lebesgue’s Theorem to the right-hand side of (3.1) and hence
[TABLE]
The same argument applies to (note that (2.3) and (2.4) imply that a.s.):
[TABLE]
Therefore and are continuous with respect to and (cf. e.g. [2, Lem. VIII.2]), for all , belongs to the Sobolev space (the set of functions from to in admitting a derivative in the sense of distributions in ).
In particular, since and hence for all , we deduce from the continuity of that . Therefore, we deduce from standard properties of functions [2, Cor. VIII.9 and Cor. VIII.10] that also belongs to and admits as derivative
[TABLE]
Hence we have proved (2.5).
3.2 Proof of Theorem 2.4
The proof is based on two lemmas. The first one combines Proposition 2.3 and Assumption (2.6) to give uniform (in ) controls on for large enough. Its proof is given in Subsection 3.3.
Lemma 3.1**.**
There exists two positive constants and such that, for all and all ,
[TABLE]
In particular, there exists such that, for all and all ,
[TABLE]
The second lemma makes use of Assumption (2.7) to deduce from Lemma 3.1 the following inequality. Its proof is given in Subsection 3.4
Lemma 3.2**.**
There exist and a constant such that, for all and all ,
[TABLE]
We prove Theorem 2.4 by checking that the two conditions (A1) and (A2) in the introduction are satisfied (cf. [13]).
*Step 1: Proof of (A1).
*We first remark that there exists such that (where and below are from Assumption 2) and hence such that, for all , all and all ,
[TABLE]
where we used Markov’s property and an induction procedure over . Hence we can assume without loss of generality that, for all , (where is from Assumption 1).
As a consequence, Assumption (2.7), Inequality (3.6) and Markov’s property entail that, for all ,
[TABLE]
On the other hand,
[TABLE]
Setting , the two above equations imply that, for all ,
[TABLE]
and hence that Assumption (A1) holds true.
*Step 2: Proof of (A2).
*Using (3.7), for all , all and all , we obtain that, for all ,
[TABLE]
Setting and using inequality (2.8) of Assumption 2, we deduce that, for all and all ,
[TABLE]
Now, we apply Assumption 3 for as defined above: there exists such that
[TABLE]
Note that, since is càdlàg, on the event . Hence, using (3.9) and the strong Markov property at time (which is a stopping time since it is the entry time in a closed set), for all ,
[TABLE]
Note that, if , then and . Thus, for all , . Hence, using (3.8) twice, we have for all
[TABLE]
Since , we have , so the last inequality implies (A2).
3.3 Proof of Lemma 3.1
*Step 1: Proof of (3.4).
*Fix and . On the one hand, it follows from (2.6) that
[TABLE]
and hence
[TABLE]
On the other hand, we deduce from (2.6) that
[TABLE]
where we used Hölder’s inequality to deduce that
[TABLE]
Now, because of Assumption (2.3), there exists large enough such that, for all ,
[TABLE]
Therefore, for such a value of ,
[TABLE]
Finally, we obtain from the last inequality, (3.10) and (3.11) that there exists two positive constants such that
[TABLE]
*Step 2: Proof of (3.5).
*We define and . Propositions 2.3 and (3.4) imply that, for all and all ,
[TABLE]
Since , this implies that, for all , there exists such that for any . We prove this by contradiction: assume on the contrary that for all , . Then, for all ,
[TABLE]
Integrating this differential inequality up to time entails
[TABLE]
which gives a contradiction.
Hence, using Proposition 2.3 and (3.4) again, we deduce that, for all and all ,
[TABLE]
Using the same argument repetitively between time (instead of [math]) and (instead of ) gives the result for all time .
3.4 Proof of Lemma 3.2
Set . Equation (2.3) allows us to fix such that
[TABLE]
Lemma 3.1 implies that, for all ,
[TABLE]
Therefore,
[TABLE]
Since , we deduce that there exists a constant such that
[TABLE]
4 Application to multidimensional birth and death processes absorbed when one of the coordinates hits 0
We consider general multitype birth and death processes in continuous time, taking values in for some . Let be a Markov process on with transition rates
[TABLE]
for all , with , where the nonzero coordinate is the -th one, and are functions from to . This model represents a density-dependent population dynamics with types of individuals (say species), where (resp. ) represents the reproduction rate (resp. death rate) per individuals of species when the population is in state .
Note that the forms of the birth and death rates imply that, once a coordinate of the process hits 0, it remains equal to 0. This corresponds to the extinction of the population of type . Hence, the set is absorbing for the process .
We define for all
[TABLE]
where . We shall assume
Assumption 4**.**
There exists small enough such that, for all large enough,
[TABLE]
and
[TABLE]
Note that, since the set is finite for all , it is standard to check that any function is in the domain of the weakened infinitesimal generator of and, for all ,
[TABLE]
Under Assumption (4.2), setting , we have
[TABLE]
This classically entails that
[TABLE]
(The argument is very similar to the one used for Lemma 3.1.) In particular, the process is non-explosive and is finite almost surely. Therefore, the process is regularly absorbed, as defined in Section 2.1. We can now state the main result of the section.
Theorem 4.1**.**
Under Assumption 4, the multi-dimensional competitive birth and death process absorbed when one of its coordinates hits 0 admits a unique quasi-stationary distribution and there exist constants such that, for all probability measure on ,
[TABLE]
As will appear in the proof below, to check our conditions, it is sufficient to take functions and of the form for some . More precisely, the first part of Condition (2.6) can be checked for with decreasing and the second part for with increasing , to take advantage of the drift of the birth and death process towards 0 for large . However, let us emphasize that, although the choice (i.e. ) is natural, it cannot satisfy the first part of Condition (2.6), since in this case (the absorption rate) is unbounded. The direct study of is possible, but only allows to recover particular cases of Theorem 4.1 (cf. [10, Section 4]). Our criterion is more flexible and better adapted to multidimensional birth and death models of interacting populations.
It is easy to check that Assumption 4 is satisfied in the general Lotka-Volterra birth and death process of the introduction. Indeed, we clearly have and
[TABLE]
for . Under the assumptions of Theorem 1.1, there exists such that, for all ,
[TABLE]
This entails Assumption 4.
Proof of Theorem 4.1.
Using (4.3) and copying the arguments of [13, Sec. 4.1.1 and Thm. 4.1], one deduces that Assumption 2 is satisfied with and that Assumption 3 is also satisfied.
Hence we only have to find a couple of admissible functions satisfying Assumption 1. This couple of functions is given for all by
[TABLE]
and
[TABLE]
for appropriate choices of . Note that the two functions are bounded, nonnegative and positive on . Note also that, since (taking ), so Conditions (i-ii) of Definition 2.2 are clearly satisfied (in this discrete state space case, the condition (2.4) is trivial since, almost surely, for all large enough). Note also that, since , Condition (2.7) is also obviously satisfied.
Hence, we only have to check (2.6) since this of course implies that and satisfy Point (iii) of Definition 2.2. So we compute
[TABLE]
where we used the fact that
[TABLE]
Hence it follows from Assumption (4.1) that there exists large enough such that for all large enough. This entails the first inequality in (2.6).
We fix such a value of . Using that
[TABLE]
and
[TABLE]
for large enough, we compute for such
[TABLE]
where . Choosing and , Assumption (4.2) implies that for with large enough. Since , we have the second inequality in (2.6). ∎
5 Application to multidimensional Feller diffusions absorbed when one of the coordinates hits 0
We consider a general multitype Feller diffusion in , solution to the stochastic differential equation
[TABLE]
where are independent standard Brownian motions, are positive constants and are measurable maps from to . From the biological point of view, represents the growth rate per individual of species in a population of size vector . We shall make the following assumption.
Assumption 5**.**
Assume that, for all , is locally Hölder on and that there exist and such that
[TABLE]
and there exist constants , and such that
[TABLE]
This assumption implies in particular the non-explosion, strong existence and pathwise uniqueness for (5.1). Indeed, since is locally Hölder, standard arguments entail the strong existence and pathwise uniqueness for (5.1) up to the explosion time. Now, Assumption (5.2) and standard comparison results for one-dimensional diffusion processes (see e.g. Theorem 1.1 in [24, Chapter VI]) entail that each coordinate of the process can be upper bounded by the solution of the one-dimensional Feller diffusion
[TABLE]
with initial value . Since is a diffusion on for which is an entrance boundary and [math] an exit boundary, we deduce that each coordinate is non-explosive and hence that the unique solution to (5.1) is non-explosive. Moreover, the subset is an absorbing boundary for the diffusion process (5.1), so the process is regularly absorbed in the sense of Section 2.2.
Strong existence and pathwise uniqueness imply well-posedness of the martingale problem, hence the strong Markov property hold on the canonical space with respect to the natural filtration (see e.g. [32]). Since the paths of are continuous, the hitting times of closed subsets of are stopping times for this filtration. In addition, it follows from Itô’s formula and the local boundedness of the coefficients of the SDE that any measurable function twice continuously differentiable on belongs to the domain of the weakened generator of and
[TABLE]
We can now state the main result of the section.
Theorem 5.1**.**
Under Assumption 5, the multi-dimensional Feller diffusion process absorbed when one of its coordinates hits 0 admits a unique quasi-stationary distribution and there exist constants such that, for all probability measure on ,
[TABLE]
It is straightforward to check that Assumption 5 is satisfied in the competitive Lotka-Volterra case, that is when with and for all . Hence Theorem 1.2 is an immediate corollary of Theorem 5.1. Assumption 5 allows for other biologically relevant models. For instance, one can consider ecosystems where the competition among individuals only acts when the population size reaches a level , which leads for instance to the SDE
[TABLE]
where are non-negative constants and for all . Note also that a similar approach (i.e. using Theorem 2.4 with Lyapunov functions of the form ) can also be used to handle diffusion processes evolving in bounded boxes.
Before giving the proof of Theorem 5.1, let us give some intuition about the choice of functions and . The behavior of the process when one of the coordinates is close to 0 is similar to the behavior of a one-dimensional diffusion absorbed at 0 started close to 0. This suggests to look for Lyapunov functions of the form and where the functions and satisfy our criterion for one-dimensional diffusions like (5.4), say
[TABLE]
with generator . In the one dimensional case, Conditions (2.6) are restrictive only close to 0 and close to . When , assuming and when is close to 0, we obtain
[TABLE]
In particular, to satisfy the first part of (2.6) close to 0, we need , and to satisfy the second part of (2.6), we need that
[TABLE]
in the neighborhood of 0. This holds true if and . Similarly, assuming and close to , with , we obtain
[TABLE]
For such functions and close to , the first part of (2.6) is always satisfied and the second part of (2.6) requires
[TABLE]
when . This holds true if .
This gives the intuition to check our criterion in dimension 1222Note that more efficient criteria exist for one-dimensional diffusions (see [14]). The interest of our criterion comes from the fact that it applies to multi-dimensional processes.. The multi-dimensional case of Theorem 5.1 requires a more careful study, since we need to consider cases where some of the coordinates of the process are close to 0 or , and the others belong to some compact set.
Proof of Theorem 5.1.
Up to a linear scaling of the coordinates, we can assume without loss of generality that for all , so we will only consider this case from now on. Note that Assumption 5 is not modified by the rescaling (up to appropriate changes of the constants , , and ).
We divide the proof into five steps, respectively devoted to the construction of a function satisfying the first inequality in (2.6), of a function satisfying the second inequality in (2.6), to the proof of (2.7), to the proof of a local Harnack inequality (needed to check Assumption 2), and the proofs of Assumption 2 and of Assumption 3.
Step 1: construction of a function satisfying the first inequality in (2.6).
Recall the definition of the constants and from Assumption 5. We use the following lemma, whose proof is left to the reader (see Figure 1 for a typical graph of ).
Lemma 5.2**.**
There exists such that, for all , there exists a function twice continuously differentiable on such that
[TABLE]
* for all , is nonincreasing and convex on ,*
[TABLE]
We set and
[TABLE]
We have
[TABLE]
Now, it follows from the properties of and Assumptions 5 that, for all and all ,
[TABLE]
where we used in the last inequality the fact that for all . Using once again this property, we deduce that, for some constant independent of ,
[TABLE]
Hence, for all ,
[TABLE]
This and Assumption (5.3) imply that
[TABLE]
for some constant , where we used Assumption (5.2) in the last inequality.
Hence, there exist and a constant such that
[TABLE]
since (we equip with the distance). This ends the proof that satisfies the first inequality in (2.6).
Step 2: construction of a function satisfying (2.6) and verification that is a couple of admissible functions.
For , we define
[TABLE]
where the function is twice continuously differentiable on , increasing concave and such that
[TABLE]
for some constants and and where is defined in Assumption (5.2). Since and , it is possible to find such that such a function exists as soon as . Hence, we shall assume that belongs to the non-empty interval . We have
[TABLE]
and
[TABLE]
We deduce from Assumptions (5.2) that there exist constants such that
[TABLE]
Thus, since for some constant and since ,
[TABLE]
Therefore, choosing such that and , for all such that is small enough or is big enough. Since is bounded from above by (5.6) and since and are positive continuous on any compact subset of , we have proved Condition (2.6).
We can now check that is a couple of admissible functions. First, and are both bounded, positive on and vanishing on . They both belong to the domain of the weakened infinitesimal generator of . Since the function is positive continuous on and
[TABLE]
we deduce that (2.3) holds true. Condition (2.4) is also clear since almost surely. Finally, since is bounded from above and is bounded from below, we have proved that is a couple of admissible functions.
Step 3: proof of (2.7).
Using the upper bound for all and , where is solution to the SDE (5.4), and noting that the processes are independent, we have for all and all ,
[TABLE]
Now, there exist constants and such that
[TABLE]
To prove this, we can consider a scale function of the diffusion such that and for . Using the expression of the scale function and the speed measure (see e.g. [32, V.52]), one easily checks that when for some and that Proposition 4.9 of [14] is satisfied, so that for some . Since when , (5.7) is proved and hence (2.7) holds true.
Step 4: Harnack inequality for . Consider a bounded nonnegative measurable function and define the application . Our aim is to prove that, for all , there exist two constants and , which do not depend on , such that
[TABLE]
We fix and we will omit the indices for the constants and for the rest of Step 4. Let be a compact set with boundary such that and such that . We set and . Let and be fixed in such that . We define and as the joint law of starting from and starting from , respectively. It follows from Lusin’s theorem (see e.g. [17, Thm. 7.5.2]) that there exists a sequence of bounded functions from to such that converges bounded pointwisely toward , -almost everywhere. For all , we let be the solution to the linear parabolic equation
[TABLE]
By [28, Thm 5.1.15], for all , is of regularity . In particular, applying Itô’s formula to at time and taking the expectation, one deduces that,
[TABLE]
By Lebesgue’s theorem, the last quantity converges when to
[TABLE]
where we used the strong Markov property at time . Similarly, converges to .
Using the Harnack inequality provided by [26, Theorem 1.1] (with and ), we deduce that there exists a constant which does not depend on , such that nor on such that
[TABLE]
Hence, we deduce that
[TABLE]
and (5.8) is proved.
Step 5 : proof that Assumptions 2 and 3 are satisfied. Fix and let denote the conditional law . Then the Harnack inequality (5.8) entails that, for all such that and all measurable nonnegative bounded on ,
[TABLE]
This means that
[TABLE]
Now, let . Since is bounded, connected and at a positive distance of , is uniformly bounded from below in by a positive constant . Therefore, Markov’s property implies that, for all ,
[TABLE]
This is the first part of Assumption 2.
The second part of Assumption 2 is also a consequence of (5.8). Indeed, for any fixed and for all , this equation applied to and the Markov property entail that
[TABLE]
Since is non-increasing, we deduce that
[TABLE]
Since has a finite diameter and is connected, we deduce that there exists such that, for all ,
[TABLE]
Now, for , we simply use the fact that is uniformly bounded from below on by a constant . In particular,
[TABLE]
As a consequence, the second part of Assumption 2 is satisfied.
Assumption 3 is a direct consequence of the domination by solutions to (5.4), since these solutions come down from infinity and hit [math] in finite time almost surely (cf. e.g. [3]).
Finally, we deduce from Steps 1, 2, 3 and 5 that all the assumptions of Theorem 2.4 are satisfied. This concludes the proof of Theorem 5.1. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] V. Bansaye. Approximation of stochastic processes by nonexpansive flows and coming down from infinity. Ann. Appl. Probab. , 29(4):2374–2438, 08 2019.
- 2[2] H. Brezis. Analyse fonctionnelle . Collection Mathématiques Appliquées pour la Maîtrise. [Collection of Applied Mathematics for the Master’s Degree]. Masson, Paris, 1983. Théorie et applications. [Theory and applications].
- 3[3] P. Cattiaux, P. Collet, A. Lambert, S. Martínez, S. Méléard, and J. San Martín. Quasi-stationary distributions and diffusion models in population dynamics. Ann. Probab. , 37(5):1926–1969, 2009.
- 4[4] P. Cattiaux and S. Méléard. Competitive or weak cooperative stochastic Lotka-Volterra systems conditioned on non-extinction. J. Math. Biol. , 60(6):797–829, 2010.
- 5[5] N. Champagnat, K. A. Coulibaly-Pasquier, and D. Villemonais. Criteria for exponential convergence to quasi-stationary distributions and applications to multi-dimensional diffusions. Séminaire de Probabilités XLIX , pages 165–182, 2018.
- 6[6] N. Champagnat, R. Ferrière, and S. Méléard. Unifying evolutionary dynamics: From individual stochastic processes to macroscopic evolution. Theor. Pop. Biol. , 69:297–321, 2006.
- 7[7] N. Champagnat, R. Ferrière, and S. Méléard. Individual-based probabilistic models of adaptive evolution and various scaling approximations. In Seminar on Stochastic Analysis, Random Fields and Applications V , volume 59 of Progr. Probab. , pages 75–113. Birkhäuser, Basel, 2008.
- 8[8] N. Champagnat, R. Schott, and D. Villemonais. Probabilistic Non-asymptotic Analysis of Distributed Algorithms. Ar Xiv e-prints , Feb. 2018.
