Stochastic approximation of quasi-stationary distributions for diffusion processes in a bounded domain
Michel Bena\"im (UNINE), Nicolas Champagnat (IECL, TOSCA-NGE-POST),, Denis Villemonais (IECL, TOSCA-NGE-POST)

TL;DR
This paper introduces a stochastic approximation method to estimate the quasi-stationary distribution of diffusion processes in bounded domains, demonstrating convergence of the occupation measure to the distribution.
Contribution
It presents a novel stochastic approximation approach for quasi-stationary distributions of diffusions, combining recent theoretical results with reinforcement-based processes.
Findings
Occupation measure converges to the unique quasi-stationary distribution.
Method effectively estimates quasi-stationary distributions in bounded domains.
Theoretical proofs confirm convergence using stochastic approximation techniques.
Abstract
We study a random process with reinforcement, which evolves following the dynamics of a given diffusion process in a bounded domain and is resampled according to its occupation measure when it reaches the boundary. We show that its occupation measure converges to the unique quasi-stationary distribution of the diffusion process absorbed at the boundary of the domain. Our proofs use recent results in the theory of quasi-stationary distributions and stochastic approximation techniques.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
11footnotetext: Université de Neuchâtel, Switzerland22footnotetext: Université de Lorraine, CNRS, Inria, IECL, F-54000 Nancy, France
E-mail: [email protected], [email protected]
Stochastic approximation of quasi-stationary distributions for diffusion processes in a bounded domain
Michel Benaïm1, Nicolas Champagnat2, Denis Villemonais2
Abstract
We study a random process with reinforcement, which evolves following the dynamics of a given diffusion process in a bounded domain and is resampled according to its occupation measure when it reaches the boundary. We show that its occupation measure converges to the unique quasi-stationary distribution of the diffusion process absorbed at the boundary of the domain. Our proofs use recent results in the theory of quasi-stationary distributions and stochastic approximation techniques.
Résumé
Nous étudions un processus stochastique avec renforcement, qui évolue suivant une diffusion dans un domaine borné, avec ré-échantillonnage suivant sa mesure d’occupation lorsqu’il atteint la frontière. Nous montrons que sa mesure d’occupation converge vers l’unique distribution quasi-stationnaire de la diffusion absorbée au bord du domaine. Nos preuves s’appuient sur des résultats récents en théorie des distributions quasi-stationnaires et sur des techniques d’approximation stochastique.
Keywords: random processes with reinforcement, stochastic approximation, pseudo-asymptotic trajectories, quasi-stationary distributions.
2010 Mathematics Subject Classification. Primary: 60B12, 60J60, 60B10, 60F99; Secondary: 60J70.
1 Introduction
Let be a time homogeneous Markov process with state space , where is a measurable space and is an absorbing state for the process. This means that implies for all , -almost surely for all and, in particular,
[TABLE]
is a stopping time. We also assume that and for all and .
We consider a random process with reinforcement, which evolves following the dynamic of when it lies in and which is resampled according to its occupation measure when it reaches . More precisely, given a probability measure on , we set
[TABLE]
where ,
- •
is a realization of the process with (i.e. under ) and the stopping time is defined as the first hitting time of by ,
- •
given , is a realization of the process with , where
[TABLE]
and the first hitting time of by ,
- •
for all , given , is a realization of the process with , where
[TABLE]
and the first hitting time of by .
We also define for all
[TABLE]
This process has been studied in several situations, with the main goal of proving an almost sure convergence result for the occupation measure when . In the finite state space case and in a discrete time setting, Aldous, Flannery and Palacios [1] solved this problem by showing that the proportion of colours in a Pólya urn type process converges almost surely to the left eigenfunction of the replacement matrix, which was also identified as the quasi-stationary distribution of a corresponding Markov chain (we refer the reader to the surveys [15, 19] and to the book [11] for general references on quasi-stationary distributions; basic facts and useful results on quasi-stationary distributions are also reminded in Section 3). Under a similar setting but using stochastic approximation techniques, Benaïm and Cloez [3] and Blanchet, Glynn and Zheng[6] independently proved the almost sure convergence of the occupation measure toward the quasi-stationary distribution of . These works have since been generalized to the compact state space case by Benaïm, Cloez and Panloup [4] under general criteria for the existence of a quasi-stationary distribution for . Continuous time diffusion processes with smooth bounded killing rate on compact Riemanian manifolds have been recently considered by Wang, Roberts and Steinsaltz [21], who show that a similar algorithm with weights also converges toward the quasi-stationary distribution of the underlying diffusion process. Recently, Mailler and Villemonais [14] have proved such a convergence result for processes with smooth and bounded killing rate evolving in non-compact (more precisely unbounded) spaces using a measure-valued Pólya process representation of this reinforced algorithm.
The aim of the present paper is to solve the question of convergence of the occupation measure toward the quasi-stationary distribution of when this process is a uniformly elliptic diffusion evolving in an open bounded connected open set with boundary , with hard killing when the process hits the boundary. This answers positively the open problem stated in Section 8 of [4]. Note that the difficulty is twofold: firstly, the state space is an open domain in and is thus non-compact; secondly, the absorption occurs through killing at the boundary, which corresponds to an infinite killing rate.
Our main assumptions concern the regularity of the domain and of the parameters of the diffusion . They are satisfied in particular if the coefficients of the stochastic differential equation satisfied by are Hölder continuous. Our assumptions ensure the existence of a unique quasi-stationary distribution for and allows us to prove the almost sure convergence of the occupation measure toward . As in [2, 3, 21], we make use of stochastic approximation techniques (in the sense of [2, 5]). These works strongly rely on the proof techniques of [16], which are based on technical regularity results that do not adapt well to the present setting. Our proof uses instead recent advances in the theory of quasi-stationary distributions [8] together with coupling arguments, in order to prove that the occupation measure dynamics are globally asymptotically stable. Combined with the general results on asymptotic pseudo trajectories of [2, 5], this entails the almost sure convergence of the occupation measure.
The paper is organised as follows. In Section 2, we state our main assumptions and results. In Section 3, we gather useful general results on quasi-stationary distributions from [8, 10] and prove new general results on a key operator , which has its own interest and should be useful for future adaptation of the methods developed below. Section 4 is devoted to the proof of our main result, which consists in checking that the occupation measure of the resampling points is (up to a time change and linearization) an asymptotic pseudo-trajectory of a measure-valued dynamical system related to the operator (we refer the reader to [2] for an introduction to asymptotic pseudo-trajectories and their use in stochastic approximation theory).
2 Main result
From now on, we consider a diffusion process in a connected bounded open set of , with boundary and absorbed at . We assume that is solution to the SDE
[TABLE]
where is a -dimensional Brownian motion, is bounded and continuous and is continuous, is uniformly elliptic and for all ,
[TABLE]
for some function such that . Note that, in this case, the process described in the introduction is well-defined since one can prove that a.s. [4, Lemma 8.1].
In [7, Section 5.3], it was proved that, under the above regularity assumptions, the killed diffusion process admits a unique quasi-stationary distribution, i.e. a probability measure on such that
[TABLE]
where denotes the hitting time of by the process. Moreover, it is well known that, in this case, there exists a positive constant such that for all (see Section 3 for more results on quasi-stationary distributions).
Remark 1*.*
In fact, the result of [7, Section 5.3] is stronger and entails the exponential convergence in total variation norm of the conditional law of toward , uniformly in the initial distribution. The proof relies on the fact that Conditions (A1) and (A2) as enunciated in the next section are satisfied by the process (see Section 3 for details and additional properties).
Remark 2*.*
This last property was also proved to hold true for general one-dimensional diffusions in or absorbed at the boundary of and coming down from infinity in [9] and for diffusion processes in compact, connected manifolds with boundary absorbed at when the infinitesimal generator of is given by , where is the Laplace-Beltrami operator and is a vector field in [7]. All the results of this paper, and in particular the next one, can be extended to these two situations.
The main result of this article is the following one.
Theorem 2.1**.**
For all bounded measurable function , one has
[TABLE]
Moreover, almost surely when .
3 Properties of the Green operator
The results of Subsection 3.1 are valid for general absorbed Markov processes, not only for diffusion processes absorbed at the boundary of a domain. In Subsection 3.2, we provide properties on the measure-valued dynamical system induced by the Green operator of the process. Although not specific to diffusion processes, the later part uses the fact that the semi-group of the underlying process is Lipschitz regular.
3.1 General properties
Let us consider in this section a Markov process on a measurable space , absorbed in at time
[TABLE]
assumed a.s. finite. We also assume that for all and all .
A probability measure on is called a quasi-stationary distribution if
[TABLE]
It is well known that a probability measure is a quasi-stationary distribution if and only if there exists a probability measure on such that
[TABLE]
for all measurable subsets of . The fact that is a quasi-stationary distribution also implies the existence of a constant such that
[TABLE]
In [8], the authors provide a necessary and sufficient condition on for the existence of a probability measure on and constants such that
[TABLE]
where is the set of probability measures on and is the total variation norm defined as for all , where is the set of bounded measurable functions on . This immediately implies that is the unique quasi-stationary distribution of and that (3.1) holds for any initial probability measure .
The necessary and sufficient condition for (3.3) is given by the existence of a probability measure on and of constants such that
[TABLE]
and
[TABLE]
Under Conditions (A1) and (A2), it follows from the general results of [8, Prop. 2.3] that there exists a bounded function such that and, for all and all ,
[TABLE]
In the case of diffusion processes, is a nonnegative solution to where is the infinitesimal generator of the process in the set of bounded measurable functions equiped with the norm. The constant is the same as in (3.3). In particular, there exists a constant such that
[TABLE]
One can actually obtain a better bound combining Theorem 2.1 and Equation (3.2) of [10]: there exists a time and a constant such that, for all and all ,
[TABLE]
We may—and will—assume without loss of generality that .
We denote by the (nonconservative) semigroup of the Markov process , acting on the set of bounded measurable functions on and defined for all such function by
[TABLE]
Note that we made here the slight abuse of notation that . Because of (3.5), we can define the Green operator on as
[TABLE]
and this operator is bounded on equiped with the norm. Let be the set of probability measures on . For all , we also define the notation
[TABLE]
so that in particular and . Since is bounded, the operator is well-defined for all .
Proposition 3.1**.**
Assume that Conditions (A1) and (A2) are satisfied. Then, for all , all and all , we have
[TABLE]
where the constants and are those involved in (3.3), (3.4) and (3.5). We also have for some constant
[TABLE]
and for all ,
[TABLE]
Proof.
We first check by induction that for all ,
[TABLE]
This is of course true for . Assuming it is true for a given , we have
[TABLE]
which concludes the induction. Then, it follows from (3.3), (3.4) and (3.5) that
[TABLE]
The inequality (3.8) follows.
We then deduce from (3.8) that
[TABLE]
Now, it follows from (3.6) that
[TABLE]
Combining the last two inequalities entails (3.9).
Similarly, for all , and , we deduce from (3.8) that
[TABLE]
Now, it follows from (3.12) that
[TABLE]
The last two inequalities entail (3.10). ∎
3.2 Properties of a measure-valued dynamical system
We begin with the following proposition, which ensures that is regularizing. In particular, for all (which denotes the set of bounded continuous functions from to ), is in . This Feller property implies that is continuous with respect to the weak topology on the set of non-negative measures with finite mass on . Similarly, one deduces that is continuous.
Proposition 3.2**.**
For all bounded measurable functions , the application is Lipschitz continuous, with Lipschitz norm proportional to .
Proof.
From Priola and Wang [17], one deduces that there exists a constant which does not depend on such that, for all and all ,
[TABLE]
Applying this inequality to at time and using inequality (3.5), one deduces that
[TABLE]
As a consequence,
[TABLE]
which concludes the proof of Proposition 3.2. ∎
The following proposition states the uniqueness of the evolution equation satisfied by the continuous process .
Proposition 3.3**.**
For each probability measure on , the equation
[TABLE]
where is a measure valued function defined, for all non-negative finite measures on by
[TABLE]
admits a unique weak solution in , where is equiped with the weak topology, in the sense that, for all bounded continuous function and all ,
[TABLE]
In addition, this unique weak solution takes its values in and is given by .
Proof.
The fact that satisfies (3.13) is immediate. Let us check that this equation has no other solution. In order to do so, we consider one of its solutions and introduce the measure valued process defined by
[TABLE]
This process is weak solution to the linear evolution equation
[TABLE]
whose unique weak solution is . Indeed, let be two weak solutions to the linear equation. Set where the supremum is taken over the set of continuous function such that . Then is lower semicontinuous, hence measurable, as a supremum of continuous functions. Thus, by Gronwall’s lemma (measurable version, see [13]), This proves uniqueness.
As a consequence, for all ,
[TABLE]
which concludes the proof of Proposition 3.3. ∎
4 Proof of Theorem 2.1
The general idea of the proof is inspired from [2] and consists in proving that a time-change of the sequence of probability measures
[TABLE]
is an asymptotic pseudo-trajectory (see [2] for the definition of an asymptotic pseudo-trajectory) of a measure-valued dynamical system related to the normalized semigroup . The asymptotic properties given in Proposition 3.1 then allow to deduce that almost surely converges to . The proof is divided in three steps. First, we prove in Subsection 4.1 tightness properties on the measure-valued process . The convergence of to is proved in Subsection 4.2, using a key lemma on asymptotic pseudo-trajectories properties for , proved in Subsection 4.3. Theorem 2.1 is then be deduced from the convergence of using martingale arguments in Subsection 4.4.
4.1 Tightness
The following proposition entails that the paths of are a.s. relatively compact in the set of probability measures on endowed with the weak topology.
Proposition 4.1**.**
For all , there exists such that, almost surely,
[TABLE]
Proof.
Let be the distance to . There exists a neighborhood of in where is so that we can apply Itô’s formula: for all such that ,
[TABLE]
We introduce the random time-change such that
[TABLE]
and we observe that there exist constants such that for all . Then, there exists a Brownian motion such that the process satisfies
[TABLE]
where the process is progressively measurable and bounded by a constant .
We introduce such that and the reflected drifted Brownian motion solution to
[TABLE]
and such that , where is the local time of at at time .
Since the jumps of are positive, one can prove following [20, Prop. 2.2] that a.s. for all . Moreover, the process is ergodic and satisfies almost surely
[TABLE]
where is the stationary distribution of on .
Now, for all , there exists such that . Hence, almost surely for all large enough
[TABLE]
Since is continuous and for all , this concludes the proof of Proposition 4.1. ∎
The previous proposition entails that, for all , there exists such that, almost surely, for large enough. The following proposition is of a slightly different nature and it will be used later in order to prove that there exists a constant such that, almost surely, for large enough.
Proposition 4.2**.**
For all , there exists such that, almost surely, one has
[TABLE]
and
[TABLE]
where , and the constant and the process were defined in the proof of Proposition 4.1.
Proof.
Fix . From Proposition 4.1, there exists such that, almost surely, for large enough. For all , we define the random variable in
[TABLE]
so that . We also define the sequence of points in by
[TABLE]
where is an arbirary point in . By definition, the law of conditionally to and is . Moreover, is measurable with respect to , and hence, for all , (we denote by the probability conditionally to and )
[TABLE]
Using the law of large numbers for submartingales, this implies that, almost surely and for all ,
[TABLE]
Observing that, almost surely, there exists such that for all , this concludes the proof of (4.2).
To prove (4.3), we observe that, due to the coupling argument of the proof of Proposition 4.1,
[TABLE]
where are i.i.d. copies of such that is independent of for all . Therefore, we can use the law of large numbers for submartingales as above to conclude the proof of Proposition 4.2. ∎
4.2 Study of the empirical measure of the resampling points
In this subsection, we focus on the behaviour of the random sequence of measures defined in (4.1). Our aim is to prove the following proposition using the theory of pseudo-asymptotic trajectories.
Proposition 4.3**.**
The sequence of probability measures converges almost surely to with respect to the weak topology.
Proof.
We follow an approach inspired from [2]. Let be defined as and for , where
[TABLE]
We consider the linearly interpolated version of defined, for all and all , by
[TABLE]
where we define by convention for some fixed .
Let be a sequence of bounded continuous functions from to such that the metric
[TABLE]
metrizes the weak topology on measures on .
The main point of the proof consists in using [2, Theorem 3.2] to prove that is an asymptotic pseudo-trajectory of (3.13). By Proposition 3.3, this means in our setting that, for all ,
[TABLE]
This is stated in the next lemma, proved in the next subsection.
Lemma 4.4**.**
The measure-valued process is almost surely an asymptotic pseudo-trajectory for the distance on the set of probability measures on of the semi-flow induced by (3.13) and defined in Proposition 3.3.
Once this is proved, Proposition 4.3 follows easily: indeed is almost surely a relatively compact asymptotic pseudo-trajectory of the semi-flow induced by (3.13) for which is a compact global attractor, which implies the result (see for instance [4, Corollary 5.3] and [2, 5]).
∎
4.3 Proof of Lemma 4.4
For all , we have
[TABLE]
where, recalling the definition of in (3.7) and of in (3.14),
[TABLE]
Fix and small enough so that the conclusions of Proposition 4.2 hold true. Setting , which is positive by Proposition 3.2, we define for all the random variable in
[TABLE]
The conclusion of Proposition 4.2 entails that .
Following [2], before proving Lemma 4.4, we begin by proving the next lemma.
Lemma 4.5**.**
Almost surely, for all bounded measurable function , the numeric sequence admits a finite limit when .
Proof.
For all , we introduce the -field generated by , and . Fix . We start by observing that
[TABLE]
so that is predictable with respect to the filtration .
Following [18, Lemma 1], we define and
[TABLE]
where denotes the expectation conditionally to . Observe is a martingale with respect to and that
[TABLE]
We have, for all
[TABLE]
As a consequence, the martingale is uniformly bounded in and hence converges almost surely. Let us now prove that converges almost surely when .
We have, for all ,
[TABLE]
For all , this quantity is almost surely bounded by . For all , the definition of entails that
[TABLE]
We first consider the term in (4.6). It follows from the fact that is a -martingale and from Cauchy-Schwarz inequality that
[TABLE]
where since, by (3.5),
[TABLE]
Consider now the term in (4.5).
[TABLE]
where we used (4.7) with to obtain the last inequality.
We deduce that is (beware that the may depend on ), so that and hence that almost surely.
Because of the a.s. convergence of the sequence , we conclude that converges almost surely when for all . Since, almost surely, there exists such that , this concludes the proof of Lemma 4.5. ∎
Proof of Lemma 4.4.
We introduce the time-changed version of the sequence as for all and all . We also define for all .
To apply [2, Theorem 3.2], one needs to prove that is almost surely relatively compact, that it is almost surely uniformly continuous and that all limit points of in , endowed with the topology of uniform convergence for the metric on compact time inervals, are almost surely weak solutions of (3.13), where .
The fact that is relatively compact is an immediate consequence of Proposition 4.2 and the almost surely uniform continuity is also immediately obtained from the construction of , since for all ,
[TABLE]
and since almost surely by Proposition 4.2.
In order to prove the last point, we adapt the method developed in [2, Proposition 4.1]. Assume that there exists an increasing sequence of positive numbers converging to such that converges to an element in with respect to the uniform convergence on compact time intervals. Our aim is to prove that is a weak solution to (3.13).
For all , define by
[TABLE]
so that, using the equality ,
[TABLE]
where, for all ,
[TABLE]
For all , let us denote by the unique non-negative integer such that . Then, proceeding as in (4.8), one easily checks that
[TABLE]
Since when and since , where and are bounded continuous functions, we deduce that converges to [math] when .
Also, for all and ,
[TABLE]
Hence Lemma 4.5 implies that also goes to [math] when .
Finally, since is clearly sequentially continuous in , one finally deduces that, for all ,
[TABLE]
which means that is a weak solution to (3.13) and hence, by [2, Theorem 3.2], that is an asymptotic pseudo-trajectory of the flow induced by (3.13). ∎
4.4 Proof of Theorem 2.1
Fix any bounded measurable functions . For all , we set
[TABLE]
The random sequence is a -martingale and
[TABLE]
This martingale property implies that
[TABLE]
From [12, Theorem 1.3.17], we deduce that goes almost surely to zero when goes to infinity, that is
[TABLE]
Since is continuous and bounded for any bounded measurable function (see Proposition 3.2), one deduces from Proposition 4.3 that, almost surely,
[TABLE]
Applying this result to , one deduces that converges to almost surely and hence that converges to almost surely. Since, for all ,
[TABLE]
the almost sure convergence of to when follows from the almost sure convergence of to the positive constant .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. Aldous, B. Flannery, and J. L. Palacios. Two applications of urn processes the fringe analysis of search trees and the simulation of quasi-stationary distributions of markov chains. Probab. Eng. Inf. Sci. , 2(3):293–307, 1988.
- 2[2] M. Benaïm. Dynamics of stochastic approximation algorithms. In Séminaire de Probabilités, XXXIII , volume 1709 of Lecture Notes in Math. , pages 1–68. Springer, Berlin, 1999.
- 3[3] M. Benaïm and B. Cloez. A stochastic approximation approach to quasi-stationary distributions on finite spaces. Electron. Commun. Prob. , 20, 2015.
- 4[4] M. Benaïm, B. Cloez, F. Panloup, et al. Stochastic approximation of quasi-stationary distributions on compact spaces and applications. Ann. Appl. Probab. , 28(4):2370–2416, 2018.
- 5[5] M. Benaïm and M. W. Hirsch. Asymptotic pseudotrajectories and chain recurrent flows, with applications. J. Dynam. Differential Equations , 8(1):141–176, 1996.
- 6[6] J. Blanchet, P. Glynn, and S. Zheng. Analysis of a stochastic approximation algorithm for computing quasi-stationary distributions. Adv. Appl. Probab. , 48(3):792–811, 2016.
- 7[7] N. Champagnat, K. A. Coulibaly-Pasquier, and D. Villemonais. Criteria for exponential convergence to quasi-stationary distributions and applications to multi-dimensional diffusions. In Séminaire de Probabilités XLIX , pages 165–182. Springer, 2018.
- 8[8] N. Champagnat and D. Villemonais. Exponential convergence to quasi-stationary distribution and Q-process. Probab. Theory Rel. , 164(1):243–283, 2016.
