A variational approach to nonlinear and interacting diffusions
Marc Arnaudon (IMB), Pierre Del Moral (CMAP, CQFD)

TL;DR
This paper introduces a new variational calculus framework combining multiple techniques to analyze stability and chaos propagation in nonlinear, interacting diffusions across diverse stochastic models and manifolds.
Contribution
It develops a unified variational approach integrating gradient flows, stochastic interpolations, and spectral theory for nonlinear diffusions, including on manifolds, with new exponential contraction results.
Findings
First exponential contraction inequalities for this class of models.
Uniform propagation of chaos over time.
Applications to fluid mechanics and granular media.
Abstract
The article presents a novel variational calculus to analyze the stability and the propagation of chaos properties of nonlinear and interacting diffusions. This differential methodology combines gradient flow estimates with backward stochastic interpolations, Lyapunov linearization techniques as well as spectral theory. This framework applies to a large class of stochastic models including non homogeneous diffusions, as well as stochastic processes evolving on differentiable manifolds, such as constraint-type embedded manifolds on Euclidian spaces and manifolds equipped with some Riemannian metric. We derive uniform as well as almost sure exponential contraction inequalities at the level of the nonlinear diffusion flow, yielding what seems to be the first result of this type for this class of models. Uniform propagation of chaos properties w.r.t. the time parameter are also provided.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A variational approach to nonlinear and interacting diffusions
M. Arnaudon
Institut de Mathématiques de Bordeaux (IMB), Bordeaux University, France
P. Del Moral P. Del Moral was supported in part by funding from the Chaire Stress Test, BNP Paribas SFTS and CMAP, Polytechnique Palaiseau, France INRIA, Bordeaux Research Center & CMAP, Polytechnique Palaiseau, France
Abstract
The article presents a novel variational calculus to analyze the stability and the propagation of chaos properties of nonlinear and interacting diffusions. This differential methodology combines gradient flow estimates with backward stochastic interpolations, Lyapunov linearization techniques as well as spectral theory. This framework applies to a large class of stochastic models including non homogeneous diffusions, as well as stochastic processes evolving on differentiable manifolds, such as constraint-type embedded manifolds on Euclidian spaces and manifolds equipped with some Riemannian metric. We derive uniform as well as almost sure exponential contraction inequalities at the level of the nonlinear diffusion flow, yielding what seems to be the first result of this type for this class of models. Uniform propagation of chaos properties w.r.t. the time parameter are also provided. Illustrations are provided in the context of a class of gradient flow diffusions arising in fluid mechanics and granular media literature. The extended versions of these nonlinear Langevin-type diffusions on Riemannian manifolds are also discussed.
Keywords : Nonlinear diffusions, mean field particle systems, variational equations, logarithmic norms, gradient flows, contraction inequalities, Wasserstein distance, Riemannian manifolds.
Mathematics Subject Classification : 65C35, 82C80, 58J65, 47J20.
1 Introduction
1.1 Description of the models
We denote by , resp. and the spectral norm, the Frobenius norm, and the logarithmic norm of some matrix , where stands for the transpose of , and the maximal eigenvalue. With a slight abuse of notation, we denote by the identity -matrix, for any .
Let be some time varying differentiable vector field with Jacobian matrix on , for some parameter . Consider the deterministic flow starting at associated with the evolution equation
[TABLE]
The r.h.s. equation is often called the first order variational equation associated with the flow along the trajectory . This equation plays a central role in the sensitivity analysis of nonlinear dynamical systems w.r.t. their initial conditions. For instance, the spectral norm of can be estimated in terms of the logarithmic norm using the inequalities
[TABLE]
A proof of this assertion can be found in [14], see also [27] for extensions to Lipschitz functions on Banach spaces. Whenever for some , the r.h.s. estimate in (1.2) readily implies the exponential stability estimate
[TABLE]
The linearization technique discussed above is often referred as the Lyapunov first or indirect method to analyze the stability of nonlinear dynamical systems. For a more thorough discussion on this subject we refer to the pioneering work by Lyapunov [24], as well as to chapter 4 in the more recent monograph by Khalil [23].
The main objective of this article is to extend these results to nonlinear diffusions and their mean field particle interpretations on Euclidian as well as on differentiable manifolds. The differential analysis of conventional diffusions w.r.t. initial conditions is also one of the stepping stones of Bismut and Malliavin calculus. This framework is mainly designed to study the existence and the properties of smooth probability densities in terms of the differential properties of the diffusion semigroup. For a more thorough discussion on this subject we refer to [13, 30], and references therein.
The relevant mathematical apparatus for the description and the variational analysis of stochastic processes on manifolds being technically more sophisticated than conventional differential calculus, this introduction only discusses nonlinear and interacting diffusions on Euclidian spaces. The extended versions of these models on Riemannian manifolds are discussed in some details in section 3.2, as well as in section 4.3.
Let be the set of Borel probability measures on with finite second absolute moment, equipped with the -Wasserstein distance given by
[TABLE]
In the above display, the infimum is taken over all pairs of random variables with respective distributions and ; and stands for the Euclidian distance between and on the product space .
Also let and be differentiable functions from into and , for some ; and let be an -dimensional Brownian motion. For any and any time horizon we denote by be the stochastic flow defined for any and any starting point by the McKean-Vlasov diffusion
[TABLE]
In the above display, stands for the evolution semigroup
[TABLE]
We further assume that the mean field drift and diffusion functions are given by
[TABLE]
We shall assume that the nonlinear diffusion flow (1.4) is well defined. For instance, the existence of this flow is ensured as son as and are Lipschitz, see for instance [18, 22].
The mean field particle system associated with (1.4) is defined by the stochastic flow of a system of interacting diffusions
[TABLE]
with the empirical measures
[TABLE]
In the above displayed formulae, stands for the initial configuration and are independent copies of .
1.2 Statement of some main results and article organisation
To motivate this study, the variational calculus developed in the article is illustrated with the following example
[TABLE]
for some , some confinement type potential function (a.k.a. the exterior potential) and some interaction potential function . This class of nonlinear diffusions and the corresponding particle interpretations were introduced by H. P. McKean in [28, 29]. The extended versions of these Langevin-type nonlinear diffusions on Riemannian manifolds are discussed in the end of section 3.2 as well as in section 4.3.
Nonlinear diffusions (1.4) with constant diffusion and gradient-type drifts (1.6) arise in fluid mechanics, and more particularly in the modeling of granular flows [6, 7, 35, 42]. In this context, represents the evolution semigroup of the velocity of a diffusive particule interacting with the distribution of the particles around its location and following some confinement exterior potential. In this interpretation, the mean field particle model (1.5) can be seen as a particle-type representation of the granular flow.
In the last two decades, the analysis of the long time behavior of this particular class of gradient type flow diffusions have been developed in various directions:
The first articles on the long time behavior of these models are the couple of articles by Tamura [33, 34]. The stability properties of one dimensional models has been started in [4, 5] as well as in [6], see also [9, 11, 35].
Since this period, several sophisticated probabilistic techniques have been developed to analyze the long time behavior of these Langevin-type nonlinear diffusions, including log-Sobolev functional inequalities [25, 26], entropy dissipation [10, 15], as well as gradient flows in Wasserstein metric spaces and optimal transportation inequalities [8, 10, 12, 31], combining the functional Bakry-Emery method [3], with the Otto-Villani approach [32]. The long time self-stabilizing behavior of this class of processes in multi-wells landscapes has also been developed by J. Tugaut in a series of articles [36, 37, 39, 40, 41]. For a more thorough discussion on this subject we refer to the recent article [17], and the references therein.
Unfortunately, most of the probabilistic techniques discussed above only apply to gradient flow type diffusions of the form (1.6). The variational calculus developed in the present article is not restricted to this class of gradient-type nonlinear models. Nevertheless, because of their importance in practice this introduction illustrates some of our main results in this context.
Firstly, and rather surprisingly, the variational methodology developed in the present article applies directly to gradient flow models of the form (1.6), simplifying considerably both of their stability analysis as well as the convergence analysis of their mean field particle interpretations.
This framework also allows to relax unnecessary technical conditions such as the symmetry of the interaction potential function, or the invariance of the center of mass, currently used in the literature on this subject (see for instance [33], as well as section 2 in [10], and section 1 in the more recent article [8]). It also allows to derive uniform as well as almost sure exponential stability inequalities at the level of the nonlinear diffusion flow. For instance, when is an even convex function with bounded Hessian , and when , for some we have the almost sure estimates
[TABLE]
The above estimate is also met for odd interaction potential, as soon as . In the above display, it is implicitly assumed that the stochastic flows are driven by the same Brownian motion.
These almost sure inequalities are direct consequence of the contraction inequality (2.6), the remark (2.15) and the almost sure estimates stated in corollary 3.2.
To the best of our knowledge, the almost sure exponential decays (1.7) are the first result of this type for this class of nonlinear gradient flow diffusions.
Consider a pair of random variables with distributions on and set
[TABLE]
Under the assumptions on the potential functions discussed above, for any differentiable function on with bounded gradient we have the first order differential formula
[TABLE]
with the linear differential operator
[TABLE]
For a more precise statement we refer to theorem 2.2. Almost sure and uniform estimates of the first order differential maps are also provided in theorem 2.3.
Section 4.1 also presents a differential calculus to estimate the gradient of the stochastic flow of the interacting particle model (1.5). Under the assumptions on the potential functions discussed above, we shall prove the following uniform spectral norm estimate
[TABLE]
The above result is a direct consequence of theorem 4.1. The above estimate ensures that the -particle model converges exponentially fast to its invariant measure with some exponential decay that doesn’t depends on the number of particles. The latter property can also be checked using more sophisticated Logarithmic Sobolev inequalities [25]. To the best of our knowledge, the almost sure exponential decays stated above are the first result of this type for this class of interacting Langevin-type diffusions.
Section 4.2 also provides a natural differential calculus to derive quantitative and uniform propagation of chaos estimates for nonlinear diffusions of the form (1.5). Applying these results to interacting Langevin-type diffusions, without further work we recover the uniform estimates stated in theorem 1.2 in [25].
We emphasize that the differential calculus presented in this article allows to consider nonlinear diffusions evolving in differential manifolds. This should not come as a surprise since our framework allows to enter the variations of the diffusion matrices associated with these stochastic models which encapsulates the Riemannian structure of the manifold.
We illustrate these comments in the end of section 2.2 with a rather detailed discussion of an elementary nonlinear geometric-type diffusion. The manifold version of (1.9) is also provided in theorem 3.14.
We also underline that the variational calculus on differentiable manifolds developed in section 3.2 provides another view and additional results for the diffusions in endowed, when possible, with the Riemannian metric under which these diffusions are Brownian motion with drift. In this context, different types of synchronous coupling lead to gradient flow estimates where gradients of the diffusion functions are replaced by Ricci curvatures.
Quantitative propagation of chaos estimates of mean field particle systems on Riemannian manifolds are provided in section 4.3. Special attention is paid to derive uniform estimates w.r.t. the time horizon.
2 Nonlinear diffusion semigroups
2.1 Some gradient flow estimates
This section presents some basic properties of the first variational equation associated with the nonlinear diffusion (1.4). Let be the -th column vector of , and let and be the gradient of the functions and w.r.t. the coordinate . We also let be the -th coordinate of the column vector . The Jacobian of the diffusion flow is given by the gradient -matrix
[TABLE]
Consider the regularity condition stated below:
* : There exists some such that for any and we have*
[TABLE]
This spectral condition produces several gradient estimates. For instance, we have the following uniform estimate
[TABLE]
In addition, we have the almost sure estimate
[TABLE]
The proofs of the above assertions are provided in the appendix, on page Proof of (2.2) and (2.3). For the nonlinear Langevin diffusion discussed in (1.6) we have
[TABLE]
Arguing as in (1.3) we readily check the following proposition.
Proposition 2.1**.**
Assume is satisfied. In this situation, we have
[TABLE]
In addition, we have the almost sure estimate
[TABLE]
Whenever the above estimates ensure that the transition semigroup is exponentially stable, that is we have that
[TABLE]
These contraction inequalities quantify the stability of the stochastic flow w.r.t. the initial state , but they don’t give any information of the stability properties of the nonlinear semigroup w.r.t. the initial measure .
2.2 A first order differential calculus
This section presents a natural first order differential calculus to analyze the stability properties of the nonlinear semigroup . Consider the matrices
[TABLE]
In this notation, our second regularity condition takes the following form:
* : There exists some such that for any and we have*
[TABLE]
Let be the collection of random variables with distribution defined in (1.8). We also consider a couple of independent stochastic flows
[TABLE]
driven by independent Brownian motions, say and , and starting from a couple of independent random variables and with the same law.
In the further development of this section, we denote by the expectation operator w.r.t. the Brownian motion and the random variable . In this notation, we have
[TABLE]
This implies that
[TABLE]
with the initial condition
[TABLE]
A simple calculation yields the following estimate
[TABLE]
The inequality in the above display can be turned into an equality when . Also note that
[TABLE]
Let be the set of differentiable functions on with bounded derivative. A direct consequence of the fundamental theorem of calculus yields the following theorem.
Theorem 2.2**.**
For any and any and we have the first order differential formula (1.9). In addition, we have the exponential contraction inequality
[TABLE]
When , the above theorem provides an alternative and rather elementary proof of the exponential asymptotic stability of time varying McKean-Vlasov diffusions with non necessarily homogenous diffusion functions. To the best of our knowledge this stability property is the first result of this type for this general class of nonlinear diffusions.
For the Langevin-type diffusion discussed in (1.6) we have and the matrix reduces to
[TABLE]
When is odd we have
[TABLE]
In the reverse angle, if is even and convex then we have
[TABLE]
As expected, explicit formulae are available for linear and Gaussian models. For instance, when
[TABLE]
the diffusion flow is linear w.r.t. and given for any by the formula
[TABLE]
In the above display, stands for the identity function on . In this context, the process defined in (2.10) is also given by the formula
[TABLE]
This yields the rather crude estimate
[TABLE]
Up to some constant, this shows that the r.h.s. Wasserstein contraction estimate in (2.13) is met with . Applying Coppel’s inequality (cf. Proposition 3 in [14]) we can also choose for any , where stands for the spectral abscissa of a square matrix .
It may happen the stochastic flow (1.4) remains in some domain . The simplest model we have in head is the geometric diffusion on associated with the parameters
[TABLE]
In this situation, the diffusion flow is nonlinear w.r.t. and given for any by
[TABLE]
with the function defined by
[TABLE]
In the above display, we have used the convention . In this context, the process defined in (2.10) is also given by the formula
[TABLE]
Assume that is chosen so that . In this situation, for any we have
[TABLE]
as well as
[TABLE]
This yields the estimate
[TABLE]
Up to some constant, this shows that the r.h.s. Wasserstein contraction estimate in (2.13) is met with .
The analysis of nonlinear diffusions on more general differentiable manifolds is based on more sophisticated differential techniques. The extension of the variational calculus developed above to this class of stochastic processes on manifolds is provided in section 3.2.
We end this section with some illustrations of our results on time homogeneous models satisfying condition . We set , and . By theorem 2.2, there exists an unique invariant measure
[TABLE]
For the nonlinear Langevin diffusion discussed in (1.6) condition is met when (2.14) or (2.15) are satisfied. In this context, is a conventional Langevin diffusion given by the time homogeneous stochastic differential equation
[TABLE]
In this situation, the unique invariant measure of is given by
[TABLE]
In the above display, stands for the Lebesgue measure on . In this case the measure is the unique solution of the equation . We underline that the uniqueness of the invariant measure is not ensured for double-well confinement potential functions and too small noise. Further details on this subject including a description of the invariant measures for small noise can be found in the series of articles [19, 20, 21].
Whenever is met, we also have the uniform moment estimates
[TABLE]
In the same vein, when when and are met we have
[TABLE]
for some finite constant . The last assertion comes from the fact that
[TABLE]
2.3 Some almost sure estimates
We fix the parameters and some given time horizon , and we set , for any , with the process defined in (2.11). Also consider the processes
[TABLE]
with the collection of processes
[TABLE]
In this notation, the evolution equation (2.11) reduces to
[TABLE]
Let be the solution of the matrix evolution equation
[TABLE]
In this notation, we readily check that
[TABLE]
Whenever condition is met, for any given and any we have
[TABLE]
This shows that
[TABLE]
In addition, when is uniformly bounded, and is met, using (2.12) we have almost sure estimate
[TABLE]
with the uniform spectral norm
[TABLE]
We summarize the above discussion with the following theorem.
Theorem 2.3**.**
Assume that is uniformly bounded, and conditions and are met. In this situation, we have the almost sure estimate
[TABLE]
with the process defined in (1.8) and the parameter .
3 Some extensions
3.1 A backward variational formula
The stochastic transition semigroup associated with the flow is defined for any mesurable function on by the formula
[TABLE]
For twice differentiable function we have the gradient and the Hessian formulae
[TABLE]
In the above display, stand for the tensors functions
[TABLE]
Also recall that the infinitesimal generator of the stochastic flow (1.4) is given for any twice differentiable function by the second order operator
[TABLE]
Next theorem is an extension of a theorem by Da Prato-Menaldi-Tubaro [16] to nonlinear diffusions.
Theorem 3.1**.**
Assume that and are Lipschitz functions w.r.t. the parameters . In this situation, for any we have
[TABLE]
where stands for the backward integration notation, so that the r.h.s. term in the above formula is a square integrable backward martingale.
The proof of the above formula follows the elegant stochastic backward variational analysis developed in [16]. A sketched proof is provided in the appendix, on page Proof of (3.1).
We further assume that . In this situation, using the backward formula (3.1) we check the stochastic interpolation formula
[TABLE]
Equivalently, we have
[TABLE]
Combining (2.2) and (2.3) with (2.13) we obtain the following corollary.
Corollary 3.2**.**
Assume the conditions of theorem 3.1 are satisfied and we have and , for some constant . Also assume that and are met for some parameters and . In this situation we have the exponential decay estimates
[TABLE]
In addition, when we have the uniform and almost sure estimates
[TABLE]
3.2 Diffusions on smooth manifolds
This section is concerned with the extension of our results to nonlinear diffusions on Riemannian manifolds. Let us begin with the general necessary facts about nonlinear diffusions in manifolds. Our presentation will be made as similar as possible to the one in Euclidean space. For this, we will need Itô differentials of manifold valued diffusions, parallel translation, covariant differential of tangent bundle valued semimartingales.
Let be a smooth manifold of dimension . Stratonovich calculus is similar on and on . So we are able to deal with Stratonovich SDE’s of the type
[TABLE]
where for
[TABLE]
is a -valued Brownian motion and is a linear map . For simplicity will not depend on time, but the time-dependent can also be treated, we refer to [1] for this extension, and also for the details of the constructions below.
The only situation we will be interested in is when for all the map
[TABLE]
is a linear diffeomorphism. In this situation a scalar product can be defined in and then in , leading to a Riemannian structure on . The scalar product in is
[TABLE]
and the scalar product in is
[TABLE]
Associated to the metric is the Levi-Civita connection , which will be used to define parallel transport, Itô equations, Itô covariant differentials. Recall that the parallel transport along a continuous -valued semimartingale is the linear map which satisfies and the Stratonovich SDE . It is the natural extension to parallel transport along smooth paths, and it is an isometry. Parallel translation allows to anti-develop in with the Stratonovich integral
[TABLE]
The process takes its values in the vector space, it has an Itô differential , which allows to define the Itô differential of
[TABLE]
This Itô differential is formally a vector which can be expressed in local coordinates as
[TABLE]
The next object to consider is Itô covariant derivative of a -valued continuous semimartingale :
[TABLE]
easily defined from the fact that is vector valued. From the isometry property of parallel translation we easily get the formula for another -valued semimartingale and ,
[TABLE]
Defining (where for two vector fields , denotes the covariant derivative of in the direction ), it is well known that the Stratonovich SDE (3.3) is equivalent to the Itô SDEs
[TABLE]
A remarkable fact concerning this equation, is that whenever it exists, a solution to equation (3.9) is a diffusion with nonlinear generator , where
[TABLE]
So we can consider that the starting point of our study is SDE (3.9) in a Riemannian manifold .
Let us adapt the regularity conditions and :
Define , where is the covariant derivative with respect to the variable , it is a linear map from into itself, and is its adjoint with respect to the Riemannian metric.
* : There exists some such that for any and we have*
[TABLE]
where is the Ricci curvature tensor of .
Let be as in (2.8) with gradient replaced by covariant derivative.
Define .
* : There exists some such that for any and we have*
[TABLE]
where , are the product metric and Ricci curvature on .
Theorem 3.3**.**
We have the exponential expansion or contraction inequalities
[TABLE]
for some finite constant . In addition, we have
[TABLE]
- Remark:
The results of Theorem 3.3 still hold when and depend on time, one just has to replace in by and in by .
Proof.
The proof of the first estimate is similar to the proof of Theorem 4.1 in [1] (where time dependent metrics are considered), so we will omit it. The proof of the second one is a combination of this proof and to the one of Theorem 2.2 in the present article. Let us go into the details.
Let , two random variables with values in , and such that minimizes under the condition that has law and has law . For all , let be a geodesic between and .
As in the proof of Theorem 2.2, let and solve the equation
[TABLE]
where is a valued Brownian motion independent of . Let be independent of with the same law, and the solution to the Itô SDE
[TABLE]
where is the parallel transport along the . Notice that .
The equation (3.15) is not an SDE on the manifold , it is an SDE on -valued paths. Existence of solutions have been established in [1]. The processes are obtained one from the others by infinitesimal synchronious coupling, and it is the only construction where a.s. the paths has finite variation. Moreover, the derivatives of theses paths satisfy
[TABLE]
where is the vector such that . The advantage of this construction is that the above covariant derivative has finite variation, and this implies
[TABLE]
Then the proof is similar to the one of Theorem 2.2:
[TABLE]
This implies that
[TABLE]
On the other hand, we have
[TABLE]
This ends the proof of the theorem.
An important example of nonlinear diffusions in manifolds is again given by Langevin diffusions, defined as in (3.9), with now
[TABLE]
where is a potential function, is the Riemannian distance associated to the metric , is the distance to and is a function. A sufficient condition defined by (3.17) to be well defined and smooth is that the derivative of is nul at the origin and the support of is included in , where denotes the injectivity radius of . But smoothness of is not a necessary condition for defining nonlinear diffusions.
We find that for ,
[TABLE]
In this context, condition reduces to
[TABLE]
If for instance is simply connected with nonpositive curvature (which implies that the distance function is convex), and is nondecreasing, a sufficient condition is
[TABLE]
The computation of reveals that it is symmetric, and that for ,
[TABLE]
In this context condition reduces to
[TABLE]
where . Here again, when is simply connected with nonpositive curvature, is convex and nondecreasing, the above condition is met as soon as
[TABLE]
4 Mean field interacting diffusions
4.1 Stability properties
The interacting diffusion flow presented in (1.5) can be rewritten as
[TABLE]
with the drift and the diffusion functions defined for any with by the formulae
[TABLE]
For any differentiable function and any and we consider the gradient blocks
[TABLE]
In this notation, for any we have
[TABLE]
and the diagonal term
[TABLE]
Using the composition rule
[TABLE]
we check that
[TABLE]
* : There exists some such that for any and we have*
[TABLE]
This spectral condition produces several gradient estimates. For instance, arguing as in (2.6) we have the following theorem.
Theorem 4.1**.**
Assume condition is satisfied. In this situation we have the uniform exponential decay estimates
[TABLE]
In addition, when we have the uniform almost sure exponential decay estimate
[TABLE]
The proof of the above theorem is provided in the appendix, on page Proof of theorem 4.1.
For the nonlinear Langevin diffusion discussed in (1.6) we have and
[TABLE]
In this situation we have
[TABLE]
with the matrix with block entries
[TABLE]
When is odd we have
[TABLE]
When is even and convex we have and therefore
[TABLE]
In this situation, we also have
[TABLE]
Last but not least, whenever we have
[TABLE]
Note that holds when is even. In this situation, the diffusion reduces to a conventional Langevin diffusion
[TABLE]
In this context, the stationary measure of the particle model is given by the Gibbs measure
[TABLE]
4.2 Propagation of chaos properties
For any differentiable function from into we let be the gradient matrices w.r.t. the coordinate , and we set
[TABLE]
We extend matrix-valued functions to the product space by setting
[TABLE]
We also consider the mapping , , and for any we set
[TABLE]
Let and be the functions defined for any and by
[TABLE]
The matrices in the above display are given by
[TABLE]
and the matrices are given by
[TABLE]
Consider the following regularity condition:
* : There exists some such that for any and we have*
[TABLE]
Let be independent copies of a random variable with distribution on . Let and consider the diffusion processes defined as by replacing the occupation measures by the distributions ; that is, for any we have
[TABLE]
Theorem 4.2**.**
Assume condition is satisfied. In this situation, for any and any distribution on we have
[TABLE]
with the parameters
[TABLE]
Proof.
We set . Using the decomposition
[TABLE]
we check that
[TABLE]
with and defined by
[TABLE]
Applying Cauchy-Schwartz inequality we find that
[TABLE]
with
[TABLE]
To estimate the term we observe that
[TABLE]
On the other hand, for any differentiable function from into , and for any and we have the first order decomposition
[TABLE]
with the matrix
[TABLE]
By symmetry arguments, this implies that
[TABLE]
In the same vein, we have
[TABLE]
This yields the estimate
[TABLE]
To estimate the term we use the decomposition
[TABLE]
Also notice that
[TABLE]
We also have
[TABLE]
This implies that
[TABLE]
from which we check that
[TABLE]
Combining the above decompositions we check that
[TABLE]
Combining the above estimate with (4.13) we find that
[TABLE]
from which we conclude that
[TABLE]
Applying twice Cauchy-Schwartz inequality we check the estimate
[TABLE]
On the other hand, we have
[TABLE]
This implies that
[TABLE]
Recalling that for any and , we check that
[TABLE]
This ends the proof of the theorem.
We end this section with some comments on the regularity condition .
For the nonlinear Langevin diffusion discussed in (1.6) we have and
[TABLE]
In this context, we have
[TABLE]
Also observe that for any we have the decomposition
[TABLE]
with the matrices
[TABLE]
In the above display stands for the matrix defined in (2.9), and stand for the matrices defined for any by
[TABLE]
Consider the following regularity condition:
* : There exists some such that for any and we have*
[TABLE]
Assume that is met. Using the fact that , for any random matrix , we check that
[TABLE]
Several uniform estimates can be derived combining (4.12) with the moments estimates (2.17). For instance, suppose we are given a time homogeneous model , for some functions with uniformly bounded first order derivatives. Also assume is met for some . In this context, the moments estimates (2.17) ensure that
[TABLE]
for some constant whose values only depends on the measure . Choosing in (4.12) we readily check that
[TABLE]
4.3 Propagation of chaos in manifolds
Our aim is to state an analogous of Theorem 4.2 in a Riemannian manifold . We will take the notations of Section 3.2. Let us denote by the Riemannian distance in . Now are independent copies of a random variable with distribution on . For the diffusions satisfy the Itô SDE
[TABLE]
with linear, , and independent -valued Brownian motions independent of . Denote , . The diffusions are independent and identically distributed, with law at time . Define an approximation of with the Markov process satisfying and for all ,
[TABLE]
where for , denotes parallel translation along the minimal geodesic from to . It is well-known that such an equation has a solution, which realizes the coupling by parallel translation of martingale parts of and (see e.g. [2] or [43]). The only difficulty is when is in the cutlocus of , but this difficulty can be overcome by constructing approximations of the solutions which are decoupled in an -neighbourhood of the cutlocus, and by letting then tend to [math]. However the solution obtained is not strong. Anyway, since is an isometry and the are independent, the process is a Brownian motion in with drift , so it is a diffusion process. Moreover independent valued Brownian motions can be found such that
[TABLE]
they satisfy
[TABLE]
for some “complementary” martingale .
The important fact about this construction is that the distance has finite variation. More precisely, letting for with not belonging to the cutlocus of , the geodesic from to in time and we have
[TABLE]
In the above display stands for a nondecreasing process which increases only when is in the cutlocus of , and is the index map defined for , and , by
[TABLE]
where is a unit speed geodesic from to started at time [math], is an orthonormal basis of , and is a Jacobi field along (see e.g. [2]). It is well known that when then where is the same quantity computed in a constant curvature manifold, for two points at the same distance. Moreover we have the explicit values
[TABLE]
In any case, , so we obtain as a general result that when
[TABLE]
So we have
[TABLE]
Define similarly to the previous section for a Riemannian manifold and a map such that : for , elements of
[TABLE]
Also define
[TABLE]
[TABLE]
where , , and set
[TABLE]
Consider the following regularity condition:
* : There exists some such that for any and we have*
[TABLE]
Theorem 4.3**.**
Assume that the Ricci curvatures of are bounded below by and that the condition is satisfied. Then
[TABLE]
with the parameter defined as in Theorem 4.2.
- Remark:
The result of Theorem 4.3 extends to the case when and depend on time, if we replace the bound below of the Ricci curvatures by the assumption that .
Proof.
The proof is completely similar to the one of Theorem 4.2, thus it is only sketched. Letting we arrive at
[TABLE]
where
[TABLE]
and
[TABLE]
which leads to
[TABLE]
so letting we get
[TABLE]
This ends the proof of (4.32).
Let us investigate condition (4.31) for the Langevin diffusion with drift (3.17), namely
[TABLE]
We need the additional assumption . In this situation, the computation of in (4.34) yields the formula
[TABLE]
where we denoted , leading to the condition : for all ,
[TABLE]
This condition is met for instance when for all ,
[TABLE]
Appendix
Proof of (2.2) and (2.3)
After some calculations we check that
[TABLE]
with the matrix valued martingale
[TABLE]
and
[TABLE]
In the above display, we have used the fact that , for any random matrix . The end of the proof of (2.2) and (2.3) is now clear.
Proof of (3.1)
For any time mesh with and with we have
[TABLE]
Also observe that
[TABLE]
with the random fields
[TABLE]
Using elementary manipulations, for any we check that
[TABLE]
for some finite constants and . Recalling that and are Lipschitz functions we check that the almost sure convergence
[TABLE]
Using the Taylor expansion
[TABLE]
we check that
[TABLE]
Rearranging the terms we find that
[TABLE]
with the remainder term
[TABLE]
This yields the decomposition
[TABLE]
with the remainder term
[TABLE]
On the other hand, we have
[TABLE]
This implies that
[TABLE]
We end the proof of (3.1) by letting the time step .
Proof of theorem 4.1
Observe that
[TABLE]
This implies that
[TABLE]
This ends the proof of (4.1). The proof of (4.4) and (4.5) come from the formula
[TABLE]
with the martingale
[TABLE]
defined in terms of the diffusion processes
[TABLE]
The end of the proof of (4.4) and (4.5) follows the same lines of arguments as the proof of (2.2) and (2.3), thus it is skipped. This ends the proof of the theorem.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Arnaudon, A. Coulibaly, A. Thalmaier, Horizontal diffusions in C 1 superscript 𝐶 1 C^{1} path space. Séminaire de Probabilités XLIII, 73–94, Lecture Notes in Mathematics 2006, Springer (2010)
- 2[2] M. Arnaudon, A. Thalmaier and F.Y. Wang, Harnack inequality and heat kernel estimates on manifolds with curvature unbounded below, Bull. Sci. Math. 130 (2006) 223–233
- 3[3] D. Bakry, M. Emery. Diffusions hypercontractives. In Séminaire de probabilités, XIX, 1983/84, pp. 177–206. Lect. Notes in Math. 1123, Springer (1985).
- 4[4] S. Benachour, B. Roynette, D. Talay, P. Vallois. Nonlinear self-stabilizing processes, part I. Existence, invariant probability, propagation of chaos. Stochastic processes and their applications, vol. 75, no. 2, pp. 173–201 (1998).
- 5[5] S. Benachour, B. Roynette, P. Vallois. Nonlinear self-stabilizing processes, part II: Convergence to invariant probability. Stochastic processes and their applications, vol. 75, no.2, pp. 203–224 (1998).
- 6[6] D. Benedetto, E. Caglioti, M. Pulvirenti. A kinetic equation for granular media. RAIRO Modèl. Math. Anal. Numér. vol. 31, no. 5, pp. 615–641 (1997).
- 7[7] D. Benedetto, E. Caglioti, E., Carrillo, M. Pulvirenti. A non-Maxwellian steady distribution for one-dimensional granular media. J. Statist. Phys.vol. 91, pp. 979–990 (1998).
- 8[8] F. Bolley, I. Gentil, A. Guillin. Uniform convergence to equilibrium for granular media. Archive for Rational Mechanics and Analysis, vol. 208, no. 2, pp. 429–445 (2013).
