A second order analysis of McKean-Vlasov semigroups
M Arnaudon (IMB), P del Moral (CMAP, CQFD)

TL;DR
This paper develops a second order differential calculus for McKean-Vlasov semigroups, enabling detailed analysis of their regularity, stability, and propagation of chaos properties in nonlinear diffusion processes.
Contribution
It introduces a novel second order calculus framework and explicit expansions for McKean-Vlasov semigroups, advancing understanding of their stability and chaos propagation.
Findings
Provides second order Taylor expansions with remainders
Derives Bismut-Elworthy-Li formulae for gradients and Hessians
Establishes exponential decay inequalities for semigroup stability
Abstract
We propose a second order differential calculus to analyze the regularity and the stability properties of the distribution semigroup associated with McKean-Vlasov diffusions. This methodology provides second order Taylor type expansions with remainder for both the evolution semigroup as well as the stochastic flow associated with this class of nonlinear diffusions. Bismut-Elworthy-Li formulae for the gradient and the Hessian of the integro-differential operators associated with these expansions are also presented. The article also provides explicit Dyson-Phillips expansions and a refined analysis of the norm of these integro-differential operators. Under some natural and easily verifiable regularity conditions we derive a series of exponential decays inequalities with respect to the time horizon. We illustrate the impact of these results with a second order extension of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Mathematical Biology Tumor Growth · Markov Chains and Monte Carlo Methods
A second order analysis of McKean-Vlasov semigroups
M. Arnaudon
Univ. Bordeaux, CNRS, Bordeaux INP, IMB, UMR 5251, F-33400, Talence, France
P. Del Moral
INRIA, Bordeaux Research Center, Talence, France & CMAP, Polytechnique Palaiseau, France
Abstract
We propose a second order differential calculus to analyze the regularity and the stability properties of the distribution semigroup associated with McKean-Vlasov diffusions. This methodology provides second order Taylor type expansions with remainder for both the evolution semigroup as well as the stochastic flow associated with this class of nonlinear diffusions. Bismut-Elworthy-Li formulae for the gradient and the Hessian of the integro-differential operators associated with these expansions are also presented.
The article also provides explicit Dyson-Phillips expansions and a refined analysis of the norm of these integro-differential operators. Under some natural and easily verifiable regularity conditions we derive a series of exponential decays inequalities with respect to the time horizon. We illustrate the impact of these results with a second order extension of the Alekseev-Gröbner lemma to nonlinear measure valued semigroups and interacting diffusion flows. This second order perturbation analysis provides direct proofs of several uniform propagation of chaos properties w.r.t. the time parameter, including bias, fluctuation error estimate as well as exponential concentration inequalities.
Keywords : Nonlinear diffusions, mean field particle systems, variational equations, logarithmic norms, gradient flows, Taylor expansions, contraction inequalities, Wasserstein distance, Bismut-Elworthy-Li formulae.
Mathematics Subject Classification : 65C35, 82C80, 58J65, 47J20.
1 Introduction
1.1 Description of the models
For any we let be the convex set of probability measures on with absolute -th moment and equipped with the Wasserstein distance of order denoted by . Also let be some Lipschitz function from into and let be an -dimensional Brownian motion defined on some filtered probability space . We also consider the Hilbert space equipped with the inner product . Up to a probability space enlargement there is no loss of generality to assume that contains square integrable -valued variables independent of the Brownian motion.
For any and any time horizon we denote by the stochastic flow defined for any and any starting point by the McKean-Vlasov diffusion
[TABLE]
In the above display, stands for the evolution semigroup on defined by the formulae
[TABLE]
We denote by the generator of the stochastic flow . The existence of the stochastic flow is ensured by the Lipschitz property of the drift function see for instance [41, 47]. To analyze the smoothness of the semigroup we need to strengthen this condition.
We shall assume that the function is differentiable at any order with uniformly bounded derivatives. In addition, the partial differential matrices w.r.t. the first and the second coordinate are uniformly bounded; that is for any we have
[TABLE]
In the above display, stands for the spectral norm of some matrix , where stands for the transpose of , and the maximal and minimal eigenvalue. In the further development of the article, we shall also denote by the symmetric part of a matrix . In the further development of the article we represent the gradient of a real valued function as a column vector, or equivalently as the transpose of the differential-Jacobian operator which is, as any cotangent vector, represented by a row vector. The gradient and the Hessian of a column vector valued function as tensors of type and , see for instance (3.1).
The mean field particle interpretation of the nonlinear diffusion (1.1) is described by a system of -interacting diffusions defined by the stochastic differential equations
[TABLE]
In the above display, stands for independent random variables with common distribution , and are independent copie of the Brownian motion .
McKean-Vlasov diffusions and their mean field type particle interpretations arise in a variety of application domains, including in porous media and granular flows [7, 8, 18, 67], fluid mechanics [58, 59, 61, 68], data assimilation [10, 26, 36], and more recently in mean field game theory [9, 14, 13, 15, 16, 17, 46, 43], and many others.
The origins of this subject certainly go back to the beginning of the 1950s with the article by Harris and Kahn [45] using mean field type splitting techniques for estimating particle transmission energies. We also refer to the pioneering article by Kac [50, 51] on particle interpretations of Boltzmann and Vlasov equations, and the seminal articles by McKean [58, 59] on mean field particle interpretations of nonlinear parabolic equations arising in fluid mechanics. Since this period, the analysis of this class of mean field type nonlinear diffusions and their discrete time versions have been developed in various directions. For a survey on these developments we refer to [15, 26, 65], and the references therein.
The McKean-Vlasov diffusions discussed in this article belong to the class of nonlinear Markov processes. One of the most important and difficult research questions concerns the regularity analysis and more particularly the stability and the long time behavior of these stochastic models.
In contrast with conventional Markov processes, one of the main difficulty of these Markov processes comes from the fact that the evolution semigroup is nonlinear w.r.t. the initial condition of the system. The additional complexity in the analysis of these models is that their state space is the convex set of probability measures, thus conventional functional analysis and differential calculus on Banach space cannot be directly applied.
The main contribution of this article is the development of a second order differential calculus to analyze the regularity and the stability properties of the distribution semigroup associated with McKean-Vlasov diffusions. This methodology provides second order Taylor type expansions with remainder for both the evolution semigroup as well as the stochastic flow associated with this class of nonlinear diffusions. We also provide a refined analysis of the norm of these integro-differential operators with a series of exponential decays inequalities with respect to the time horizon.
The article is organized as follows:
The main contributions of this article are briefly discussed in section 1.2. The main theorems are stated in some detailed in section 2. Section 3 provides some pivotal results on tensor integral operators and on integro-differential operators associated with the second order Taylor expansions of the semigroup . Section 4 is dedicated to the analysis of the tangent process associated with the nonlinear diffusion flow. We presents explicit Dyson-Phillips expansions as well as some spectral estimates. The last section, section 4 is mainly concerned with the proofs of the first and second order Taylor expansions. The proof of some technical results are collected in the appendix. Detailed comparisons with existing literature on this subject are also provided in section 2.5.
1.2 Statement of some main results
One of the main contribution of the present article is the derivation of a second order Taylor expansion with remainder of the semigroup on probability spaces. For any pair of measures , these expansions take basically the following form:
[TABLE]
In the above display, stands some first and second order operators, with . A more precise description of these expansions and the remainder terms is provided in section 2.2.
Section 2.3.1, also provides an almost sure second order Taylor expansions with remainder of the random state of the McKean diffusion w.r.t. the initial distribution . These almost sure expansions take basically the following form
[TABLE]
for some random functions from into , with . A more precise description of these almost sure expansions is provided in section 2.3.1 (see for instance (2.19) and theorem 2.6).
Given some random variable with distribution , observe that the stochastic flow satisfies the -valued stochastic differential equation
[TABLE]
In the above display, stands for the drift function from into itself defined by the formula
[TABLE]
In the above display, stands for an independent copy of . The above Hilbert space valued representation of the McKean-Vlasov diffusion (1.1) readily implies that for any we have the exponential contraction inequality
[TABLE]
for some , as soon as the following condition is satisfied
[TABLE]
for any and any . In addition, in this framework the first order differential of the stochastic flow coincides with the conventional Fréchet derivative of functions from an Hilbert space into another. In addition, we shall see that the gradient of first order operator coincides with the dual of the tangent process associated with the Hilbert space-valued representation (1.6) of the McKean-Vlasov diffusion (1.1); that is, for any smooth function we have that the dual tangent formula
[TABLE]
A more precise description of the Fréchet differential and the dual operator is provided in section 2.1 and section 4. A proof of the above formula is provided in theorem 4.8.
The Taylor expansions discussed above are valid under fairly general and easily verifiable conditions on the drift function. For instance, the regularity condition (1.2) is clearly satisfied for linear drift functions. As it is well known, dynamical systems and hence stochastic models involving drift functions with quadratic growth require additional regularity conditions to ensure non explosion of the solution in finite time.
Of course the expansions (1.4) and (1.5) will be of rather poor practical interest without a better understanding of the differential operators and the remainder terms. To get some useful approximations, we need to quantify with some precision the norm of these operators. A important part of the article is concerned with developing a series of quantitative estimates of the differential operators and the remainder term; see for instance theorem 2.3 and theorem 2.4.
To avoid estimates that grow exponentially fast with respect to the time horizon, we need to estimate with some precision the operator norms of the differential operators in (1.4). To this end, we shall consider an additional regularity condition:
* : There exists some and such that for any and any time horizon we have*
[TABLE]
In the above display, stands for the identity matrix and the matrix-valued function defined by
[TABLE]
Whenever (1.9) and (1.10) are met for some parameters and all the exponential estimates stated in the article remains valid but they grow exponentially fast with respect to the time horizon. More detailed comments on the above regularity conditions, including illustrations for linear drift and gradient flow models, as well as comparisons with related conditions used in the literature on this subject are also provided in section 2.4.
Under the above condition, we shall develop several exponential decays inequalities for the norm of the differential operators as well as for the remainder terms in the Taylor expansions. The first order estimates are given in (2.6), the ones on the Bismut-Elworthy-Li gradient and Hessian extension formulae are provided in (2.7) and (2.8). Second and third order estimates can also be found in (2.12) and (2.15).
The second order differential calculus discussed above provides a natural theoretical basis to analyze the stability properties of the semigroup and the one of the mean field particle system discussed in (1.3).
For instance, a first order Taylor expansion of the form (1.4) already indicates that the sensitivity properties of the semigroup w.r.t. the initial condition are encapsulated in the first order differential operator . Roughly speaking, whenever is satisfied, we show that there exists some parameter such that
[TABLE]
for some operator norms . For a more precise statement we refer to theorem 2.2 and the discussion following the theorem.
The second order expansion (1.4) also provides a natural basis to quantify the propagation of chaos properties of the mean field particle model (1.3). Combining these Taylor expansions with a backward semigroup analysis we derive a a variety of uniform mean error estimates w.r.t. the time horizon. This backward second order analysis can be seen a second order extension of the Alekseev-Gröbner lemma [1, 42] to nonlinear measure valued and stochastic semigroups. For a more precise statement we refer to theorem 2.7. As in (1.11), one of the main feature of the expansion (1.4) is that it allows to enter the stability properties of the limiting semigroup into the analysis of the flow of empirical measures .
Roughly speaking, this backward perturbation analysis can be interpreted as a second order variation-of-constants technique applied to nonlinear equations in distribution spaces. As in the Ito’s lemma, the second order term is essential to capture the quadratic variation of the processes, see for instance the recent articles [35, 48] in the context of conventional stochastic differential equation, as well as in [4, 31] in the context of interacting jump models.
The discrete time version of this backward perturbation semigroup methodology can also be found in chapter 7 in [25], a well as in the articles [27, 28, 30] and [34, 37] for general classes of mean field particle systems.
The central idea is to consider the telescoping sum on some time mesh given by the interpolating formula
[TABLE]
Applying (1.4) and whenever we have the second order approximation
[TABLE]
with the local fluctuation random fields
[TABLE]
For discrete generation particle systems, are defined by conditionally independent variables given the system . For a more rigorous analysis we refer to section 2.3.2.
The above decomposition shows that the first order operator reflects the fluctuation errors of the particle measures, while the second order term encapsulates their bias. In other words, estimating the norm of second order operator allows to quantify the bias induced by the interaction function, while the estimation of first order term is used to derive central limit theorems as well as -mean error estimates.
As in (1.11), these estimates take basically the following form. For and any sufficiently regular function we have
[TABLE]
In addition, we have the uniform bias estimate w.r.t. the time horizon
[TABLE]
In the above display, stands for some operator norm, and stands for some finite constants whose values doesn’t depend on the time horizon. We emphasize that the above results are direct consequence of a second order extension of the Alekseev-Gröbner type lemma for particle density profiles. For more precise statements we refer to theorem 2.7 and the discussion following the theorem.
1.3 Some basic notation
Let be the set of bounded linear operators from a normed space into a possibly different normed space equipped with the operator norm . When we write instead of .
With a slight abuse of notation, we denote by the identity -matrix, for any , as well as the identity operator in . We also denote by any (equivalent) norm on some finite dimensional vector space over .
We also use the conventional notation , , , and so on for the partial derivatives w.r.t. some real valued parameters , , and .
We let be the gradient column vector associated with some smooth function from into . Given some smooth function from into we denote by the gradient matrix associated with the column vector function . We also let be the second order differential operator defined for any twice differentiable function on by the Hessian-type formula
[TABLE]
We consider the space of -differentiable functions and we denote by the subspace of functions such that
[TABLE]
We equip with the norm
[TABLE]
When there are no confusions, we drop to lower symbol and we write instead of the supremum norm of some real valued function. We let be the identify function on and for any and we set
[TABLE]
For any , we also denote by some polynomial function of with . When we write instead of .
Under our regularity conditions on the drift function, using elementary stochastic calculus for any and we check the following estimates
[TABLE]
In the above display and throughout the rest of the article, we write and with and some collection of non decreasing and non negative functions of the time parameter whose values may vary from line to line, but which only depend on the parameters , as well as on the drift function . Importantly these contants do not depend on the probability measures . We also write and when the constant do not depend on the time horizon.
2 Statement of the main theorems
2.1 First variational equation on Hilbert spaces
As expected, the Fréchet differential of the stochastic flow associated with the stochastic differential equation (1.6) satisfies an Hilbert space-valued linear equation (cf. (4.1)). The drift-matrix of this evolution equation is given by the Fréchet differential of the drift function evaluated along the solution of the flow. Mimicking the exponential notation of the solution of conventional homogeneous linear systems, the evolution semigroup (a.k.a. propagator) associated with the first variational equation is written as follows
[TABLE]
The above exponential is understood as an operator valued Peano-Baker series [64]. A more detailed presentation of these models is provided in section 4.
The -log-norm of an operator is defined by
[TABLE]
Our first main result is an extension of an inequality of Coppel [22] to tangent processes associated with Hilbert-space valued stochastic flows.
Theorem 2.1**.**
For any time horizon and any we have the log-norm estimate
[TABLE]
In addition, we have
[TABLE]
The proof of the above theorem in provided in section 4.1.
Let be a pair of random variables with distributions . Also let be the probability distribution of the random variable
[TABLE]
This observation combined with the above theorem yields an alternative and more direct proof of an exponential Wasserstein contraction estimate obtained in [5]. Namely, using (2.2) we readily check the -exponential contraction inequality
[TABLE]
For any function with bounded derivative we also quote the first order expansion
[TABLE]
In the above display, stands for the conventional inner product on . The above assertion is a direct consequence of theorem 4.8.
2.2 Taylor expansions with remainder
The first expansion presented in this section is a first order linearization of the measure valued mapping in terms of a semigroup of linear integro-differential operators.
Theorem 2.2**.**
For any and , there exists a semigroup of linear operators from into itself such that
[TABLE]
In addition, when is satisfied we have the gradient estimate
[TABLE]
The proof of the above theorem with a more explicit description of the first order operators are provided in section 4.3. In (2.6) we can choose , with the parameter introduced in (1.10). The semigroup property is a consequence of theorem 4.5 and the gradient estimates is a reformulation of the operator norm estimate discussed in (4.14).
We also provide Bismut-Elworthy-Li-type formulae that allow to extend the gradient and Hessian operators with to measurable and bounded functions. When the condition is satisfied we show the following exponential estimates
[TABLE]
In addition, we have the Hessian estimate
[TABLE]
The proof of the first assertion can be found in remark 4.7 on page 4.7. The proof of the Hessian estimates is a consequence of the decomposition of discussed in (5.1) and the Hessian estimates (3.26) and (3.41).
It is worth mentioning that the semigroup property is equivalent to the chain rule formula
[TABLE]
which is valid for any . Without further work, theorem 2.2 also yields the exponential -contraction inequality
[TABLE]
with the same parameter a in (2.6). In the same vein, the estimate (2.7) yields the total variation estimate
[TABLE]
with the same parameter a in (2.7). In all the inequalities discussed above we can choose any parameter such that , with the parameter introduced in (1.10). In the -contraction inequality (2.10) we can choose . A more refined estimate is provided in section 2.4.
Next theorem provides a first order Taylor expansion with remainder.
Theorem 2.3**.**
For any and , there exists a linear operators from into such that
[TABLE]
with the first order operator introduced in theorem 2.2. In addition, when is satisfied we also have the estimate
[TABLE]
The proof of the above theorem in provided in section 5.2. A more precise description of the second order operator is provided in (5.9) and (5.13). Using (2.11) and arguing as in the proof of proposition 2.1 in [4], for any twice differentiable function with bounded derivatives we check the backward evolution equation
[TABLE]
with the first order operator introduced in theorem 2.3. The above equation is a central tool to derive an extended version of the Alekseev-Gröbner lemma [1, 42] to measure valued semigroups and interacting diffusions (cf. theorem 2.7).
Next theorem provides a second order Taylor expansion with remainder.
Theorem 2.4**.**
For any and , there exists a linear operators from into such that
[TABLE]
with the second order operator introduced in theorem 2.3. In addition, when is satisfied we have the third order estimate
[TABLE]
The proof of the first part of the above theorem in provided in section 5.3. We can choose in (2.15) any parameter such that , with the parameter introduced in (1.10). The proof of the third order estimate (2.15) is rather technical, thus it is provided in the appendix, on page Proof of the estimate (2.15).
2.3 Illustrations
The first part of this section states with more details the almost sure expansions discussed in (1.5). Up to some differential calculus technicalities, this result is a more or less direct consequence of the Taylor expansions with remainder presented in theorem 2.3 and theorem 2.4 combining with a backward formula presented in [5].
The second part of this section is concerned with a second order extension of the Alekseev-Gröbner lemma to nonlinear measure valued semigroups and interacting diffusion flows. This second order stochastic perturbation analysis is also mainly based on the second order Taylor expansion with remainder presented in theorem 2.4 .
In the further development of this section without further mention we shall assume that condition is satisfied.
2.3.1 Almost sure expansions
We recall the backward formula
[TABLE]
The above formula combined with (2.4) and the tangent process estimates presented in section 3.3 yields the uniform almost sure estimates
[TABLE]
The above estimate is a consequence of (2.4) and conventional exponential estimates of the tangent process (cf. for instance (3.2)). A detailed proof of this claim and the backward formula (2.16) can be found in [5].
We extend the operators introduced in theorem 2.4 to tensor valued functions with by considering the same type tensor function with entries
[TABLE]
for any . A brief review on tensor spaces is provided in section 3.1. We also consider the function
[TABLE]
Combining the first order formulae stated in theorem 2.3 with conventional Taylor expansions we check the following theorem.
Theorem 2.5**.**
For any , and we have the almost sure expansion
[TABLE]
with the second order remainder function such that
[TABLE]
The detailed proof of the above theorem is provided in the appendix, on page Proof of theorem 2.6.
Second order expansions are expressed in terms of the functions defined for any and for any by the formulae
[TABLE]
We associate with these objects the function defined by
[TABLE]
In the above display, stands for the functions given by
[TABLE]
We are now in position to state the main result of this section.
Theorem 2.6**.**
For any , and we have the almost sure expansion
[TABLE]
with a third order remainder function such that
[TABLE]
The proof of the above theorem is provided in the appendix, on page Proof of theorem 2.6. In the remainder term estimates presented in the above theorems, we can choose any parameter such that , with the parameter introduced in (1.10).
2.3.2 Interacting diffusions
For any , the -mean field particle interpretation associated with a collection of generators is defined by the Markov process with generators given for any sufficiently smooth function and any by
[TABLE]
with the function
[TABLE]
We extend to symmetric functions on by setting
[TABLE]
In this notation, in our context we readily check that
[TABLE]
for any symmetric function , with the function on defined for any by the formula
[TABLE]
A proof of the above formula is provided in the appendix, on page Proof of (2.22). Applying Ito’s formula, for any smooth function we prove that
[TABLE]
In the above display, stands for a martingale random field with angle bracket
[TABLE]
The above evolution equation is rather standard in mean field type interacting particle system theory, a detailed proof can be found in [29] (see for instance section 4.3). In the same vein, with some obvious abusive notation, using (2.22) we have
[TABLE]
We fix a final time horizon and we denote by
[TABLE]
the martingale associated with the predictable function
[TABLE]
Combining the Itô formula with the tensor product formula (2.22) and with the backward formula (2.13) we obtain
[TABLE]
This implies that
[TABLE]
This yields the following theorem.
Theorem 2.7**.**
For any time horizon , the interpolating semigroup satisfies for any with the evolution equation
[TABLE]
The above theorem can be seen as a second order extension of the Alekseev-Gröbner lemma [1, 42] to nonlinear measure valued and stochastic semigroups. This result also extends the perturbation theorem obtained in [4] (cf. theorem 3.6) in the context of interacting jumps processes to McKean-Vlasov diffusions. The discrete time version of the backward perturbation analysis described above can also be found in [27, 28, 30] in the context of Feynman-Kac particle models (see also [25, 26, 31]).
We end this section with some direct consequences of the above theorem. Firstly, using (2.6) and (2.12) we have the almost sure estimates
[TABLE]
Without further work, the above inequality yields the uniform bias estimate stated in the r.h.s. of (1.13), for any twice differentiable function with bounded derivatives. Using well known martingale concentration inequalities (cf. for instance lemma 3.2 in [60]), there exists some finite parameter such that for any and any the probability of the following event
[TABLE]
is greater than . In addition, using the Burkholder-Davis-Gundy inequality, for any we obtain the time uniform estimates stated in the r.h.s. of (1.12). On the other hand, using (2.5) and (2.6) we have the almost sure exponential contraction inequality
[TABLE]
This yields the bias estimates
[TABLE]
for any twice differentiable function with bounded derivatives. The r.h.s. estimate comes from well known estimates of the average of the Wassertein distance for occupation measures, see for instance [38] and the more recent studies [40, 56]. The above inequality yields the following uniform bias estimate
[TABLE]
2.4 Comments on the regularity conditions
We discuss in this section the regularity condition introduced in (1.9). We illustrate these spectral conditions for linear-drift and gradient flow models. Comparisons with related conditions presented in other works are also provided.
Firstly, we mention that the condition stated in (1.9) has been introduced in the article [5] to derive several Wasserstein exponential contraction inequalities as well as uniform propagation of chaos estimates w.r.t. the time horizon.
Using the log-norm triangle inequality and recalling that the log-norm is dominated by the spectral norm we check that
[TABLE]
Choosing and as the supremum of the maximal eigenvalue functional of the matrices and , the Cauchy interlacing theorem (see for instance [55] on page 294) yields .
For linear drift functions
[TABLE]
the matrix reduces to the two-by-two block partitioned matrix
[TABLE]
In this situation the diffusion flow is given by the formula
[TABLE]
In the one dimensional case we have
[TABLE]
Nonlinear Langevin diffusions are associated with the drift function
[TABLE]
some confinement type potential function (a.k.a. the exterior potential) and some interaction potential function . In this context we have
[TABLE]
When the potential function is even and convex we have
[TABLE]
In the reverse angle, when the function is odd we have the formula
[TABLE]
In both situations, condition is satisfied when the strength of the confinement type potential dominates the one of the interaction potential; that is when we have that
[TABLE]
The decay rate in the -contraction inequality (2.4) is larger than the decay rate in the -contraction inequality (2.10). In addition, the -exponential stability requires that dominates the spectral norm of the matrix . Next we provide a more refined analysis based on the proof of the -contraction inequality presented in [5]. Using the interpolating paths introduced in (2.3) we set
[TABLE]
In the above display stands for an independent copy of . Arguing as in [5] we have
[TABLE]
We consider the symmetric and anti-symmetric matrices
[TABLE]
and we set
[TABLE]
By symmetry arguments and using some elementary manipulations we check the formula
[TABLE]
This shows that
[TABLE]
with the parameter given by
[TABLE]
We conclude that the -contraction inequality (2.10) is met with .
In a more recent article [69] the author presents some Wasserstein contraction inequalities of the same form as in (2.4) with replaced by some parameter , under the assumption
[TABLE]
Taking Dirac measures and we check that the above condition is equivalent to the fact that
[TABLE]
By symmetry arguments this implies that
[TABLE]
For the linear drift model discussed in (2.25) the above condition reads
[TABLE]
We also have with .
2.5 Comparisons with existing literature
The perturbation analysis developed in the article differs from the Otto differential calculus on introduced in [61] and further developed by Ambrosio and his co-authors [2, 3] and Otto and Villani in [62]. These sophisticated gradient flow techniques in Wasserstein metric spaces are based on optimal transport theory.
The central idea is to interpret as an infinite dimensional Riemannian manifold. In this context, the Benamou-Brenier formulation of the Wasserstein distance provides a natural way to define geodesics, gradients and Hessians w.r.t. the Wasserstein distance. The details of these gradient flow techniques are beyond the scope of the semigroup perturbation analysis considered herein.
This methodology is mainly used to quantify the entropy dissipation of Langevin-type nonlinear diffusions. Thus, it cannot be used to derive any Taylor expansion of the form (1.4) nor to analyze the stability properties of more general classes of McKean-Vlasov diffusions.
Besides some interesting contact points, the methodology developed in the present article doesn’t rely on the more recent differential calculus on developed by P.L. Lions and his co-authors in the seminal works on mean field game theory [14, 43]. In this context, the first order Lions differential of a smooth function from into is defined as the conventional derivative of lifted real valued function acting on the Hilbert space of square integrable random variables. In this interpretation, for a given test function, say the gradient of the first order differential in (1.4) can be seen as the Lions derivative of the lifted scalar function , for some random variable with distribution .
In the recent book [15], to distinguish these two notions, the authors called the random variable the linear functional derivative. For a more thorough discussion on the origins and the recent developments in mean field game theory, we refer to the book [15] as well as the more recent articles [13, 19, 23] and the references therein.
To the best of our knowledge, most of the literature on Lions’ derivatives is concerned with existence theorems without a refined analysis of the exponential decays of these differentials w.r.t. the time parameter. Last but not least, from the practical point of view all differential estimates we found in the literature are rather quite deceiving since after carefully checking, they grow exponentially fast with respect to the time horizon (cf. for instance [13, 19, 20, 23]).
Taylor expansions of the form (1.4) have already been discussed in the book [26] for discrete time nonlinear measure valued semigroups (cf. for instance chapters 3 and 10). We also refer to the more recent article [4] in the context of continuous time Feynman-Kac semigroups. In this context, we emphasize that the semigroup is explicitly given by a normalization of a linear semigroup of positive operators. Thus, a fairly simple Taylor expansion yields the second order formula (1.4). In contrast with Feynman-Kac models, McKean-Vlasov semigroups don’t have any explicit form nor an analytical description. As a result, none of above methodologies cannot be used to analyze nonlinear diffusions.
The second order perturbation analysis discussed in this article has been used with success in [27, 28, 30] to analyze the stability properties of Feynman-Kac type particle models, as well as the fluctuations and the exponential concentration of this class of interacting jump processes; see also [34, 37] for general classes of discrete generation mean field particle systems, a well as chapter 7 in [25] and [4, 31] for continuous time models.
These second order perturbation techniques have also been extended in the seminal book by V.N. Kolokoltsov [52] to general classes of nonlinear Markov processes and kinetic equations. Chapter 8 in [52] is dedicated to the analysis of the first and the second order derivatives of nonlinear semigroups with respect to initial data. The use of the first and the second order derivatives in the analysis of central limit theorems and propagation of chaos properties respectively is developed in Chapters 9 and Chapter 10 in [52]. We underline that these results are obtained for diffusion processes as well as for jump-type processes and their combinations, see also [53, 54].
Nevertheless none of these studies apply to derive non asymptotic Taylor expansions (2.14) and (2.20) with exponential decay-type remainder estimates for McKean-Vlasov diffusions nor to estimate the stability properties of the associated semigroups. In addition, to the best of our knowledge the stochastic perturbation theorem 2.7 is the first result of this type for mean field type interacting diffusions.
Last but not least, the idea of considering the flow of empirical measures of a mean field particle model as a stochastic perturbation of the limiting flow certainly goes back to the work by Dawson [24], itself based on the martingale approach developed by Papanicolaou, Stroock and Varadhan in [63], published in the end of the 1970’s. These two works are mainly centered on fluctuation type limit theorems. They don’t discuss any Taylor expansion on the limiting semigroup nor any question related to the stability properties of the underlying processes.
3 Some preliminary results
The first part of this section provides a review of tensor product theory and Fréchet differential on Hilbert spaces. Section 3.1 is concerned with conventional tensor products and Fréchet derivatives. Section 3.2 provides a short introduction to tensor integral operators.
In the second part of this section we review some basic tools of the theory of stochastic variational equations, including some differential properties of Markov semigroups. Section 3.3 is dedicated to variational equations. Section 3.5 discusses Bismut-Elworthy-Li extension formulae. We also provide some exponential inequalities for the gradient and the Hessian operators on bounded measurable functions.
The differential operator arising in the Taylor expansions (1.4) are defined in terms of tensor integral operators that depend on the gradient of the drift function of the nonlinear diffusion. These integro-differential operators are described in section 3.6. The last section, section 3.7 provides some differential formulae as well as some exponential decays estimates of the norm of these operators w.r.t. the time horizon.
3.1 Fréchet differential
We let stands for the set of multiple indexes over some finite set . Notice that . We denote by the space of -tensor with real entries . Given a -tensor and a -tensor we denote by the -tensor defined by
[TABLE]
For a given -tensor and a given tensor , the product and the transposition are the and tensors with entries
[TABLE]
We equip with the Frobenius inner product
[TABLE]
Identifying -tensors with column vectors the above quantities coincide with the conventional Euclidian inner product and norm on the product space . When we simplify notation and we set instead of . For any tensors and with appropriate dimensions, using Cauchy-Schwartz inequality we check that
[TABLE]
Let be the Hilbert space of -valued random variables defined on some probability space , equipped with the inner product
[TABLE]
induced by the inner product on . We denote by the entry-wise expected value of a -tensor.
When and the space coincides with be the Hilbert space of square integrable -valued and -measurable random variables.
We denote by
[TABLE]
the non decreasing sequence of Hilbert spaces associated with some increasing filtration .
In Landau notation, we recall that a function
[TABLE]
is said to be Fréchet differentiable at if there exists a continuous map
[TABLE]
such that
[TABLE]
3.2 Tensor integral operators
Let be the set of bounded measurable functions from a measurable space into some tensor space . Signed measures on act on bounded measurable functions from into . We extend these integral operators to tensor valued functions by setting for any
[TABLE]
Let and be some pair of measurable spaces. A -tensor integral operator
[TABLE]
is defined for and by the tensor valued and measurable function with entries given and by the integral formula
[TABLE]
for some collection of integral operators from into . We also consider the operator norm
[TABLE]
The tensor product of a couple of -tensor integral operators
[TABLE]
is a -tensor integral operator
[TABLE]
with the product spaces
[TABLE]
The entries of are given for any and any pair of multi-indices , by the integral formula
[TABLE]
with the tensor product measures defined for any and any by
[TABLE]
3.3 Variational equations
The gradient and the Hessian of a multivariate smooth function is defined by the and tensors and with entries given for any and by the formula
[TABLE]
We consider the tensor valued functions and defined for any by
[TABLE]
with the and -tensor valued functions
[TABLE]
In the above display, stands for the partial derivative of the scalar function w.r.t. the coordinate , with the drift function from into introduced in section 1.1, In the same vein, and stands for the second and third partial derivatives of w.r.t. the coordinates , and with .
For any and we also consider the tensor functions
[TABLE]
Recalling that has continuous and uniformly bounded derivatives up to the third order, the stochastic flow is a twice differentiable function of the initial state . In addition, when holds the gradient of the diffusion flow satifies the -matrix valued stochastic diffusion equation
[TABLE]
The above estimate is a direct consequence of well known log-norm estimates for exponential semigroups, see for instance [22] as well as section 1.3 in the recent article [11].
We have the stochastic tensor evolution equation
[TABLE]
This implies that
[TABLE]
from which we check that
[TABLE]
Using (3.2), this yields the estimate
[TABLE]
More generally, using the multivariate version of the de Faà di Bruno derivation formula [21] (see also formula (5.14) in the appendix), for any we also check the uniform estimate
[TABLE]
A detailed proof is provided in the appendix, on page Proof of (3.4).
3.4 Differential of Markov semigroups
We have the commutation formula
[TABLE]
with the -tensor integral operator defined for any and any differentiable function on by the formula
[TABLE]
The tensor product of is also given by the -tensor integral operator
[TABLE]
In the above display, stands for an independent copy of and stands for the matrix valued function defined in (1.14). We also have the commutation formula
[TABLE]
In the same vein, we have the second order differential formula
[TABLE]
with the and -tensor integral operators
[TABLE]
Iterating the above procedure, the -th differential of at any order takes the form
[TABLE]
for some integral operators . For instance, we have the third order differential formula
[TABLE]
with the and -tensor integral operators
[TABLE]
with the \mathbin{\mathchoice{\vbox{ \halign{#\cr\displaystyle{}{\frown}\displaystyle\otimes\cr} }}{\vbox{ \halign{#\cr\textstyle{}{\frown}\textstyle\otimes\cr} }}{\vbox{ \halign{#\cr\scriptstyle{}{\frown}\scriptstyle\otimes\cr} }}{\vbox{ \halign{#\cr\scriptscriptstyle{}{\frown}\scriptscriptstyle\otimes\cr} }}}-tensor product of type given for any and by
[TABLE]
The above formulae remains valid for any column vector multivariate function . An explicit description of the integral operators for any can be obtained using multivariate derivations and combinatorial manipulations, see for instance the multivariate version of the de Faà di Bruno derivation formulae (5.14) and (5.15) in the appendix. Following the proof of (3.4) we also check the uniform estimates
[TABLE]
Using the moment estimates (1.15) for any , , and any , we also check the rather crude estimate
[TABLE]
For instance, using the de Faà di Bruno derivation formula (5.15) for any function such that and for any we check that
[TABLE]
The estimates (1.15) implies that
[TABLE]
from which we conclude that
[TABLE]
3.5 Bismut-Elworthy-Li extension formulae
We have the Bismut-Elworthy-Li formula
[TABLE]
The above formula is valid for any function of the following form
[TABLE]
for some non decreasing differentiable function on with bounded continuous derivatives and such that
[TABLE]
In the same vein, for any we have
[TABLE]
with the stochastic process
[TABLE]
Besides the fact that is a nonlinear diffusion, the proof of the above formula follows the same proof as the one provided in [6, 12, 39, 57, 66] in the context of diffusions on differentiable manifolds. For the convenience of the reader, a detailed proof is provided in the appendix on page Proof of (3.22) and (3.24). Using (3.22), for any s.t. we check that
[TABLE]
Let with be some differentiable function on null on and such that and , for instance we can choose
[TABLE]
In this situation, we find the rather crude uniform estimate
[TABLE]
In the same vein, combining (3.24) with the estimate (3.3) for any and we also check the rather crude uniform estimate
[TABLE]
Choosing in the above display we readily check that
[TABLE]
3.6 Integro-differential operators
Let be the matrix-valued function defined for any , and any by the formulae
[TABLE]
For instance, for the linear model discussed in (2.24) we have
[TABLE]
We also consider the collection Weyl chambers defined for any by
[TABLE]
We consider the space-time Weyl chambers
[TABLE]
The coordinates of a generic point for some are denoted by
[TABLE]
We also use the convention and . We consider the measures on given on every set and any by
[TABLE]
with the tensor product measures
[TABLE]
Definition 3.1**.**
Let be the function defined for any , , and any and by the formula
[TABLE]
In the above display the product of matrices is understood as a directed product from to . For instance, for the linear model discussed in (2.24) we have
[TABLE]
For any , and any and we also set
[TABLE]
Definition 3.2**.**
For any and we let be the operator defined on differentiable functions on by
[TABLE]
with the -tensor integral operator defined by the integral formula
[TABLE]
Recall that is differentiable at any order with uniformly bounded derivatives. Thus, using the estimates (1.15) and (3.4), for any , we have
[TABLE]
Definition 3.3**.**
Let be the function defined for any and by the formula
[TABLE]
In this notation, we readily check the following proposition.
Proposition 3.4**.**
The -tensor integral operator can be rewritten as follows:
[TABLE]
For instance, for the linear model discussed in (2.24) the function defined in (3.33) reduces to
[TABLE]
We check this claim expanding in (3.33) the exponential series coming from the integration over the set . A detailed proof of the above formula is provided in the appendix on page Proof of (3.34).
3.7 Some differential formulae
The matrix defined in (3.27) can alternatively be written as follows
[TABLE]
We also have the and -tensor formulae
[TABLE]
For any with and for any we have the -tensor formulae
[TABLE]
We consider the -tensor valued function
[TABLE]
and we use the convention
[TABLE]
For instance, for the linear model discussed in (2.24) and (3.34) the above objects reduce to
[TABLE]
In this notation, we have the following proposition.
Proposition 3.5**.**
For any the -th differential of the operator is given by the formula
[TABLE]
with the -tensor integral operator given by
[TABLE]
In addition, when condition is satisfied for any we have the exponential estimates
[TABLE]
Proof.
The proof of the first assertion follows from (3.33). More precisely, using (3.33) we have
[TABLE]
On the other hand, by proposition 3.4 we also have
[TABLE]
This ends the proof of the first assertion. When condition is satisfied, for any and we have
[TABLE]
Using (3.4) we also check the uniform estimate
[TABLE]
The end of the proof is now a consequence of (3.2).
Proposition 3.6**.**
For any any bounded function on and for any function of the form (3.23) we have the Bismut-Elworthy-Li formula
[TABLE]
In the above display, stands for the stochastic process defined in (3.22). In addition, when condition is satisfied we have the exponential estimates
[TABLE]
Proof.
The proof of the first assertion is a direct application of the Bismut-Elworthy-Li formula (3.22). More precisely, using (3.22) we have
[TABLE]
The formula (3.40) is now a direct consequence of (3.36).
We check (3.41) combining (3.25) with (3.39). This ends the proof of the proposition.
When we drop the upper index and we write instead of .
The operators discussed above are indexed by a pair of measures . To simplify notation, when we suppress one of the indices and we write and instead of and .
4 Tangent processes
The tangent process associated with the diffusion flow introduced in (1.6) is given for any by the evolution equation
[TABLE]
In the above display, stands for the Fréchet differential of the drift function defined for any by
[TABLE]
where stands for an independent copy of .
4.1 Spectral estimate
This section is mainly concerned with the proof of theorem 2.1.
For any pair of random variables we have the duality formula
[TABLE]
with the dual operator defined by the formula
[TABLE]
In the above display, stands for an independent copy of . The symmetric part of is given by the formula
[TABLE]
We are now in position to prove theorem 2.1.
The first assertion is a direct consequence of the evolution equation
[TABLE]
Whenever is met we have for some . In this situation, the r.h.s. estimate in (2.2) is a direct consequence of (2.1). Given an independent copy of we have
[TABLE]
This yields the log-norm estimate
[TABLE]
The proof of theorem 2.1 is now completed.
4.2 Dyson-Phillips expansions
In the further development of this section we shall denote by
[TABLE]
a collection of independent copies of the stochastic flows and some given . To simplify notation, we also set
[TABLE]
We are now in position to state and prove the main result of this section.
Theorem 4.1**.**
The tangent process is given for any and any with distribution by the Dyson-Phillips series
[TABLE]
with the boundary conventions
[TABLE]
Proof.
For any and we have
[TABLE]
and
[TABLE]
In addition, for any and we have
[TABLE]
Combining the above with (4.1) we check that
[TABLE]
In the above display, stands for the gradient of the random function
[TABLE]
Equivalently, we have
[TABLE]
and therefore
[TABLE]
Now, the end of the proof of (4.4) follows a simple induction, thus it is skipped.
Corollary 4.2**.**
For any and for any with distribution we have
[TABLE]
with the boundary conditions
[TABLE]
4.3 Gradient semigroup analysis
This section is concerned with a gradient semigroup description of the dual of the tangent process.
Definition 4.3**.**
For any and we let be the operator defined on differentiable functions on by
[TABLE]
In the above display, stands for the operator defined in (3.31).
Rewritten in terms of expectation operators we have
[TABLE]
Recall that is differentiable at any order with uniformly bounded derivatives. Thus, arguing as in the proof of (3.21) and (3.32) for any , we have
[TABLE]
In the same vein, we check that
[TABLE]
The proof of the above estimate is rather technical, thus it is housed in the appendix on page Proof of (4.7).
Remark 4.4**.**
Using the Bismut-Elworthy-Li formula (3.40), we extend the operators with to non necessarily differentiable and bounded functions.
We also extend the operator to tensor functions by considering the tensor function with entries
[TABLE]
In this situation, the function introduced in (3.33) takes the form
[TABLE]
Let be the collection of integro-differential operators indexed by defined by
[TABLE]
We also set
[TABLE]
In this notation, we have the first order expansion
[TABLE]
Theorem 4.5**.**
For any and any the operator coincides with the evolution semigroup of the integro-differential operator ; that is, we have the forward evolution equation
[TABLE]
In addition, for any we have the backward evolution equation
[TABLE]
Proof.
The proof of the forward equation (4.10) is a direct consequence of the forward evolution equation
[TABLE]
associated with the Markov semigroup , thus it is skipped. The semigroup property (2.9) yields
[TABLE]
Combining the above with the forward equation (4.10) we check that
[TABLE]
This implies that
[TABLE]
from which we conclude that
[TABLE]
This yields the backward evolution equation (4.11). This ends the proof of the theorem.
Next proposition is a direct consequence of (4.5) combined with the formulae (3.5) and (3.36).
Proposition 4.6**.**
We have the commutation formula
[TABLE]
with the -tensor integral operator given by the column vector function
[TABLE]
In addition, when condition is satisfied we have
[TABLE]
Remark 4.7**.**
Following remark 4.4, using the Bismut-Elworthy-Li formula (3.40), we extend the gradient operators with to measurable and bounded functions. The exponential estimate stated in (3.41) are a direct consequence of the estimates presented in (3.41).
By (4.8) the commutation formula (4.12) is also satisfied for multivariate column functions . In this situation is a -matrix valued function.
The proof of theorem 2.2 is now a consequence of the estimate (4.14) and the fact that
[TABLE]
More precisely, using (4.9) the above formula implies that
[TABLE]
The operators discussed above are indexed by a pair of measures . To simplify notation, when we suppress one of the parameter and we write instead of .
Theorem 4.8**.**
For any , any function and any with distribution we have the gradient formula
[TABLE]
Proof.
Given a smooth function on we have
[TABLE]
Replacing by in (4.4) we check that
[TABLE]
This ends the proof of the theorem
5 Taylor expansions
This section is mainly concerned with the proof of the first and second order Taylor expansions stated in theorem 2.3 and theorem 2.4 . Section 5.1 presents some preliminary differential formulae used in the proof of the theorems.
5.1 Some differential formulae
The commutation formula (4.12) takes the form
[TABLE]
Combining (4.5) with proposition 3.5 and the second order formula (3.7) we also have
[TABLE]
In summary, we have the first and second order differential formulae
[TABLE]
Similar formulae for and can easily be found. In the same vein, using (3.9) we check the third order differential formula
[TABLE]
In addition, when condition is satisfied we have the exponential estimates
[TABLE]
Definition 5.1**.**
We let be the operator defined for any differentiable function on by
[TABLE]
with the -tensor integral operator defined by the formula
[TABLE]
Using (4.6) and (4.13) for any and we check that
[TABLE]
We also have the differential formula
[TABLE]
with the matrix valued functions
[TABLE]
When condition is satisfied we also have the exponential estimates
[TABLE]
In addition, using the Bismut-Elworthy-Li extension formulae and the estimates (2.7) and (2.8), or any bounded measurable function on we check that
[TABLE]
5.2 A first order expansion
This section is mainly concerned with the proof of theorem 2.3. The next technical lemma is pivotal.
Lemma 5.2**.**
For any for any we have the second order expansion
[TABLE]
Proof.
Combining (4.9) with the backward evolution equation (4.11) we check that
[TABLE]
On the other hand, we have
[TABLE]
Integrating from to we obtain the formula
[TABLE]
The end of the lemma is now completed.
Combining the above lemma with (4.7) and (5.5) we check (2.11) with the operator defined for any and by
[TABLE]
Remark 5.3**.**
The second order term in (2.11) can alternatively be expressed in terms of the Hessian of the semigroup ; that is, we have that
[TABLE]
with the interpolating path
[TABLE]
In the above display, stands for an independent copy of a pair of random variables with distribution . Also observe that
[TABLE]
with the centered second order operator
[TABLE]
In the above display, stands for the interpolating path
[TABLE]
Proposition 5.4**.**
We have commutation formula
[TABLE]
In addition, we have the estimate
[TABLE]
Proof.
The proof of the first assertion is a consequence of the commutation formula (4.12). Letting we have
[TABLE]
The proof of (5.12) now follows the same arguments as the ones we used in the proof of (4.14), thus it is skipped. This ends the proof of the proposition.
Combining (5.6) with the commutation formula (5.11), for any twice differentiable function and any and we check that
[TABLE]
with the operators discussed in (5.6). The proof of (2.12) is a direct consequence of (5.7) and (5.12). The proof of theorem 2.3 is now completed.
5.3 Second order analysis
This short section is mainly concerned with the proof of the first part of theorem 2.4.
Lemma 5.5**.**
For any and and we have the tensor product formula
[TABLE]
for some third order linear operator such that
[TABLE]
The proof of the above lemma is rather technical, thus it is housed in the appendix, on page Proof of lemma 5.5.
Combining the above lemma with (5.8) we readily check the second order decomposition (2.14) with a the remainder linear operator such that
[TABLE]
This ends the proof of the first part of theorem 2.4. The proof of the second part of the theorem is provided in the appendix, on page Proof of the estimate (2.15).
Acknowledgements
The authors are supported by the ANR Quamprocs on quantitative analysis of metastable processes. P. Del Moral is also supported in part from the Chair Stress Test, RISK Management and Financial Steering, led by the French Ecole polytechnique and its Foundation and sponsored by BNP Paribas.
Appendix
Proof of (2.22)
It is easy to check that this first assertion is true for any collection of generators , thus we skip the details. The proof of the second assertion is a also a direct consequence of a more general result which is valid for any collection of generators and non necessarily symmetric functions.
For any and we set
[TABLE]
We extend to functions on by setting
[TABLE]
For any function on we have
[TABLE]
with
[TABLE]
This implies that
[TABLE]
Recalling that
[TABLE]
we conclude that
[TABLE]
with the operator
[TABLE]
Observe that
[TABLE]
This yields the formula
[TABLE]
from which we conclude that
[TABLE]
with the function defined for any by
[TABLE]
The above formula readily implies (2.22) as soon as is the collection of generators associated with the stochastic flow defined in (1.1). This ends the proof of (2.22).
Proof of (3.4)
For any given , we denote by the set of partitions of the set with blocks of size , with . We also let the set of partitions of the set and the number of blocks in a given partition , and the subset of partitions s.t. .
Let be the set of multiple indexes . For any given and any subset we set
[TABLE]
For any and any multiple index we write instead of the -th partial derivatives w.r.t. the coordinates .
Let and be a couple of smooth function from into itself. In this notation for any and we have the multivariate Faà di Bruno derivation formula
[TABLE]
with the -gradient tensor
[TABLE]
We check the above formula by induction w.r.t. the parameter . In a more compact we have checked the following lemma.
Lemma 5.6**.**
For any we have the Faà di Bruno derivation formula
[TABLE]
Whenever is a random function we have
[TABLE]
with the collection of integral operators
[TABLE]
Using the above lemma we also check the stochastic tensor evolution equation
[TABLE]
with
[TABLE]
In a more compact form we have
[TABLE]
This implies that
[TABLE]
Taking the trace in the above display, we check that
[TABLE]
This yields the rather crude estimate
[TABLE]
from which we check that
[TABLE]
The summation in the above display is taken over all indices such that and and . Assume that (3.4) has been checked up to rank . In this case, we have
[TABLE]
This ends the proof of (3.4).
Proof of (3.22) and (3.24)
We recall the backward formula
[TABLE]
A detailed proof of the above formula based on backward stochastic flows can be found in theorem 3.1 in the article [5]. This implies that
[TABLE]
from which we check that
[TABLE]
This yields the formula
[TABLE]
We conclude that
[TABLE]
This ends the proof of (3.22). For any applying (3.22) to the function we have
[TABLE]
This implies that
[TABLE]
Applying (3.22) to the first term we check that
[TABLE]
We conclude that
[TABLE]
This ends the proof of (3.24).
Proof of (3.34)
We have
[TABLE]
Recalling that
[TABLE]
and using the rather well known exponential formulae
[TABLE]
we check that
[TABLE]
from which we find that
[TABLE]
This ends the proof of (3.34).
Proof of (4.7)
We have the tensor product formula
[TABLE]
We also have
[TABLE]
Recall that is differentiable at any order with uniformly bounded derivatives. Thus all differentials of the above function w.r.t. the coordinate have uniformly bounded derivatives. On the other hand, the mapping has at most linear growth. Thus, using the estimates (1.15) and (3.4), for any we check that
[TABLE]
In the same vein, we have the tensor product formula
[TABLE]
with
[TABLE]
Arguing as above and using the estimates (1.15) and (3.4) for any we check that
[TABLE]
Proof of lemma 5.5
Using the decomposition
[TABLE]
which is valid for any and any with , for any function
[TABLE]
we check that
[TABLE]
with the function
[TABLE]
In the above display, stands for the tensor product measures
[TABLE]
We also have the tensor product formula
[TABLE]
This yields the decomposition
[TABLE]
with the integral operator
[TABLE]
Arguing as in the proof of (3.21) and (4.6) we check that
[TABLE]
In the same vein, we have
[TABLE]
with
[TABLE]
and
[TABLE]
This yields the formula
[TABLE]
with the integral operator
[TABLE]
Arguing as above, we check that
[TABLE]
Combining the above decompositions we find that
[TABLE]
For any and we have
[TABLE]
We conclude that
[TABLE]
with the operator
[TABLE]
In the above display, stands for the integral operator operator
[TABLE]
We also check that
[TABLE]
This ends the proof of the lemma.
Proof of the estimate (2.15)
For any we set . In this notation, for any matrix valued function we have the tensor product formula
[TABLE]
with the matrix valued functions and given for any and by the formula
[TABLE]
Using (3.7) we have
[TABLE]
from which we check the formula
[TABLE]
By symmetry arguments, we also have
[TABLE]
Using (3.20) for any differentiable matrix valued function such that we have the uniform estimate
[TABLE]
In the same vein, we have
[TABLE]
Using the gradient and the Hessian estimates (3.2) and (3.3) for any we check that
[TABLE]
Combining the above estimates with (3.38) we check that
[TABLE]
In addition, for any we have
[TABLE]
We conclude that
[TABLE]
Arguing as above, for any we have
[TABLE]
In addition, for we have
[TABLE]
This implies that
[TABLE]
On the other hand, we have the decomposition
[TABLE]
with the matrix valued function
[TABLE]
Using the estimates (5.17) and (5.18), for any we check that
[TABLE]
Using the decomposition (5.16) we also check that
[TABLE]
with the matrix valued function
[TABLE]
Using (5.19) we find the uniform estimates
[TABLE]
On the other hand, using (4.5) and (2.5) we have
[TABLE]
Thus, recalling that
[TABLE]
we check that
[TABLE]
This implies that
[TABLE]
with the tensor integral operator
[TABLE]
On the other hand, using (5.10)
[TABLE]
with the interpolating path
[TABLE]
and
[TABLE]
In the above display, stands for independent copies of a pair of random variables with distribution .
Using the commutation formula (3.5) we check that
[TABLE]
Using (5.20) for any differentiable matrix valued function such that and for any we check that
[TABLE]
On the other hand, we have
[TABLE]
with the -tensor product
[TABLE]
Using (5.3) we check that
[TABLE]
We conclude that for any function s.t.
[TABLE]
The last assertion comes from the formula
[TABLE]
Proof of theorem 2.6
We extend the operators introduced in theorem 2.4 to tensor functions by considering the tensor function with entries
[TABLE]
By theorem 2.4 we have
[TABLE]
with the functions
[TABLE]
We also write instead of . Using (2.12) and (4.14) we check that
[TABLE]
as well as
[TABLE]
Using (2.15) we also have
[TABLE]
On the other hand, we have the second order expansions
[TABLE]
In the same vein, we have
[TABLE]
This implies that
[TABLE]
with the second order remainder term
[TABLE]
and the third order remainder term
[TABLE]
Combining (3.4) with (2.4) and (2.17) for any we check the uniform estimate
[TABLE]
We check (2.19) using (5.23) and (5.22).
Using (5.3) we also have the estimate
[TABLE]
Observe that
[TABLE]
with the matrix valued functions
[TABLE]
We also write instead of . Observe that
[TABLE]
with
[TABLE]
Observe that
[TABLE]
This yields the second order decompositionn (2.20) with the remainder term
[TABLE]
The end of the proof of is now a consequence of the estimates (5.24), (5.25) and (5.26). The proof of the theorem is completed.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] V.M. Alekseev, An estimate for the perturbations of the solutions of ordinary differential equations. II, Vestnik Moskov. Univ. Ser. I Mat. Mech, vol. 3 , pp. 3–10 (1961).
- 2[2] L. Ambrosio, N. Gigli, Construction of parallel transport in the Wasserstein space, Methods and Applications of Analysis, no. 15 , vol.1, pp. 1–30 (2008).
- 3[3] L. Ambrosio, N. Gigli, and G. Savare, Gradient flows in metric spaces and in spaces of probability measures, Birkäuser, 2005.
- 4[4] M. Arnaudon, P. Del Moral. A duality formula and a particle Gibbs sampler for continuous time Feynman-Kac measures on path spaces. Ar Xiv:1805.05044 (2018).
- 5[5] M. Arnaudon, P. Del Moral. A variational approach to nonlinear and interacting diffusions. Ar Xiv:1812.04269 (2018) . Stochastic Analysis and Applications DOI: 10.1080/07362994.2019.1609985 (2019).
- 6[6] M. Arnaudon, H. Plank, A. Thalmaier. A Bismut type formula for the Hessian of heat semigroups. C. R. Math. Acad. Sci. Paris, vol. 336, no. 8, pp. 661--666 (2003).
- 7[7] D. Benedetto, E. Caglioti, M. Pulvirenti. A kinetic equation for granular media. RAIRO Modèl. Math. Anal. Numér. vol. 31, no. 5, pp. 615--641 (1997).
- 8[8] D. Benedetto, E. Caglioti, E., Carrillo, M. Pulvirenti. A non-Maxwellian steady distribution for one-dimensional granular media. J. Statist. Phys.vol. 91, pp. 979--990 (1998).
