A second order analysis of McKean-Vlasov semigroups

M Arnaudon (IMB); P del Moral (CMAP; CQFD)

arXiv:1906.05140·math.PR·January 7, 2020

TL;DR

This paper develops a second order differential calculus for McKean-Vlasov semigroups, enabling detailed analysis of their regularity, stability, and propagation of chaos properties in nonlinear diffusion processes.

Contribution

It introduces a novel second order calculus framework and explicit expansions for McKean-Vlasov semigroups, advancing understanding of their stability and chaos propagation.

Findings

01

Provides second order Taylor expansions with remainders

02

Derives Bismut-Elworthy-Li formulae for gradients and Hessians

03

Establishes exponential decay inequalities for semigroup stability

Equations

$$dX_{s,t}^{\mu}(x)=b_{t}(X_{s,t}^{\mu}(x),\phi_{s,t}(\mu))\,dt+dW_{t}\quad\mbox{with}\quad b_{t}(x,\mu):=\int\mu(dy)\,b_{t}(x,y)$$

$$\phi_{s,t}(\mu)(dy)=\mu P^{\mu}_{s,t}(dy):=\int\mu(dx)\,P^{\mu}_{s,t}(x,dy)\quad\mbox{with}\quad P^{\mu}_{s,t}(x,dy):=\mathbb{P}(X^{\mu}_{s,t}(x)\in dy)$$

$$\|b^{[i]}\|_{2}:=\sup_{t\geq 0}\,\sup_{(x_{1},x_{2})\in\mathbb{R}^{2d}}\|b_{t}^{[i]}(x_{1},x_{2})\|_{2}<\infty\quad\mbox{with}\quad b_{t}^{[i]}(x_{1},x_{2}):=\nabla_{x_{i}}b_{t}(x_{1},x_{2})$$

$$d\xi^{i}_{t}=b_{t}(\xi^{i}_{t},m(\xi_{t}))\,dt+dW^{i}_{t}\quad\mbox{with}\quad 1\leq i\leq N\quad\mbox{and}\quad m(\xi_{t}):=\frac{1}{N}\sum_{1\leq j\leq N}\delta_{\xi^{j}_{t}}$$

$$\phi_{s,t}(\mu_{1})\simeq\phi_{s,t}(\mu_{0})+(\mu_{1}-\mu_{0})D_{\mu_{0}}\phi_{s,t}+\frac{1}{2}\,(\mu_{1}-\mu_{0})^{\otimes 2}D^{2}_{\mu_{0}}\phi_{s,t}$$

$$X^{\mu_{1}}_{s,t}(x)-X^{\mu_{0}}_{s,t}(x)\simeq\int(\mu_{1}-\mu_{0})(dy)\,D_{\mu_{0}}X^{\mu_{0}}_{s,t}(x,y)+\frac{1}{2}\int(\mu_{1}-\mu_{0})^{\otimes 2}(dz)\,D^{2}_{\mu_{0}}X^{\mu_{0}}_{s,t}(x,z)$$

$$d\psi_{s,t}(Y):=B_{t}(\psi_{s,t}(Y))\,dt+dW_{t}$$

$$B_{t}(X):=\mathbb{E}\left(b_{t}(X,\overline{X})~|~X\right)$$

$$\|\psi_{s,t}(Y_{1})-\psi_{s,t}(Y_{0})\|_{\mathbb{H}_{t}(\mathbb{R}^{d})}\leq e^{-\lambda(t-s)}\,\|Y_{1}-Y_{0}\|_{\mathbb{H}_{t}(\mathbb{R}^{d})}$$

$$\langle X_{1}-X_{0},B_{t}(X_{1})-B_{t}(X_{0})\rangle_{\mathbb{H}_{t}(\mathbb{R}^{d})}\leq-2\lambda\,\|X_{1}-X_{0}\|^{2}_{\mathbb{H}_{t}(\mathbb{R}^{d})}$$

$$\partial\psi_{s,t}(Y)^{\star}\cdot\nabla f(\psi_{s,t}(Y))=\nabla D_{\mu}\phi_{s,t}(f)(Y)$$

$$A_{t}(x_{1},x_{2})_{\tiny sym}\leq-\lambda_{0}\,I\quad\mbox{and}\quad b_{t}^{[1]}(x_{1},x_{2})_{\tiny sym}\leq-\lambda_{1}\,I$$

$$A_{t}(x_{1},x_{2}):=\left[\begin{array}{cc}b_{t}^{[1]}(x_{1},x_{2})&b_{t}^{[2]}(x_{2},x_{1})\\ b_{t}^{[2]}(x_{1},x_{2})&b_{t}^{[1]}(x_{2},x_{1})\end{array}\right]\quad\mbox{and we set}\quad\lambda_{1,2}:=\lambda_{1}-\|b^{[2]}\|_{2}$$

$$\vee_{k=1,2}\,{|\kern-1.07639pt|\kern-1.07639pt|D^{k}_{\mu_{0}}\phi_{s,t}|\kern-1.07639pt|\kern-1.07639pt|}\simeq e^{-\lambda(t-s)}\quad\mbox{and therefore}\quad{|\kern-1.07639pt|\kern-1.07639pt|\phi_{s,t}(\mu_{1})-\phi_{s,t}(\mu_{0})|\kern-1.07639pt|\kern-1.07639pt|}\simeq e^{-\lambda(t-s)}$$

$$m_{t_{n}}-\phi_{t_{0},t_{n}}(m_{t_{0}})=\sum_{1\leq k\leq n}\left[\phi_{t_{k},t_{n}}(m_{t_{k}})-\phi_{t_{k},t_{n}}\left(\phi_{t_{k-1},t_{k}}(m_{t_{k-1}})\right)\right]\quad\mbox{with}\quad m_{t_{k}}:=m(\xi_{t_{k}})$$

$$m_{t_{n}}-\phi_{t_{0},t_{n}}(m_{t_{0}})\simeq\frac{1}{\sqrt{N}}\sum_{1\leq k\leq n}\Delta M_{t_{k}}D_{m_{t_{k-1}}}\phi_{t_{k},t_{n}}+\frac{1}{2N}\sum_{1\leq k\leq n}(\Delta M_{t_{k}})^{\otimes 2}D^{2}_{m_{t_{k-1}}}\phi_{t_{k},t_{n}}$$

$$\Delta M_{t_{k}}:=\sqrt{N}\,(m_{t_{k}}-\overline{m}_{t_{k}})\quad\mbox{and}\quad\overline{m}_{t_{k}}:=\phi_{t_{k-1},t_{k}}(m_{t_{k-1}})\simeq m_{t_{k-1}}$$

$${|\kern-1.07639pt|\kern-1.07639pt|D_{\mu_{0}}\phi_{s,t}|\kern-1.07639pt|\kern-1.07639pt|}\simeq e^{-\lambda(t-s)}\Longrightarrow\left|\,\mathbb{E}\left[\|m_{t}(f)-\phi_{0,t}(m_{0})(f)\|^{n}\right]^{1/n}\right|\leq c_{n}/\sqrt{N}$$

$${|\kern-1.07639pt|\kern-1.07639pt|D^{2}_{\mu_{0}}\phi_{s,t}|\kern-1.07639pt|\kern-1.07639pt|}\simeq e^{-\lambda(t-s)}\Longrightarrow\left|\,\mathbb{E}\left[m_{t}(f)-\phi_{0,t}(m_{0})(f)\right]\right|\leq c/N$$

$$((\nabla\otimes\nabla)g)_{i,j}$$

$$\sup_{0\leq k\leq n}\|\nabla^{k}f(x)\|\leq c\,w_{m}(x)\quad\mbox{with the weight function}\quad w_{m}(x)=(1+\|x\|)^{m}\quad\mbox{for some}\quad m\geq 0.$$

$$\|f\|_{{\cal C}^{n}_{m}(\mathbb{R}^{d})}:=\sum_{0\leq k\leq n}\|\nabla^{k}f/w_{m}\|_{\infty}\quad\mbox{with}\quad\|\nabla^{k}f/w_{m}\|_{\infty}=\sup_{x\in\mathbb{R}^{d}}\|\nabla^{k}f(x)/w_{m}(x)\|$$

$$\|e\|_{\mu,n}:=\left[\int\|x\|^{n}\,\mu(dx)\right]^{1/n}$$

$$\mathbb{E}\left(\|X^{\mu}_{s,t}(x)\|^{n}\right)^{1/n}\leq c_{n}(t)\,(\|x\|+\|e\|_{\mu,2})\quad\mbox{which implies that}\quad\phi_{s,t}(\mu)(\|e\|^{n})^{1/n}\leq c_{n}(t)\,\|e\|_{\mu,n}$$

$$\partial\psi_{s,t}(Y)=e^{\oint_{s}^{t}\partial B_{u}(\psi_{s,u}(Y))\,du}\in\mbox{\rm Lin}(\mathbb{H}_{s}(\mathbb{R}^{d}),\mathbb{H}_{t}(\mathbb{R}^{d}))$$

$$\gamma(T_{t}):=\sup_{\|Z\|_{\mathbb{H}_{t}(\mathbb{R}^{d})}=1}\langle Z,(T_{t}+T_{t}^{\star})/2\cdot Z\rangle_{\mathbb{H}_{t}(\mathbb{R}^{d})}$$

$$-\int_{s}^{t}\gamma(-\partial B_{u}(\psi_{s,u}(Y)))\,du\leq\frac{1}{t}\log{|\kern-1.07639pt|\kern-1.07639pt|e^{\oint_{s}^{t}\partial B_{u}(\psi_{s,u}(Y))\,du}|\kern-1.07639pt|\kern-1.07639pt|}_{\mathbb{H}_{t}(\mathbb{R}^{d})\rightarrow\mathbb{H}_{t}(\mathbb{R}^{d})}\leq\int_{s}^{t}\gamma(\partial B_{u}(\psi_{s,u}(Y)))\,du$$

$$(H)\Longrightarrow\partial B_{t}(X)_{\tiny sym}\leq-\lambda_{0}\,I\Longrightarrow\frac{1}{t}\log{|\kern-1.07639pt|\kern-1.07639pt|e^{\oint_{s}^{t}\partial B_{u}(\psi_{s,u}(Y))\,du}|\kern-1.07639pt|\kern-1.07639pt|}_{\mathbb{H}_{t}(\mathbb{R}^{d})\rightarrow\mathbb{H}_{t}(\mathbb{R}^{d})}\leq-\lambda_{0}$$

$$Y_{\epsilon}:=(1-\epsilon)Y_{0}+\epsilon Y_{1}\Longrightarrow\partial_{\epsilon}\psi_{s,t}(Y_{\epsilon})=e^{\oint_{s}^{t}\partial B_{u}(\psi_{s,u}(Y_{\epsilon}))\,du}\cdot(Y_{1}-Y_{0})$$

$$\partial B_{t}(X)_{\tiny sym}\leq-\lambda_{0}\,I\Longrightarrow\mathbb{W}_{2}(\phi_{s,t}(\mu_{1}),\phi_{s,t}(\mu_{0}))\leq e^{-\lambda_{0}(t-s)}\,\mathbb{W}_{2}(\mu_{0},\mu_{1})$$

Taxonomy

Topics: Stochastic processes and financial applications · Mathematical Biology Tumor Growth · Markov Chains and Monte Carlo Methods

Full text

A second order analysis of McKean-Vlasov semigroups

M. Arnaudon

Univ. Bordeaux, CNRS, Bordeaux INP, IMB, UMR 5251, F-33400, Talence, France

P. Del Moral

INRIA, Bordeaux Research Center, Talence, France & CMAP, Polytechnique Palaiseau, France

Abstract

We propose a second order differential calculus to analyze the regularity and the stability properties of the distribution semigroup associated with McKean-Vlasov diffusions. This methodology provides second order Taylor type expansions with remainder for both the evolution semigroup as well as the stochastic flow associated with this class of nonlinear diffusions. Bismut-Elworthy-Li formulae for the gradient and the Hessian of the integro-differential operators associated with these expansions are also presented.

The article also provides explicit Dyson-Phillips expansions and a refined analysis of the norm of these integro-differential operators. Under some natural and easily verifiable regularity conditions we derive a series of exponential decay inequalities with respect to the time horizon. We illustrate the impact of these results with a second order extension of the Alekseev-Gröbner lemma to nonlinear measure valued semigroups and interacting diffusion flows. This second order perturbation analysis provides direct proofs of several uniform propagation of chaos properties w.r.t. the time parameter, including bias and fluctuation error estimates as well as exponential concentration inequalities.

Keywords : Nonlinear diffusions, mean field particle systems, variational equations, logarithmic norms, gradient flows, Taylor expansions, contraction inequalities, Wasserstein distance, Bismut-Elworthy-Li formulae.

Mathematics Subject Classification : 65C35, 82C80, 58J65, 47J20.

1 Introduction

1.1 Description of the models

For any $n\geq 1$ we let $P_{n}(\mathbb{R}^{d})$ be the convex set of probability measures $\eta,\mu$ on $\mathbb{R}^{d}$ with finite absolute $n$-th moment, equipped with the Wasserstein distance of order $n$ denoted by $\mathbb{W}_{n}(\eta,\mu)$. Also let $b_{t}(x_{1},x_{2})$ be some Lipschitz function from $\mathbb{R}^{2d}$ into $\mathbb{R}^{d}$ and let $W_{t}$ be a $d$-dimensional Brownian motion defined on some filtered probability space $(\Omega,(\mathbb{F}_{t})_{t\geq 0},\mathbb{P})$. We also consider the Hilbert space $\mathbb{H}_{t}(\mathbb{R}^{d}):=\mathbb{L}_{2}((\Omega,\mathbb{F}_{t},\mathbb{P}),\mathbb{R}^{d})$ equipped with the $\mathbb{L}_{2}$ inner product $\langle\mbox{\LARGE.},\mbox{\LARGE.}\rangle_{\mathbb{H}_{t}(\mathbb{R}^{d})}$. Up to a probability space enlargement there is no loss of generality in assuming that $\mathbb{H}_{t}(\mathbb{R}^{d})$ contains square integrable $\mathbb{R}^{d}$-valued variables independent of the Brownian motion.

For any $\mu\in P_{2}(\mathbb{R}^{d})$ and any time horizon $s\geq 0$ we denote by $X_{s,t}^{\mu}(x)$ the stochastic flow defined for any $t\in[s,\infty[$ and any starting point $x\in\mathbb{R}^{d}$ by the McKean-Vlasov diffusion

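$$dX_{s,t}^{\mu}(x)=b_{t}(X_{s,t}^{\mu}(x),\phi_{s,t}(\mu))\,dt+dW_{t}\quad\mbox{with}\quad b_{t}(x,\mu):=\int\mu(dy)\,b_{t}(x,y)\tag{1.1}$$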

In the above display, $\phi_{s,t}$ stands for the evolution semigroup on $P_{2}(\mathbb{R}^{d})$ defined by the formulae

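$$\phi_{s,t}(\mu)(dy)=\mu P^{\mu}_{s,t}(dy):=\int\mu(dx)\,P^{\mu}_{s,t}(x,dy)\quad\mbox{with}\quad P^{\mu}_{s,t}(x,dy):=\mathbb{P}(X^{\mu}_{s,t}(x)\in dy)$$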

We denote by $L_{t,\phi_{s,t}(\mu)}$ the generator of the stochastic flow $X^{\mu}_{s,t}(x)$. The existence of the stochastic flow $X^{\mu}_{s,t}(x)$ is ensured by the Lipschitz property of the drift function; see for instance [41, 47]. To analyze the smoothness of the semigroup $\phi_{s,t}$ we need to strengthen this condition.

We shall assume that the function $b_{t}(x_{1},x_{2})$ is differentiable at any order with uniformly bounded derivatives. In addition, the partial differential matrices w.r.t. the first and the second coordinate are uniformly bounded; that is for any $i=1,2$ we have

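$$\|b^{[i]}\|_{2}:=\sup_{t\geq 0}\,\sup_{(x_{1},x_{2})\in\mathbb{R}^{2d}}\|b_{t}^{[i]}(x_{1},x_{2})\|_{2}<\infty\quad\mbox{with}\quad b_{t}^{[i]}(x_{1},x_{2}):=\nabla_{x_{i}}b_{t}(x_{1},x_{2})\tag{1.2}$$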

In the above display, $\|A\|_{2}:=\lambda_{\tiny max}(AA^{\prime})^{1/2}$ stands for the spectral norm of some matrix $A$, where $A^{\prime}$ stands for the transpose of $A$, and $\lambda_{\tiny max}(\mbox{\LARGE.})$ and $\lambda_{\tiny min}(\mbox{\LARGE.})$ for the maximal and minimal eigenvalues. In the further development of the article, we shall also denote by $A_{\tiny sym}=(A+A^{\prime})/2$ the symmetric part of a matrix $A$. We represent the gradient of a real valued function as a column vector, or equivalently as the transpose of the differential-Jacobian operator which is, as any cotangent vector, represented by a row vector. The gradient and the Hessian of a column vector valued function are represented as tensors of type $(1,1)$ and $(2,1)$; see for instance (3.1).

The mean field particle interpretation of the nonlinear diffusion (1.1) is described by a system of $N$ -interacting diffusions $\xi_{t}=(\xi^{i}_{t})_{1\leq i\leq N}$ defined by the stochastic differential equations

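$$d\xi^{i}_{t}=b_{t}(\xi^{i}_{t},m(\xi_{t}))\,dt+dW^{i}_{t}\quad\mbox{with}\quad 1\leq i\leq N\quad\mbox{and}\quad m(\xi_{t}):=\frac{1}{N}\sum_{1\leq j\leq N}\delta_{\xi^{j}_{t}}\tag{1.3}$$

In the above display, $(\xi_{0}^{i})_{1\leq i\leq N}$ stands for $N$ independent random variables with common distribution $\mu_{0}$, and $W^{i}_{t}$ are $N$ independent copies of the Brownian motion $W_{t}$.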
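The particle system above lends itself to a direct simulation. The following is a minimal sketch only, under assumptions that are not taken from the article: dimension $d=1$, an illustrative linear drift $b_{t}(x_{1},x_{2})=-a\,x_{1}+c\,x_{2}$, a Gaussian initial law, and an Euler-Maruyama time discretization. The function name `simulate_particles` and all parameter values are hypothetical.

```python
import math
import random


def simulate_particles(n_particles, horizon, step, a, c, mean0, seed=0):
    """Euler-Maruyama scheme for d xi^i = b(xi^i, m(xi)) dt + dW^i in dimension 1.

    Illustrative linear drift (an assumption, not taken from the article):
    b(x1, x2) = -a * x1 + c * x2, so that b(x, m) = -a * x + c * mean(m).
    """
    rng = random.Random(seed)
    # Independent initial states with common law mu_0 = N(mean0, 1).
    xi = [rng.gauss(mean0, 1.0) for _ in range(n_particles)]
    n_steps = int(round(horizon / step))
    sqrt_step = math.sqrt(step)
    for _ in range(n_steps):
        # For this drift the interaction enters only through the empirical mean.
        emp_mean = sum(xi) / n_particles
        xi = [
            x + (-a * x + c * emp_mean) * step + sqrt_step * rng.gauss(0.0, 1.0)
            for x in xi
        ]
    return xi


particles = simulate_particles(2000, horizon=1.0, step=0.01, a=2.0, c=0.5, mean0=2.0)
empirical_mean = sum(particles) / len(particles)
# For this drift the mean of phi_{0,t}(mu_0) solves m' = -(a - c) m.
limiting_mean = 2.0 * math.exp(-(2.0 - 0.5) * 1.0)
print(empirical_mean, limiting_mean)
```

With this linear choice the mean of the limiting flow is available in closed form, which gives a cheap consistency check on the empirical measure $m(\xi_{t})$; for a general drift no such closed form should be expected.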

McKean-Vlasov diffusions and their mean field type particle interpretations arise in a variety of application domains, including in porous media and granular flows [7, 8, 18, 67], fluid mechanics [58, 59, 61, 68], data assimilation [10, 26, 36], and more recently in mean field game theory [9, 14, 13, 15, 16, 17, 46, 43], and many others.

The origins of this subject certainly go back to the beginning of the 1950s with the article by Harris and Kahn [45] using mean field type splitting techniques for estimating particle transmission energies. We also refer to the pioneering articles by Kac [50, 51] on particle interpretations of Boltzmann and Vlasov equations, and the seminal articles by McKean [58, 59] on mean field particle interpretations of nonlinear parabolic equations arising in fluid mechanics. Since this period, the analysis of this class of mean field type nonlinear diffusions and their discrete time versions has been developed in various directions. For a survey on these developments we refer to [15, 26, 65], and the references therein.

The McKean-Vlasov diffusions discussed in this article belong to the class of nonlinear Markov processes. One of the most important and difficult research questions concerns the regularity analysis and more particularly the stability and the long time behavior of these stochastic models.

In contrast with conventional Markov processes, one of the main difficulties of these Markov processes comes from the fact that the evolution semigroup $\phi_{s,t}(\mu)$ is nonlinear w.r.t. the initial condition $\mu$ of the system. The additional complexity in the analysis of these models is that their state space is the convex set of probability measures, thus conventional functional analysis and differential calculus on Banach spaces cannot be directly applied.

The main contribution of this article is the development of a second order differential calculus to analyze the regularity and the stability properties of the distribution semigroup associated with McKean-Vlasov diffusions. This methodology provides second order Taylor type expansions with remainder for both the evolution semigroup as well as the stochastic flow associated with this class of nonlinear diffusions. We also provide a refined analysis of the norm of these integro-differential operators with a series of exponential decay inequalities with respect to the time horizon.

The article is organized as follows:

The main contributions of this article are briefly discussed in section 1.2. The main theorems are stated in some detail in section 2. Section 3 provides some pivotal results on tensor integral operators and on integro-differential operators associated with the second order Taylor expansions of the semigroup $\phi_{s,t}(\mu)$. Section 4 is dedicated to the analysis of the tangent process associated with the nonlinear diffusion flow. We present explicit Dyson-Phillips expansions as well as some spectral estimates. The last section, section 4, is mainly concerned with the proofs of the first and second order Taylor expansions. The proofs of some technical results are collected in the appendix. Detailed comparisons with existing literature on this subject are also provided in section 2.5.

1.2 Statement of some main results

One of the main contributions of the present article is the derivation of a second order Taylor expansion with remainder of the semigroup $\phi_{s,t}$ on probability spaces. For any pair of measures $\mu_{0},\mu_{1}\in P_{2}(\mathbb{R}^{d})$, these expansions take basically the following form:

$$\phi_{s,t}(\mu_{1})\simeq\phi_{s,t}(\mu_{0})+(\mu_{1}-\mu_{0})D_{\mu_{0}}\phi_{s,t}+\frac{1}{2}\,(\mu_{1}-\mu_{0})^{\otimes 2}D^{2}_{\mu_{0}}\phi_{s,t}\tag{1.4}$$

In the above display, $D^{k}_{\mu_{0}}\phi_{s,t}$ stands for some first and second order operators, with $k=1,2$. A more precise description of these expansions and the remainder terms is provided in section 2.2.

Section 2.3.1 also provides almost sure second order Taylor expansions with remainder of the random state $X^{\mu}_{s,t}(x)$ of the McKean diffusion w.r.t. the initial distribution $\mu$. These almost sure expansions take basically the following form

$$X^{\mu_{1}}_{s,t}(x)-X^{\mu_{0}}_{s,t}(x)\simeq\int(\mu_{1}-\mu_{0})(dy)\,D_{\mu_{0}}X^{\mu_{0}}_{s,t}(x,y)+\frac{1}{2}\int(\mu_{1}-\mu_{0})^{\otimes 2}(dz)\,D^{2}_{\mu_{0}}X^{\mu_{0}}_{s,t}(x,z)\tag{1.5}$$

for some random functions $D^{k}_{\mu_{0}}X^{\mu_{0}}_{s,t}$ from $\mathbb{R}^{(1+k)d}$ into $\mathbb{R}^{d}$ , with $k=1,2$ . A more precise description of these almost sure expansions is provided in section 2.3.1 (see for instance (2.19) and theorem 2.6).

Given some random variable $Y\in\mathbb{H}_{s}(\mathbb{R}^{d})$ with distribution $\mu\in P_{2}(\mathbb{R}^{d})$ , observe that the stochastic flow $\psi_{s,t}(Y):=X^{\mu}_{s,t}(Y)$ satisfies the $\mathbb{H}_{t}(\mathbb{R}^{d})$ -valued stochastic differential equation

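$$d\psi_{s,t}(Y):=B_{t}(\psi_{s,t}(Y))\,dt+dW_{t}\tag{1.6}$$

In the above display, $B_{t}$ stands for the drift function from $\mathbb{H}_{t}(\mathbb{R}^{d})$ into itself defined by the formula

$$B_{t}(X):=\mathbb{E}\left(b_{t}(X,\overline{X})~|~X\right)$$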

In the above display, $\overline{X}$ stands for an independent copy of $X$ . The above Hilbert space valued representation of the McKean-Vlasov diffusion (1.1) readily implies that for any $Y_{1},Y_{0}\in\mathbb{H}_{s}(\mathbb{R}^{d})$ we have the exponential contraction inequality

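$$\|\psi_{s,t}(Y_{1})-\psi_{s,t}(Y_{0})\|_{\mathbb{H}_{t}(\mathbb{R}^{d})}\leq e^{-\lambda(t-s)}\,\|Y_{1}-Y_{0}\|_{\mathbb{H}_{t}(\mathbb{R}^{d})}$$

for some $\lambda>0$, as soon as the following condition is satisfied

$$\langle X_{1}-X_{0},B_{t}(X_{1})-B_{t}(X_{0})\rangle_{\mathbb{H}_{t}(\mathbb{R}^{d})}\leq-2\lambda\,\|X_{1}-X_{0}\|^{2}_{\mathbb{H}_{t}(\mathbb{R}^{d})}$$

for any $t\geq 0$ and any $X_{1},X_{0}\in\mathbb{H}_{t}(\mathbb{R}^{d})$. In addition, in this framework the first order differential $\partial\psi_{s,t}(Y)$ of the stochastic flow coincides with the conventional Fréchet derivative of functions from a Hilbert space into another. We shall also see that the gradient of the first order operator $D_{\mu}\phi_{s,t}$ coincides with the dual of the tangent process associated with the Hilbert space-valued representation (1.6) of the McKean-Vlasov diffusion (1.1); that is, for any smooth function $f$ we have the dual tangent formula

$$\partial\psi_{s,t}(Y)^{\star}\cdot\nabla f(\psi_{s,t}(Y))=\nabla D_{\mu}\phi_{s,t}(f)(Y)$$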

A more precise description of the Fréchet differential $\partial\psi_{s,t}(Y)$ and the dual operator is provided in section 2.1 and section 4. A proof of the above formula is provided in theorem 4.8.

The Taylor expansions discussed above are valid under fairly general and easily verifiable conditions on the drift function. For instance, the regularity condition (1.2) is clearly satisfied for linear drift functions. As it is well known, dynamical systems and hence stochastic models involving drift functions with quadratic growth require additional regularity conditions to ensure non explosion of the solution in finite time.

Of course the expansions (1.4) and (1.5) will be of rather poor practical interest without a better understanding of the differential operators and the remainder terms. To get some useful approximations, we need to quantify with some precision the norm of these operators. An important part of the article is concerned with developing a series of quantitative estimates of the differential operators $D^{k}_{\mu_{0}}\phi_{s,t}$ and the remainder term; see for instance theorem 2.3 and theorem 2.4.

To avoid estimates that grow exponentially fast with respect to the time horizon, we need to estimate with some precision the operator norms of the differential operators in (1.4). To this end, we shall consider an additional regularity condition:

$(H)$: There exist some $\lambda_{0}>0$ and $\lambda_{1}>\|b^{[2]}\|_{2}$ such that for any $(x_{1},x_{2})\in\mathbb{R}^{2d}$ and any time horizon $t\geq 0$ we have

$$A_{t}(x_{1},x_{2})_{\tiny sym}\leq-\lambda_{0}\,I\quad\mbox{and}\quad b_{t}^{[1]}(x_{1},x_{2})_{\tiny sym}\leq-\lambda_{1}\,I\tag{1.9}$$

In the above display, $I$ stands for the identity matrix and $A_{t}$ for the matrix-valued function defined by

$$A_{t}(x_{1},x_{2}):=\left[\begin{array}{cc}b_{t}^{[1]}(x_{1},x_{2})&b_{t}^{[2]}(x_{2},x_{1})\\ b_{t}^{[2]}(x_{1},x_{2})&b_{t}^{[1]}(x_{2},x_{1})\end{array}\right]\quad\mbox{and we set}\quad\lambda_{1,2}:=\lambda_{1}-\|b^{[2]}\|_{2}\tag{1.10}$$

Whenever (1.9) and (1.10) are met for some parameters $\lambda_{0}$ and $\lambda_{1}\in\mathbb{R}$, all the exponential estimates stated in the article remain valid but they grow exponentially fast with respect to the time horizon. More detailed comments on the above regularity conditions, including illustrations for linear drift and gradient flow models, as well as comparisons with related conditions used in the literature on this subject are also provided in section 2.4.
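To make condition $(H)$ concrete, here is a small sketch for the simplest case one can write down. It assumes dimension $d=1$ and a time-homogeneous linear drift $b_{t}(x_{1},x_{2})=b_{1}x_{1}+b_{2}x_{2}$, so that $b^{[1]}_{t}=b_{1}$, $b^{[2]}_{t}=b_{2}$, and $A_{t}$ is the constant symmetric $2\times 2$ matrix with diagonal entries $b_{1}$ and off-diagonal entries $b_{2}$. The helper `check_condition_h` is hypothetical and only illustrates how $\lambda_{0}$, $\lambda_{1}$ and $\lambda_{1,2}$ would be read off in this toy setting.

```python
def check_condition_h(b1, b2):
    """Return (holds, lambda0, lambda1, lambda12) for b(x1, x2) = b1*x1 + b2*x2, d = 1.

    Assumption: scalar linear drift, so every derivative matrix is a constant.
    """
    # A = [[b1, b2], [b2, b1]] is symmetric with eigenvalues b1 + b2 and b1 - b2.
    top_eigenvalue = b1 + abs(b2)
    lambda0 = -top_eigenvalue      # largest lambda0 with A_sym <= -lambda0 * I
    lambda1 = -b1                  # largest lambda1 with b^{[1]}_sym <= -lambda1 * I
    lambda12 = lambda1 - abs(b2)   # lambda_{1,2} := lambda1 - ||b^{[2]}||_2
    holds = lambda0 > 0 and lambda1 > abs(b2)
    return holds, lambda0, lambda1, lambda12


print(check_condition_h(-2.0, 0.5))  # (True, 1.5, 2.0, 1.5)
print(check_condition_h(-0.3, 0.5)[0])  # False: the interaction dominates the confinement
```

In this scalar linear case the two requirements coincide: both reduce to $-b_{1}>|b_{2}|$, that is, the self-confinement must dominate the interaction strength.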

Under the above condition, we shall develop several exponential decay inequalities for the norm of the differential operators $D_{\mu_{0}}^{k}\phi_{s,t}$ as well as for the remainder terms in the Taylor expansions. The first order estimates are given in (2.6); the ones on the Bismut-Elworthy-Li gradient and Hessian extension formulae are provided in (2.7) and (2.8). Second and third order estimates can also be found in (2.12) and (2.15).

The second order differential calculus discussed above provides a natural theoretical basis to analyze the stability properties of the semigroup $\phi_{s,t}$ and the one of the mean field particle system discussed in (1.3).

For instance, a first order Taylor expansion of the form (1.4) already indicates that the sensitivity properties of the semigroup w.r.t. the initial condition $\mu$ are encapsulated in the first order differential operator $D_{\mu}\phi_{s,t}$ . Roughly speaking, whenever $(H)$ is satisfied, we show that there exists some parameter $\lambda>0$ such that

$$\vee_{k=1,2}\,{|\kern-1.07639pt|\kern-1.07639pt|D^{k}_{\mu_{0}}\phi_{s,t}|\kern-1.07639pt|\kern-1.07639pt|}\simeq e^{-\lambda(t-s)}\quad\mbox{and therefore}\quad{|\kern-1.07639pt|\kern-1.07639pt|\phi_{s,t}(\mu_{1})-\phi_{s,t}(\mu_{0})|\kern-1.07639pt|\kern-1.07639pt|}\simeq e^{-\lambda(t-s)}\tag{1.11}$$

for some operator norms ${|\kern-1.07639pt|\kern-1.07639pt|\mbox{\LARGE.}|\kern-1.07639pt|\kern-1.07639pt|}$. For a more precise statement we refer to theorem 2.2 and the discussion following the theorem.

The second order expansion (1.4) also provides a natural basis to quantify the propagation of chaos properties of the mean field particle model (1.3). Combining these Taylor expansions with a backward semigroup analysis we derive a variety of uniform mean error estimates w.r.t. the time horizon. This backward second order analysis can be seen as a second order extension of the Alekseev-Gröbner lemma [1, 42] to nonlinear measure valued and stochastic semigroups. For a more precise statement we refer to theorem 2.7. As in (1.11), one of the main features of the expansion (1.4) is that it allows one to bring the stability properties of the limiting semigroup $\phi_{s,t}$ into the analysis of the flow of empirical measures $m(\xi_{t})$.

Roughly speaking, this backward perturbation analysis can be interpreted as a second order variation-of-constants technique applied to nonlinear equations in distribution spaces. As in Itô's lemma, the second order term is essential to capture the quadratic variation of the processes; see for instance the recent articles [35, 48] in the context of conventional stochastic differential equations, as well as [4, 31] in the context of interacting jump models.

The discrete time version of this backward perturbation semigroup methodology can also be found in chapter 7 in [25], as well as in the articles [27, 28, 30] and [34, 37] for general classes of mean field particle systems.

The central idea is to consider the telescoping sum on some time mesh $t_{n}\leq t_{n+1}$ given by the interpolating formula

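$$m_{t_{n}}-\phi_{t_{0},t_{n}}(m_{t_{0}})=\sum_{1\leq k\leq n}\left[\phi_{t_{k},t_{n}}(m_{t_{k}})-\phi_{t_{k},t_{n}}\left(\phi_{t_{k-1},t_{k}}(m_{t_{k-1}})\right)\right]\quad\mbox{with}\quad m_{t_{k}}:=m(\xi_{t_{k}})$$

Applying (1.4), whenever $(t_{k}-t_{k-1})\simeq 0$ we have the second order approximation

$$m_{t_{n}}-\phi_{t_{0},t_{n}}(m_{t_{0}})\simeq\frac{1}{\sqrt{N}}\sum_{1\leq k\leq n}\Delta M_{t_{k}}D_{m_{t_{k-1}}}\phi_{t_{k},t_{n}}+\frac{1}{2N}\sum_{1\leq k\leq n}(\Delta M_{t_{k}})^{\otimes 2}D^{2}_{m_{t_{k-1}}}\phi_{t_{k},t_{n}}$$

with the local fluctuation random fields

$$\Delta M_{t_{k}}:=\sqrt{N}\,(m_{t_{k}}-\overline{m}_{t_{k}})\quad\mbox{and}\quad\overline{m}_{t_{k}}:=\phi_{t_{k-1},t_{k}}(m_{t_{k-1}})\simeq m_{t_{k-1}}$$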

For discrete generation particle systems, $\xi_{t_{k}}^{i}$ are defined by $N$ conditionally independent variables given the system $\xi_{t_{k-1}}$ . For a more rigorous analysis we refer to section 2.3.2.
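As a check on the normalizing factors, the second order approximation can be recovered by hand. The sketch below assumes that the local fluctuation field is normalized as $\Delta M_{t_{k}}=\sqrt{N}\,(m_{t_{k}}-\overline{m}_{t_{k}})$ and applies (1.4) to each term of the telescoping sum with $\mu_{1}=m_{t_{k}}$ and $\mu_{0}=\overline{m}_{t_{k}}$; it is a heuristic computation, not the rigorous argument of section 2.3.2.

```latex
\begin{align*}
\phi_{t_k,t_n}(m_{t_k})-\phi_{t_k,t_n}(\overline{m}_{t_k})
&\simeq (m_{t_k}-\overline{m}_{t_k})\,D_{\overline{m}_{t_k}}\phi_{t_k,t_n}
+\frac{1}{2}\,(m_{t_k}-\overline{m}_{t_k})^{\otimes 2}\,D^{2}_{\overline{m}_{t_k}}\phi_{t_k,t_n}\\
&=\frac{1}{\sqrt{N}}\,\Delta M_{t_k}\,D_{\overline{m}_{t_k}}\phi_{t_k,t_n}
+\frac{1}{2N}\,(\Delta M_{t_k})^{\otimes 2}\,D^{2}_{\overline{m}_{t_k}}\phi_{t_k,t_n}
\end{align*}
```

Summing over $1\leq k\leq n$ and using $\overline{m}_{t_{k}}\simeq m_{t_{k-1}}$ on a fine mesh gives the second order approximation: the term linear in $\Delta M_{t_{k}}$ carries fluctuations of order $1/\sqrt{N}$, while the quadratic term is of order $1/N$.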

The above decomposition shows that the first order operator $D_{\mu}\phi_{s,t}$ reflects the fluctuation errors of the particle measures, while the second order term encapsulates their bias. In other words, estimating the norm of the second order operator $D^{2}_{\mu}\phi_{s,t}$ allows one to quantify the bias induced by the interaction function, while the estimation of the first order term is used to derive central limit theorems as well as $\mathbb{L}_{p}$-mean error estimates.

As in (1.11), these estimates take basically the following form. For $n\geq 1$ and any sufficiently regular function $f$ we have

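$${|\kern-1.07639pt|\kern-1.07639pt|D_{\mu_{0}}\phi_{s,t}|\kern-1.07639pt|\kern-1.07639pt|}\simeq e^{-\lambda(t-s)}\Longrightarrow\left|\,\mathbb{E}\left[\|m_{t}(f)-\phi_{0,t}(m_{0})(f)\|^{n}\right]^{1/n}\right|\leq c_{n}/\sqrt{N}$$

In addition, we have the uniform bias estimate w.r.t. the time horizon

$${|\kern-1.07639pt|\kern-1.07639pt|D^{2}_{\mu_{0}}\phi_{s,t}|\kern-1.07639pt|\kern-1.07639pt|}\simeq e^{-\lambda(t-s)}\Longrightarrow\left|\,\mathbb{E}\left[m_{t}(f)-\phi_{0,t}(m_{0})(f)\right]\right|\leq c/N$$

In the above display, ${|\kern-1.07639pt|\kern-1.07639pt|\mbox{\LARGE.}|\kern-1.07639pt|\kern-1.07639pt|}$ stands for some operator norm, and $(c,c_{n})$ stands for some finite constants whose values do not depend on the time horizon. We emphasize that the above results are a direct consequence of a second order extension of the Alekseev-Gröbner type lemma for particle density profiles. For more precise statements we refer to theorem 2.7 and the discussion following the theorem.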

1.3 Some basic notation

Let $\mbox{\rm Lin}({\cal B}_{1},{\cal B}_{2})$ be the set of bounded linear operators from a normed space ${\cal B}_{1}$ into a possibly different normed space ${\cal B}_{2}$ equipped with the operator norm ${|\kern-1.07639pt|\kern-1.07639pt|\mbox{\LARGE.}|\kern-1.07639pt|\kern-1.07639pt|}_{{\cal B}_{1}\rightarrow{\cal B}_{2}}$ . When ${\cal B}_{1}={\cal B}_{2}$ we write $\mbox{\rm Lin}({\cal B}_{1})$ instead of $\mbox{\rm Lin}({\cal B}_{1},{\cal B}_{1})$ .

With a slight abuse of notation, we denote by $I$ the identity $(d\times d)$ -matrix, for any $d\geq 1$ , as well as the identity operator in $\mbox{\rm Lin}({\cal B}_{1},{\cal B}_{1})$ . We also denote by $\|\mbox{\LARGE.}\|$ any (equivalent) norm on some finite dimensional vector space over $\mathbb{R}$ .

We also use the conventional notation $\partial_{\epsilon}$ , $\partial_{x_{i}}$ , $\partial_{s}$ , $\partial_{t}$ and so on for the partial derivatives w.r.t. some real valued parameters $\epsilon$ , $x_{i}$ , $s$ and $t$ .

We let $\nabla f(x)=\left[\partial_{x_{i}}f(x)\right]_{1\leq i\leq d}$ be the gradient column vector associated with some smooth function $f(x)$ from $\mathbb{R}^{d}$ into $\mathbb{R}$ . Given some smooth function $h(x)$ from $\mathbb{R}^{d}$ into $\mathbb{R}^{d}$ we denote by $\nabla h=\left[\nabla h^{1},\ldots,\nabla h^{d}\right]$ the gradient matrix associated with the column vector function $h=(h^{i})_{1\leq i\leq d}$ . We also let $(\nabla\otimes\nabla)$ be the second order differential operator defined for any twice differentiable function $g(x_{1},x_{2})$ on $\mathbb{R}^{2d}$ by the Hessian-type formula

[TABLE]

We consider the space ${\cal C}^{n}(\mathbb{R}^{d})$ of $n$ -differentiable functions and we denote by ${\cal C}^{n}_{m}(\mathbb{R}^{d})$ the subspace of functions $f$ such that

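$$\sup_{0\leq k\leq n}\|\nabla^{k}f(x)\|\leq c\,w_{m}(x)\quad\mbox{with the weight function}\quad w_{m}(x)=(1+\|x\|)^{m}\quad\mbox{for some}\quad m\geq 0.$$

We equip ${\cal C}^{n}_{m}(\mathbb{R}^{d})$ with the norm

$$\|f\|_{{\cal C}^{n}_{m}(\mathbb{R}^{d})}:=\sum_{0\leq k\leq n}\|\nabla^{k}f/w_{m}\|_{\infty}\quad\mbox{with}\quad\|\nabla^{k}f/w_{m}\|_{\infty}=\sup_{x\in\mathbb{R}^{d}}\|\nabla^{k}f(x)/w_{m}(x)\|$$

When there is no possible confusion, we drop the lower symbol $\|\mbox{\LARGE.}\|_{\infty}$ and we write $\|f\|$ instead of $\|f\|_{\infty}$ for the supremum norm of some real valued function. We let $e(x):=x$ be the identity function on $\mathbb{R}^{d}$ and for any $\mu\in P_{n}(\mathbb{R}^{d})$ and $n\geq 1$ we set

$$\|e\|_{\mu,n}:=\left[\int\|x\|^{n}\,\mu(dx)\right]^{1/n}$$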

For any $\mu_{1},\mu_{2}\in P_{n}(\mathbb{R}^{d})$ , we also denote by $\rho_{n}(\mu_{1},\mu_{2})$ some polynomial function of $\|e\|_{\mu_{i},n}$ with $i=1,2$ . When $\mu_{1}=\mu_{2}$ we write $\rho_{n}(\mu_{1})$ instead of $\rho_{n}(\mu_{1},\mu_{1})$ .

Under our regularity conditions on the drift function, using elementary stochastic calculus for any $n\geq 2$ and $\mu\in P_{n}(\mathbb{R}^{d})$ we check the following estimates

$$\mathbb{E}\left(\|X^{\mu}_{s,t}(x)\|^{n}\right)^{1/n}\leq c_{n}(t)\,(\|x\|+\|e\|_{\mu,2})\quad\mbox{which implies that}\quad\phi_{s,t}(\mu)(\|e\|^{n})^{1/n}\leq c_{n}(t)\,\|e\|_{\mu,n}$$

In the above display and throughout the rest of the article, we write $c(t),c_{\epsilon}(t),c_{n}(t),c_{n,\epsilon}(t),c_{\epsilon,n}(t)$ and $c_{m,n}(t)$ with $m,n\geq 0$ and $\epsilon\in[0,1]$ for some collection of non decreasing and non negative functions of the time parameter $t$ whose values may vary from line to line, but which only depend on the parameters $m,n,\epsilon$, as well as on the drift function $b_{t}$. Importantly, these constants do not depend on the probability measures $\mu$. We also write $c,c_{\epsilon},c_{n},c_{n,\epsilon},$ and $c_{m,n}$ when the constants do not depend on the time horizon.

2 Statement of the main theorems

2.1 First variational equation on Hilbert spaces

As expected, the Fréchet differential $\partial\psi_{s,t}(Y)$ of the stochastic flow $\psi_{s,t}(Y)$ associated with the stochastic differential equation (1.6) satisfies a Hilbert space-valued linear equation (cf. (4.1)). The drift-matrix of this evolution equation is given by the Fréchet differential $\partial B_{t}(\psi_{s,t}(Y))$ of the drift function $B_{t}$ evaluated along the solution of the flow. Mimicking the exponential notation of the solution of conventional homogeneous linear systems, the evolution semigroup (a.k.a. propagator) associated with the first variational equation is written as follows

$$\partial\psi_{s,t}(Y)=e^{\oint_{s}^{t}\partial B_{u}(\psi_{s,u}(Y))\,du}\in\mbox{\rm Lin}(\mathbb{H}_{s}(\mathbb{R}^{d}),\mathbb{H}_{t}(\mathbb{R}^{d}))$$

The above exponential is understood as an operator valued Peano-Baker series [64]. A more detailed presentation of these models is provided in section 4.

The $\mathbb{H}_{t}(\mathbb{R}^{d})$-log-norm of an operator $T_{t}\in\mbox{\rm Lin}(\mathbb{H}_{t}(\mathbb{R}^{d}),\mathbb{H}_{t}(\mathbb{R}^{d}))$ is defined by

$$\gamma(T_{t}):=\sup_{\|Z\|_{\mathbb{H}_{t}(\mathbb{R}^{d})}=1}\langle Z,(T_{t}+T_{t}^{\star})/2\cdot Z\rangle_{\mathbb{H}_{t}(\mathbb{R}^{d})}$$

Our first main result is an extension of an inequality of Coppel [22] to tangent processes associated with Hilbert-space valued stochastic flows.

Theorem 2.1.

For any time horizon $t\geq s$ and any $Y\in\mathbb{H}_{s}(\mathbb{R}^{d})$ we have the log-norm estimate

[TABLE]

In addition, we have

[TABLE]

The proof of the above theorem is provided in section 4.1.
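As a finite dimensional illustration of this type of log-norm estimate, the following Python sketch checks Coppel's inequality $\|e^{tA}\|\leq e^{t\mu(A)}$ for a constant $(2\times 2)$ -matrix, where $\mu(A)$ is the largest eigenvalue of the symmetric part $(A+A^{\prime})/2$ . The matrix `A`, the helper names and the truncation level of the exponential series are assumptions made for this example only; the Hilbert space-valued setting of theorem 2.1 is not reproduced here.

```python
import math


def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def mat_exp(a, t, terms=60):
    """Truncated power series for exp(t * a) (2x2 case)."""
    scaled = [[t * a[i][j] for j in range(2)] for i in range(2)]
    result = [[1.0, 0.0], [0.0, 1.0]]
    term = [[1.0, 0.0], [0.0, 1.0]]
    for n in range(1, terms):
        # term now holds (t a)^n / n!
        term = [[v / n for v in row] for row in mat_mul(term, scaled)]
        result = [[result[i][j] + term[i][j] for j in range(2)]
                  for i in range(2)]
    return result


def sym_max_eig(s):
    """Largest eigenvalue of a symmetric 2x2 matrix."""
    mean = (s[0][0] + s[1][1]) / 2.0
    radius = math.hypot((s[0][0] - s[1][1]) / 2.0, s[0][1])
    return mean + radius


def log_norm(a):
    """Log-norm for the Euclidean norm: top eigenvalue of (a + a')/2."""
    sym = [[(a[i][j] + a[j][i]) / 2.0 for j in range(2)] for i in range(2)]
    return sym_max_eig(sym)


def spectral_norm(m):
    """Spectral norm: square root of the top eigenvalue of m'm."""
    gram = [[sum(m[k][i] * m[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]
    return math.sqrt(sym_max_eig(gram))


# Hypothetical test matrix with a negative log-norm.
A = [[-2.0, 1.0], [0.0, -3.0]]

if __name__ == "__main__":
    for t in (0.1, 0.5, 1.0, 2.0):
        print(t, spectral_norm(mat_exp(A, t)), math.exp(t * log_norm(A)))
```

In this toy example the log-norm is negative, so the bound decays exponentially in $t$ even though the matrix is not symmetric.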

Let $Y_{0},Y_{1}\in\mathbb{H}_{s}(\mathbb{R}^{d})$ be a pair of random variables with distributions $(\mu_{0},\mu_{1})\in P_{2}(\mathbb{R}^{d})^{2}$ . Also let $\mu_{\epsilon}$ be the probability distribution of the random variable

[TABLE]

This observation combined with the above theorem yields an alternative and more direct proof of an exponential Wasserstein contraction estimate obtained in [5]. Namely, using (2.2) we readily check the $\mathbb{W}_{2}$ -exponential contraction inequality

[TABLE]
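A contraction of this type can be observed numerically on a toy model. The Python sketch below runs two one dimensional particle systems driven by the same Gaussian increments (a synchronous coupling) and compares the empirical $\mathbb{W}_{2}$ -distance at the initial and final times. The linear drift $b(x,y)=-2x+y$ , the Euler discretization, the initial laws and all function names are assumptions chosen for illustration; the decay rate observed here is a property of this toy drift, not the parameter $\lambda_{0}$ of the article.

```python
import math
import random


def drift(x, y):
    # Hypothetical interaction drift b(x, y) = -2x + y (an assumption).
    return -2.0 * x + y


def euler_step(particles, noises, dt):
    """One Euler step; the drift is linear in y, so averaging b(x, .)
    over the particles equals b(x, empirical mean)."""
    mean = sum(particles) / len(particles)
    return [x + drift(x, mean) * dt + math.sqrt(dt) * w
            for x, w in zip(particles, noises)]


def w2_empirical_1d(xs, ys):
    """W2 distance between two 1d empirical measures with the same number
    of atoms: the optimal coupling pairs the sorted samples."""
    xs, ys = sorted(xs), sorted(ys)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(xs, ys)) / len(xs))


def coupled_distances(n_particles=100, n_steps=200, dt=0.01, seed=0):
    """Return the empirical W2 distance at time 0 and at time n_steps*dt."""
    rng = random.Random(seed)
    # Sorting both initial samples makes the index pairing optimal at time 0.
    xs = sorted(rng.gauss(0.0, 1.0) for _ in range(n_particles))
    ys = sorted(rng.gauss(3.0, 2.0) for _ in range(n_particles))
    start = w2_empirical_1d(xs, ys)
    for _ in range(n_steps):
        # Same noise for both systems: synchronous coupling.
        noises = [rng.gauss(0.0, 1.0) for _ in range(n_particles)]
        xs = euler_step(xs, noises, dt)
        ys = euler_step(ys, noises, dt)
    return start, w2_empirical_1d(xs, ys)


if __name__ == "__main__":
    print(coupled_distances())
```

Since the noise cancels in the difference of the two systems, the distance between coupled particles evolves deterministically, which is why the contraction can be checked path by path in this example.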

For any function $f\in{\cal C}^{1}(\mathbb{R}^{d})$ with bounded derivative we also quote the first order expansion

[TABLE]

In the above display, $\langle\cdot,\cdot\rangle_{\,\mathbb{H}_{t}(\mathbb{R}^{d})}$ stands for the conventional inner product on $\mathbb{L}_{2}((\Omega,\mathbb{F}_{t},\mathbb{P}),\mathbb{R}^{d})$ . The above assertion is a direct consequence of theorem 4.8.

2.2 Taylor expansions with remainder

The first expansion presented in this section is a first order linearization of the measure valued mapping $\phi_{s,t}$ in terms of a semigroup of linear integro-differential operators.

Theorem 2.2.

For any $m,n\geq 1$ and $\mu_{0},\mu_{1}\in P_{m\vee 2}(\mathbb{R}^{d})$ , there exists a semigroup of linear operators $D_{\mu_{1},\mu_{0}}\phi_{s,t}$ from ${\cal C}^{n}_{m}(\mathbb{R}^{d})$ into itself such that

[TABLE]

In addition, when $(H)$ is satisfied we have the gradient estimate

[TABLE]

The proof of the above theorem, together with a more explicit description of the first order operators $D_{\mu_{1},\mu_{0}}\phi_{s,t}$ , is provided in section 4.3. In (2.6) we can choose $\lambda=\lambda_{1,2}$ , with the parameter $\lambda_{1,2}$ introduced in (1.10). The semigroup property is a consequence of theorem 4.5 and the gradient estimate is a reformulation of the operator norm estimate discussed in (4.14).

We also provide Bismut-Elworthy-Li-type formulae that allow us to extend the gradient and Hessian operators $\nabla^{k}D_{\mu_{1},\mu_{0}}\phi_{s,t}$ with $k=1,2$ to measurable and bounded functions. When the condition $(H)$ is satisfied we show the following exponential estimates

[TABLE]

In addition, we have the Hessian estimate

[TABLE]

The proof of the first assertion can be found in remark 4.7. The proof of the Hessian estimates is a consequence of the decomposition of $\nabla^{2}D_{\mu_{0},\mu_{1}}\phi_{s,t}$ discussed in (5.1) and the Hessian estimates (3.26) and (3.41).

It is worth mentioning that the semigroup property is equivalent to the chain rule formula

[TABLE]

which is valid for any $s\leq u\leq t$ . Without further work, theorem 2.2 also yields the exponential $\mathbb{W}_{1}$ -contraction inequality

[TABLE]

with the same parameter $\lambda$ as in (2.6). In the same vein, the estimate (2.7) yields the total variation estimate

[TABLE]

with the same parameter $\lambda$ as in (2.7). In all the inequalities discussed above we can choose any parameter $\lambda>0$ such that $\lambda<\lambda_{1,2}$ , with the parameter $\lambda_{1,2}$ introduced in (1.10). In the $\mathbb{W}_{1}$ -contraction inequality (2.10) we can choose $\lambda=\lambda_{1,2}$ . A more refined estimate is provided in section 2.4.

The next theorem provides a first order Taylor expansion with remainder.

Theorem 2.3.

For any $m,n\geq 0$ and $\mu_{0},\mu_{1}\in P_{m+2}(\mathbb{R}^{d})$ , there exists a linear operator $D^{2}_{\mu_{1},\mu_{0}}\phi_{s,t}$ from ${\cal C}^{n+2}_{m}(\mathbb{R}^{d})$ into ${\cal C}^{n}_{m+2}(\mathbb{R}^{2d})$ such that

[TABLE]

with the first order operator $D_{\mu_{0}}\phi_{s,t}:=D_{\mu_{0},\mu_{0}}\phi_{s,t}$ introduced in theorem 2.2. In addition, when $(H)$ is satisfied we also have the estimate

[TABLE]

The proof of the above theorem is provided in section 5.2. A more precise description of the second order operator $D^{2}_{\mu_{1},\mu_{0}}\phi_{s,t}$ is provided in (5.9) and (5.13). Using (2.11) and arguing as in the proof of proposition 2.1 in [4], for any twice differentiable function $f$ with bounded derivatives we check the backward evolution equation

[TABLE]

with the first order operator $D_{\mu}\phi_{s,t}$ introduced in theorem 2.3. The above equation is a central tool to derive an extended version of the Alekseev-Gröbner lemma [1, 42] to measure valued semigroups and interacting diffusions (cf. theorem 2.7).

The next theorem provides a second order Taylor expansion with remainder.

Theorem 2.4.

For any $m,n\geq 1$ and $\mu_{0},\mu_{1}\in P_{m+4}(\mathbb{R}^{d})$ , there exists a linear operator $D^{3}_{\mu_{1},\mu_{0}}\phi_{s,t}$ from ${\cal C}^{n+3}_{m}(\mathbb{R}^{d})$ into ${\cal C}^{n}_{m+4}(\mathbb{R}^{3d})$ such that

[TABLE]

with the second order operator $D^{2}_{\mu_{0}}\phi_{s,t}:=D^{2}_{\mu_{0},\mu_{0}}\phi_{s,t}$ introduced in theorem 2.3. In addition, when $(H)$ is satisfied we have the third order estimate

[TABLE]

The proof of the first part of the above theorem is provided in section 5.3. We can choose in (2.15) any parameter $\lambda>0$ such that $\lambda<\lambda_{1,2}$ , with the parameter $\lambda_{1,2}$ introduced in (1.10). The proof of the third order estimate (2.15) is rather technical, so it is deferred to the appendix (see the proof of the estimate (2.15)).

2.3 Illustrations

The first part of this section states in more detail the almost sure expansions discussed in (1.5). Up to some differential calculus technicalities, this result is a more or less direct consequence of the Taylor expansions with remainder presented in theorem 2.3 and theorem 2.4, combined with a backward formula presented in [5].

The second part of this section is concerned with a second order extension of the Alekseev-Gröbner lemma to nonlinear measure valued semigroups and interacting diffusion flows. This second order stochastic perturbation analysis is also mainly based on the second order Taylor expansion with remainder presented in theorem 2.4.

In the further development of this section without further mention we shall assume that condition $(H)$ is satisfied.

2.3.1 Almost sure expansions

We recall the backward formula

[TABLE]

The above formula combined with (2.4) and the tangent process estimates presented in section 3.3 yields the uniform almost sure estimates

[TABLE]

The above estimate is a consequence of (2.4) and conventional exponential estimates of the tangent process $\nabla X^{\mu}_{s,t}$ (cf. for instance (3.2)). A detailed proof of this claim and the backward formula (2.16) can be found in [5].

We extend the operators $D^{k}_{\mu}\phi_{s,t}$ introduced in theorem 2.4 to tensor valued functions $f=(f_{i})_{i\in[n]}$ with $i=(i_{1},\ldots,i_{n})\in[n]:=\{1,\ldots,d\}^{n}$ by considering the same type of tensor function with entries

[TABLE]

for any $(x,y)\in\mathbb{R}^{2d}$ . A brief review on tensor spaces is provided in section 3.1. We also consider the function

[TABLE]

Combining the first order formulae stated in theorem 2.3 with conventional Taylor expansions we check the following theorem.

Theorem 2.5.

For any $x\in\mathbb{R}^{d}$ , $\mu_{0},\mu_{1}\in P_{2}(\mathbb{R}^{d})$ and $s\leq t$ we have the almost sure expansion

[TABLE]

with the second order remainder function $\Delta^{[2],\mu_{0},\mu_{1}}_{s,t}$ such that

[TABLE]

The detailed proof of the above theorem is provided in the appendix (see the proof of theorem 2.6).

Second order expansions are expressed in terms of the functions defined for any $(x,y)\in\mathbb{R}^{2d}$ and for any $z\in\mathbb{R}^{2d}$ by the formulae

[TABLE]

We associate with these objects the function $D^{2}_{\mu_{0}}X^{\mu_{0}}_{s,t}$ defined by

[TABLE]

In the above display, $D^{[i,1]}_{\mu}X^{\mu}_{s,u}$ stands for the functions given by

[TABLE]

We are now in position to state the main result of this section.

Theorem 2.6.

For any $x\in\mathbb{R}^{d}$ , $\mu_{0},\mu_{1}\in P_{2}(\mathbb{R}^{d})$ and $s\leq t$ we have the almost sure expansion

[TABLE]

with a third order remainder function $\Delta^{[3],\mu_{1},\mu_{0}}_{s,t}$ such that

[TABLE]

The proof of the above theorem is provided in the appendix (see the proof of theorem 2.6). In the remainder term estimates presented in the above theorems, we can choose any parameter $\lambda>0$ such that $\lambda<\lambda_{1,2}$ , with the parameter $\lambda_{1,2}$ introduced in (1.10).

2.3.2 Interacting diffusions

For any $N\geq 2$ , the $N$ -mean field particle interpretation associated with a collection of generators $L_{t,\eta}$ is defined by the Markov process $\xi_{t}=\left(\xi_{t}^{i}\right)_{1\leq i\leq N}\in(\mathbb{R}^{d})^{N}$ with generators $\Lambda_{t}$ given for any sufficiently smooth function $F$ and any $x=(x^{i})_{1\leq i\leq N}\in(\mathbb{R}^{d})^{N}$ by

[TABLE]

with the function

[TABLE]
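To fix ideas, the mean field particle interpretation can be sketched with an Euler discretization in which each particle is driven by the drift averaged over the empirical measure of the system, with a unit diffusion coefficient as in the nonlinear diffusion recalled in the introduction. In the Python sketch below the one dimensional drift $b(x,y)=-x-\sin(x-y)$ , the time step, the initial condition and the function names are hypothetical choices made for this example; they are not taken from the article.

```python
import math
import random


def interaction_drift(x, y):
    # Hypothetical drift b(x, y): linear confinement plus an odd
    # interaction term (an assumption made for this sketch).
    return -x - math.sin(x - y)


def mean_field_drift(x, particles):
    """Drift b(x, m) with m the empirical measure of the particles,
    that is the average of b(x, y) over all particles y."""
    return sum(interaction_drift(x, y) for y in particles) / len(particles)


def simulate_particles(n_particles=50, n_steps=250, dt=0.02, x0=3.0, seed=0):
    """Euler scheme for the N-particle system with unit diffusion."""
    rng = random.Random(seed)
    particles = [x0] * n_particles
    for _ in range(n_steps):
        # Freeze the current configuration before updating any particle.
        drifts = [mean_field_drift(x, particles) for x in particles]
        particles = [x + d * dt + math.sqrt(dt) * rng.gauss(0.0, 1.0)
                     for x, d in zip(particles, drifts)]
    return particles


if __name__ == "__main__":
    final = simulate_particles()
    print(sum(final) / len(final))
```

With this particular drift the interaction terms cancel when averaged over all pairs of particles, so the empirical mean follows a discretized Ornstein-Uhlenbeck recursion and relaxes towards the origin.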

We extend $L_{t,\mu}$ to symmetric functions $F(x^{1},x^{2})$ on $\mathbb{R}^{2d}$ by setting

[TABLE]

In this notation, in our context we readily check that

[TABLE]

for any symmetric function $F(x^{1},x^{2})=F(x^{2},x^{1})$ , with the function $\Gamma(F)$ on $\mathbb{R}^{d}$ defined for any $y\in\mathbb{R}^{d}$ by the formula

[TABLE]

A proof of the above formula is provided in the appendix (see the proof of (2.22)). Applying Itô’s formula, for any smooth function $g:t\in[0,\infty[\mapsto g_{t}\in{\cal C}^{2}_{b}(\mathbb{R}^{d})$ we prove that

[TABLE]

In the above display, $g\mapsto M_{t}(g)$ stands for a martingale random field with angle bracket

[TABLE]

The above evolution equation is rather standard in mean field type interacting particle system theory; a detailed proof can be found in [29] (see for instance section 4.3). In the same vein, with a slight abuse of notation, using (2.22) we have

[TABLE]

We fix a final time horizon $t\geq 0$ and we denote by

[TABLE]

the martingale associated with the predictable function

[TABLE]

Combining the Itô formula with the tensor product formula (2.22) and with the backward formula (2.13) we obtain

[TABLE]

This implies that

[TABLE]

This yields the following theorem.

Theorem 2.7.

For any time horizon $t\geq 0$ , the interpolating semigroup $s\in[0,t]\mapsto\phi_{s,t}(m_{s})$ satisfies for any $f\in{\cal C}^{2}(\mathbb{R}^{d})$ with $\sup_{k=1,2}\|\nabla^{k}f\|\leq 1$ the evolution equation

[TABLE]

The above theorem can be seen as a second order extension of the Alekseev-Gröbner lemma [1, 42] to nonlinear measure valued and stochastic semigroups. This result also extends the perturbation theorem obtained in [4] (cf. theorem 3.6) in the context of interacting jump processes to McKean-Vlasov diffusions. The discrete time version of the backward perturbation analysis described above can also be found in [27, 28, 30] in the context of Feynman-Kac particle models (see also [25, 26, 31]).

We end this section with some direct consequences of the above theorem. Firstly, using (2.6) and (2.12) we have the almost sure estimates

[TABLE]

Without further work, the above inequality yields the uniform bias estimate stated in the r.h.s. of (1.13), for any twice differentiable function $f$ with bounded derivatives. Using well known martingale concentration inequalities (cf. for instance lemma 3.2 in [60]), there exists some finite parameter $c$ such that for any $t\geq 0$ and any $\delta\geq 1$ the probability of the following event

[TABLE]

is greater than $1-e^{-\delta}$ . In addition, using the Burkholder-Davis-Gundy inequality, for any $n\geq 1$ we obtain the time uniform estimates stated in the r.h.s. of (1.12). On the other hand, using (2.5) and (2.6) we have the almost sure exponential contraction inequality

[TABLE]

This yields the bias estimates

[TABLE]

for any twice differentiable function $f$ with bounded derivatives. The r.h.s. estimate comes from well known estimates of the average of the Wasserstein distance for occupation measures, see for instance [38] and the more recent studies [40, 56]. The above inequality yields the following uniform bias estimate

[TABLE]

2.4 Comments on the regularity conditions

We discuss in this section the regularity condition $(H)$ introduced in (1.9). We illustrate these spectral conditions for linear-drift and gradient flow models. Comparisons with related conditions presented in other works are also provided.

Firstly, we mention that the condition stated in (1.9) has been introduced in the article [5] to derive several Wasserstein exponential contraction inequalities as well as uniform propagation of chaos estimates w.r.t. the time horizon.

Using the log-norm triangle inequality and recalling that the log-norm is dominated by the spectral norm we check that

[TABLE]

Choosing $\lambda_{0}$ and $\lambda_{1}$ as the supremum of the maximal eigenvalue functional of the matrices $A_{t}(x_{1},x_{2})_{\tiny sym}$ and $b_{t}^{[1]}(x_{1},x_{2})_{\tiny sym}$ , the Cauchy interlacing theorem (see for instance [55] on page 294) yields $\lambda_{1}\geq\lambda_{0}\geq\lambda_{1,2}$ .

For linear drift functions

[TABLE]

the matrix $A_{t}(x_{1},x_{2})_{\tiny sym}$ reduces to the two-by-two block partitioned matrix

[TABLE]

In this situation the diffusion flow $X_{s,t}^{\mu}(x)\in\mathbb{R}^{d}$ is given by the formula

[TABLE]

In the one dimensional case we have

[TABLE]

Nonlinear Langevin diffusions are associated with the drift function

[TABLE]

some confinement type potential function $U$ (a.k.a. the exterior potential) and some interaction potential function $V$ . In this context we have

[TABLE]

When the potential function $V$ is even and convex we have

[TABLE]

Conversely, when the function $V$ is odd we have the formula

[TABLE]

In both situations, condition $(H)$ is satisfied when the strength of the confinement type potential dominates the one of the interaction potential; that is when we have that

[TABLE]

The decay rate $\lambda_{0}$ in the $\mathbb{W}_{2}$ -contraction inequality (2.4) is larger than the decay rate $\lambda_{1,2}$ in the $\mathbb{W}_{1}$ -contraction inequality (2.10). In addition, the $\mathbb{W}_{1}$ -exponential stability requires that $\lambda_{0}$ dominates the spectral norm of the matrix $b^{[2]}$ . Next we provide a more refined analysis based on the proof of the $\mathbb{W}_{2}$ -contraction inequality presented in [5]. Using the interpolating paths $(Y_{\epsilon},\mu_{\epsilon})$ introduced in (2.3) we set

[TABLE]

In the above display $(\overline{X}^{\mu_{\epsilon}}_{s,t}(x),\overline{Y}_{\epsilon})$ stands for an independent copy of $(X^{\mu_{\epsilon}}_{s,t}(x),Y_{\epsilon})$ . Arguing as in [5] we have

[TABLE]

We consider the symmetric and anti-symmetric matrices

[TABLE]

and we set

[TABLE]

By symmetry arguments and using some elementary manipulations we check the formula

[TABLE]

This shows that

[TABLE]

with the parameter $\widehat{\lambda}_{1,2}$ given by

[TABLE]

We conclude that the $\mathbb{W}_{1}$ -contraction inequality (2.10) is met with $\lambda=\widehat{\lambda}_{1,2}$ .

In a more recent article [69] the author presents some Wasserstein contraction inequalities of the same form as in (2.4) with $\lambda_{0}$ replaced by some parameter $\lambda^{-}_{0}=(\kappa_{1}-\kappa_{2})$ , under the assumption

[TABLE]

Taking Dirac measures $\mu_{1}=\delta_{x_{2}}$ and $\mu_{2}=\delta_{y_{2}}$ we check that the above condition is equivalent to the fact that

[TABLE]

By symmetry arguments this implies that

[TABLE]

For the linear drift model discussed in (2.25) the above condition reads

[TABLE]

We also have $(\ref{Hilbert-condition-sym})\Longrightarrow(\ref{Hilbert-condition})$ with $\lambda=\lambda^{-}_{0}$ .

2.5 Comparisons with existing literature

The perturbation analysis developed in the article differs from the Otto differential calculus on $(P_{2}(\mathbb{R}^{d}),\mathbb{W}_{2})$ introduced in [61] and further developed by Ambrosio and his co-authors [2, 3] and Otto and Villani in [62]. These sophisticated gradient flow techniques in Wasserstein metric spaces are based on optimal transport theory.

The central idea is to interpret $P_{2}(\mathbb{R}^{d})$ as an infinite dimensional Riemannian manifold. In this context, the Benamou-Brenier formulation of the Wasserstein distance provides a natural way to define geodesics, gradients and Hessians w.r.t. the Wasserstein distance. The details of these gradient flow techniques are beyond the scope of the semigroup perturbation analysis considered herein.

This methodology is mainly used to quantify the entropy dissipation of Langevin-type nonlinear diffusions. Thus, it cannot be used to derive any Taylor expansion of the form (1.4) nor to analyze the stability properties of more general classes of McKean-Vlasov diffusions.

Besides some interesting contact points, the methodology developed in the present article doesn’t rely on the more recent differential calculus on $(P_{2}(\mathbb{R}^{d}),\mathbb{W}_{2})$ developed by P.L. Lions and his co-authors in the seminal works on mean field game theory [14, 43]. In this context, the first order Lions differential of a smooth function from $P_{2}(\mathbb{R}^{d})$ into $\mathbb{R}$ is defined as the conventional derivative of a lifted real valued function acting on the Hilbert space of square integrable random variables. In this interpretation, for a given test function, say $f$ , the gradient $\nabla D_{\mu}\phi_{s,t}(f)(Y)$ of the first order differential in (1.4) can be seen as the Lions derivative $(\delta u_{s,t}/\delta\mu)(Y)$ of the lifted scalar function $Y\mapsto u_{s,t}(Y):=\mathbb{E}(f(X_{s,t}^{\mu}(Y)))$ , for some random variable $Y$ with distribution $\mu$ .

In the recent book [15], to distinguish these two notions, the authors called the random variable $D_{\mu}\phi_{s,t}(f)(Y)$ the linear functional derivative. For a more thorough discussion on the origins and the recent developments in mean field game theory, we refer to the book [15] as well as the more recent articles [13, 19, 23] and the references therein.

To the best of our knowledge, most of the literature on Lions’ derivatives is concerned with existence theorems without a refined analysis of the exponential decays of these differentials w.r.t. the time parameter. Last but not least, from the practical point of view all differential estimates we found in the literature are rather disappointing since, on careful inspection, they grow exponentially fast with respect to the time horizon (cf. for instance [13, 19, 20, 23]).

Taylor expansions of the form (1.4) have already been discussed in the book [26] for discrete time nonlinear measure valued semigroups (cf. for instance chapters 3 and 10). We also refer to the more recent article [4] in the context of continuous time Feynman-Kac semigroups. In this context, we emphasize that the semigroup $\phi_{s,t}(\mu)$ is explicitly given by a normalization of a linear semigroup of positive operators. Thus, a fairly simple Taylor expansion yields the second order formula (1.4). In contrast with Feynman-Kac models, McKean-Vlasov semigroups don’t have any explicit form nor an analytical description. As a result, none of the above methodologies can be used to analyze nonlinear diffusions.

The second order perturbation analysis discussed in this article has been used with success in [27, 28, 30] to analyze the stability properties of Feynman-Kac type particle models, as well as the fluctuations and the exponential concentration of this class of interacting jump processes; see also [34, 37] for general classes of discrete generation mean field particle systems, as well as chapter 7 in [25] and [4, 31] for continuous time models.

These second order perturbation techniques have also been extended in the seminal book by V.N. Kolokoltsov [52] to general classes of nonlinear Markov processes and kinetic equations. Chapter 8 in [52] is dedicated to the analysis of the first and the second order derivatives of nonlinear semigroups with respect to initial data. The use of the first and the second order derivatives in the analysis of central limit theorems and propagation of chaos properties respectively is developed in Chapters 9 and 10 in [52]. We underline that these results are obtained for diffusion processes as well as for jump-type processes and their combinations, see also [53, 54].

Nevertheless none of these studies can be used to derive the non asymptotic Taylor expansions (2.14) and (2.20) with exponential decay-type remainder estimates for McKean-Vlasov diffusions nor to estimate the stability properties of the associated semigroups. In addition, to the best of our knowledge the stochastic perturbation theorem 2.7 is the first result of this type for mean field type interacting diffusions.

Last but not least, the idea of considering the flow of empirical measures $m(\xi_{t})$ of a mean field particle model as a stochastic perturbation of the limiting flow $\phi_{0,t}(\mu_{0})$ certainly goes back to the work by Dawson [24], itself based on the martingale approach developed by Papanicolaou, Stroock and Varadhan in [63], published at the end of the 1970s. These two works are mainly centered on fluctuation type limit theorems. They don’t discuss any Taylor expansion on the limiting semigroup $\phi_{s,t}$ nor any question related to the stability properties of the underlying processes.

3 Some preliminary results

The first part of this section provides a review of tensor product theory and Fréchet differential on Hilbert spaces. Section 3.1 is concerned with conventional tensor products and Fréchet derivatives. Section 3.2 provides a short introduction to tensor integral operators.

In the second part of this section we review some basic tools of the theory of stochastic variational equations, including some differential properties of Markov semigroups. Section 3.3 is dedicated to variational equations. Section 3.5 discusses Bismut-Elworthy-Li extension formulae. We also provide some exponential inequalities for the gradient and the Hessian operators on bounded measurable functions.

The differential operators arising in the Taylor expansions (1.4) are defined in terms of tensor integral operators that depend on the gradient of the drift function $b_{t}(x_{1},x_{2})$ of the nonlinear diffusion. These integro-differential operators are described in section 3.6. The last section, section 3.7, provides some differential formulae as well as some exponential decay estimates of the norm of these operators w.r.t. the time horizon.

3.1 Fréchet differential

We let $[n]$ stand for the set of $n$ -multiple indices $i=(i_{1},\ldots,i_{n})\in{\cal I}^{n}$ over some finite set ${\cal I}$ . Notice that $[n_{1}]\times[n_{2}]=[n_{1}+n_{2}]$ . We denote by ${\cal T}_{p,q}({\cal I})$ the space of $(p,q)$ -tensors $X$ with real entries $(X_{i,j})_{(i,j)\in[p]\times[q]}$ . Given a $(p_{1},q_{1})$ -tensor $X$ and a $(p_{2},q_{2})$ -tensor $Y$ we denote by $(X\otimes Y)$ the $((p_{1}+q_{1}),(p_{2}+q_{2}))$ -tensor defined by

[TABLE]

For a given $(p_{1},q)$ -tensor $X$ and a given $(q,p_{2})$ tensor $Y$ , the product $XY$ and the transposition $Y^{\prime}$ are the $(p_{1},p_{2})$ and $(p_{2},q)$ tensors with entries

[TABLE]

We equip ${\cal T}_{p,q}({\cal I})$ with the Frobenius inner product

[TABLE]

Identifying $(1,0)$ -tensors ${\cal T}_{1,0}({\cal I})=\mathbb{R}^{{\cal I}}$ with column vectors $(X_{i})_{i\in{\cal I}}\in\mathbb{R}^{{\cal I}}$ the above quantities coincide with the conventional Euclidean inner product and norm on the product space $\mathbb{R}^{{\cal I}}$ . When ${\cal I}=\{1,\ldots,d\}$ we simplify notation and we write $\mathbb{R}^{d}$ instead of $\mathbb{R}^{\{1,\ldots,d\}}$ . For any tensors $X$ and $Y$ with appropriate dimensions, using the Cauchy-Schwarz inequality we check that

[TABLE]
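In the special case of matrices, the Frobenius inner product and the Cauchy-Schwarz type bounds it satisfies can be checked numerically. The short Python sketch below does so for two $(2\times 2)$ -matrices; the restriction to matrices, the sample entries and the function names are assumptions made for this example, and the two inequalities verified are the standard ones $|\langle X,Y\rangle|\leq\|X\|\,\|Y\|$ and $\|XY\|\leq\|X\|\,\|Y\|$ rather than a transcription of the display above.

```python
import math


def frobenius_inner(x, y):
    """Frobenius inner product: sum of entry-wise products."""
    return sum(a * b for row_x, row_y in zip(x, y)
               for a, b in zip(row_x, row_y))


def frobenius_norm(x):
    """Norm induced by the Frobenius inner product."""
    return math.sqrt(frobenius_inner(x, x))


def transpose(x):
    return [list(col) for col in zip(*x)]


def mat_mul(x, y):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)]
            for row in x]


def trace(x):
    return sum(x[i][i] for i in range(len(x)))


# Hypothetical sample matrices.
X = [[1.0, 2.0], [3.0, 4.0]]
Y = [[0.0, -1.0], [2.0, 5.0]]

if __name__ == "__main__":
    print(frobenius_inner(X, Y), trace(mat_mul(transpose(X), Y)))
    print(frobenius_norm(mat_mul(X, Y)), frobenius_norm(X) * frobenius_norm(Y))
```

The first printed pair illustrates the identity $\langle X,Y\rangle=\mbox{\rm tr}(X^{\prime}Y)$ , which is often the most convenient way to manipulate this inner product.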

Let $\mathbb{H}({\cal T}_{p,q}({\cal I})):=\mathbb{L}_{2}((\Omega,\mathbb{F},\mathbb{P}),{\cal T}_{p,q}({\cal I}))$ be the Hilbert space of ${\cal T}_{p,q}({\cal I})$ -valued random variables defined on some probability space $(\Omega,\mathbb{F},\mathbb{P})$ , equipped with the inner product

[TABLE]

induced by the inner product $\langle X,Y\rangle$ on ${\cal T}_{p,q}({\cal I})$ . We denote by $\mathbb{E}(X)=\mathbb{E}(X_{i,j})_{(i,j)\in[p]\times[q]}$ the entry-wise expected value of a $(p,q)$ -tensor.

When ${\cal I}=\{1,\ldots,d\}$ and $(p,q)=(1,0)$ the space $\mathbb{H}({\cal T}_{p,q}({\cal I}))$ coincides with the Hilbert space $\mathbb{H}(\mathbb{R}^{d})=\mathbb{L}_{2}((\Omega,\mathbb{F},\mathbb{P}),\mathbb{R}^{d})$ of square integrable $\mathbb{R}^{d}$ -valued and $\mathbb{F}$ -measurable random variables.

We denote by

[TABLE]

the non decreasing sequence of Hilbert spaces associated with some increasing filtration $\mathbb{F}_{n}\subset\mathbb{F}_{n+1}$ .

In Landau notation, we recall that a function

[TABLE]

is said to be Fréchet differentiable at $X$ if there exists a continuous map

[TABLE]

such that

[TABLE]

3.2 Tensor integral operators

Let ${\cal B}(E,{\cal T}_{p,q}({\cal I}))$ be the set of bounded measurable functions from a measurable space $E$ into some tensor space ${\cal T}_{p,q}({\cal I})$ . Signed measures $\mu$ on $E$ act on bounded measurable functions $g$ from $E$ into $\mathbb{R}$ . We extend these integral operators to tensor valued functions $g=(g_{i,j})_{(i,j)\in[p]\times[q]}\in{\cal B}(E,{\cal T}_{p,q}({\cal I}))$ by setting for any $(i,j)\in[p]\times[q]$

[TABLE]

Let $(E,{\cal E})$ and $(F,{\cal F})$ be some pair of measurable spaces. A $(p,q)$ -tensor integral operator

[TABLE]

is defined for $r\geq 0$ and $g\in{\cal B}(F,{\cal T}_{q,r}({\cal I}))$ by the tensor valued and measurable function ${\cal Q}(g)$ with entries given for any $x\in E$ and $(i,j)\in([p]\times[r])$ by the integral formula

[TABLE]

for some collection of integral operators ${\cal Q}_{i,k}(x_{1},dx_{2})$ from ${\cal B}(E,\mathbb{R})$ into ${\cal B}(F,\mathbb{R})$ . We also consider the operator norm

[TABLE]

The tensor product $({\cal Q}^{1}\otimes{\cal Q}^{2})$ of a couple of $(p_{i},q_{i})$ -tensor integral operators

[TABLE]

is a $(p,q)$ -tensor integral operator

[TABLE]

with the product spaces

[TABLE]

The entries of $({\cal Q}^{1}\otimes{\cal Q}^{2})(h)$ are given for any $x=(x_{1},x_{2})$ and any pair of multi-indices $i=(i_{1},i_{2})\in([p_{1}]\times[p_{2}])$ , $j=(j_{1},j_{2})\in([r_{1}]\times[r_{2}])$ by the integral formula

[TABLE]

with the tensor product measures defined for any $k=(k_{1},k_{2})\in([q_{1}]\times[q_{2}])$ and any $y=(y_{1},y_{2})$ by

[TABLE]

3.3 Variational equations

The gradient and the Hessian of a multivariate smooth function $h(x)=(h_{i}(x))_{i\in[p]}$ are defined by the $(1,p)$ and $(2,p)$ tensors $\nabla h(x)$ and $\nabla^{2}h(x)$ with entries given for any $1\leq k,l\leq d$ and $i\in[p]$ by the formula

[TABLE]

We consider the tensor valued functions $b_{t}^{[k_{1},k_{2}]}$ and $b_{t}^{[k_{1},k_{2},k_{3}]}$ defined for any $k_{1},k_{2},k_{3}=1,2$ by

[TABLE]

with the $(2,1)$ and $(3,1)$ -tensor valued functions

[TABLE]

In the above display, $\partial_{x_{k}^{i}}b^{j}_{t}(x_{1},x_{2})$ stands for the partial derivative of the scalar function $b_{t}^{j}(x_{1},x_{2})$ w.r.t. the coordinate $x_{k}^{i}$ , with the drift function $b_{t}(x_{1},x_{2})$ from $\mathbb{R}^{2d}$ into $\mathbb{R}^{d}$ introduced in section 1.1. In the same vein, $\partial_{x_{k_{1}}^{i_{1}}}\partial_{x_{k_{2}}^{i_{2}}}b^{j}_{t}(x_{1},x_{2})$ and $\partial_{x_{k_{1}}^{i_{1}}}\partial_{x_{k_{2}}^{i_{2}}}\partial_{x_{k_{3}}^{i_{3}}}b^{j}_{t}(x_{1},x_{2})$ stand for the second and third partial derivatives of $b_{t}^{j}(x_{1},x_{2})$ w.r.t. the coordinates $x_{k_{1}}^{i_{1}}$ , $x_{k_{2}}^{i_{2}}$ and $x_{k_{3}}^{i_{3}}$ with $k_{1},k_{2},k_{3}\in\{1,2\}$ .

For any $\mu\in P_{2}(\mathbb{R}^{d})$ and $x_{1}\in\mathbb{R}^{d}$ we also consider the tensor functions

[TABLE]

Recalling that $b_{t}(x,\phi_{s,t}(\mu))$ has continuous and uniformly bounded derivatives up to the third order, the stochastic flow $x\mapsto X_{s,t}^{\mu}(x)$ is a twice differentiable function of the initial state $x$ . In addition, when $(H)$ holds the gradient $\nabla X^{\mu}_{s,t}(x)$ of the diffusion flow $X^{\mu}_{s,t}(x)$ satisfies the $(d\times d)$ -matrix valued stochastic diffusion equation

[TABLE]

The above estimate is a direct consequence of well known log-norm estimates for exponential semigroups, see for instance [22] as well as section 1.3 in the recent article [11].

We have the stochastic tensor evolution equation

[TABLE]

This implies that

[TABLE]

from which we check that

[TABLE]

Using (3.2), this yields the estimate

[TABLE]

More generally, using the multivariate version of the de Faà di Bruno derivation formula [21] (see also formula (5.14) in the appendix), for any $n\geq 1$ we also check the uniform estimate

[TABLE]

A detailed proof is provided in the appendix (see the proof of (3.4)).

3.4 Differential of Markov semigroups

We have the commutation formula

[TABLE]

with the $(1,1)$ -tensor integral operator ${\cal P}_{s,t}^{\mu}$ defined for any $x\in\mathbb{R}^{d}$ and any differentiable function $f$ on $\mathbb{R}^{d}$ by the formula

[TABLE]

The tensor product of ${\cal P}_{s,t}^{\mu}$ is also given by the $(2,2)$ -tensor integral operator

[TABLE]

In the above display, $\overline{X}_{s,t}^{\mu}(x)$ stands for an independent copy of $X_{s,t}^{\mu}(x)$ and $h=(\nabla\otimes\nabla)g$ stands for the matrix valued function defined in (1.14). We also have the commutation formula

[TABLE]

In the same vein, we have the second order differential formula

[TABLE]

with the $(2,1)$ and $(2,2)$ -tensor integral operators

[TABLE]

Iterating the above procedure, the $n$ -th differential of $P^{\mu}_{s,t}(f)$ at any order $n\geq 1$ takes the form

[TABLE]

for some integral operators ${\cal P}^{[n,k],\mu}_{s,t}$ . For instance, we have the third order differential formula

[TABLE]

with the $(2,1)$ and $(2,2)$ -tensor integral operators

[TABLE]

with the $\overset{\frown}{\otimes}$ -tensor product of type $(3,2)$ given for any $i=(i_{1},i_{2},i_{3})$ and $l=(l_{1},l_{2})$ by

[TABLE]

The above formulae remain valid for any column vector multivariate function $f=(f_{i})_{1\leq i\leq d}$ . An explicit description of the integral operators ${\cal P}^{[n,k],\mu}_{s,t}$ for any $1\leq k\leq n$ can be obtained using multivariate derivations and combinatorial manipulations, see for instance the multivariate version of the de Faà di Bruno derivation formulae (5.14) and (5.15) in the appendix. Following the proof of (3.4) we also check the uniform estimates

[TABLE]

Using the moment estimates (1.15) for any $\mu\in P_{2}(\mathbb{R}^{d})$ , $m,n\geq 0$ , and any $s\leq t$ , we also check the rather crude estimate

[TABLE]

For instance, using the de Faà di Bruno derivation formula (5.15) for any function $f\in{\cal C}^{n}_{m}(\mathbb{R}^{d})$ such that $\|f\|_{{\cal C}^{n}_{m}(\mathbb{R}^{d})}\leq 1$ and for any $0\leq k\leq n$ we check that

[TABLE]

The estimates (1.15) imply that

[TABLE]

from which we conclude that

[TABLE]

3.5 Bismut-Elworthy-Li extension formulae

We have the Bismut-Elworthy-Li formula

[TABLE]

The above formula is valid for any function $\omega_{s,t}:u\in[s,t]\mapsto\omega_{s,t}(u)\in\mathbb{R}$ of the following form

[TABLE]

for some non decreasing differentiable function $\varphi$ on $[0,1]$ with bounded continuous derivatives and such that

[TABLE]
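To illustrate the integration-by-parts mechanism behind formulae of this type, the following minimal sketch treats the simplest possible case, which is an assumption made for illustration and not the nonlinear setting of the article: zero drift, $d=1$ and $s=0$. In that case the classical Bismut-Elworthy-Li identity reads $\partial_{x}\mathbb{E}[f(x+W_{t})]=t^{-1}\,\mathbb{E}[f(x+W_{t})\,W_{t}]$, so the gradient is computed without differentiating $f$. The function name `bel_gradient_estimate` is hypothetical, and the test function $f=\sin$ is chosen because $\mathbb{E}[\sin(x+W_{t})]=\sin(x)\,e^{-t/2}$ is available in closed form.

```python
import math
import random

def bel_gradient_estimate(f, x, t, n_samples, rng):
    """Monte Carlo estimate of d/dx E[f(x + W_t)] using the weight W_t / t."""
    total = 0.0
    for _ in range(n_samples):
        w = rng.gauss(0.0, math.sqrt(t))   # W_t ~ N(0, t)
        total += f(x + w) * w / t          # no derivative of f is needed
    return total / n_samples

rng = random.Random(0)
x, t = 0.3, 1.0
estimate = bel_gradient_estimate(math.sin, x, t, 200_000, rng)
exact = math.cos(x) * math.exp(-t / 2.0)   # derivative of sin(x) * exp(-t / 2)
print(estimate, exact)
```

The two printed numbers should agree up to the Monte Carlo error, which is of order $n^{-1/2}$ in the number of samples.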

In the same vein, for any $s\leq u\leq t$ we have

[TABLE]

with the stochastic process

[TABLE]

Besides the fact that $X^{\mu}_{s,t}(x)$ is a nonlinear diffusion, the proof of the above formula follows the same lines as the one provided in [6, 12, 39, 57, 66] in the context of diffusions on differentiable manifolds. For the convenience of the reader, a detailed proof is provided in the appendix, under the heading Proof of (3.22) and (3.24). Using (3.22), for any $f$ s.t. $\|f\|\leq 1$ we check that

[TABLE]

Let $\varphi_{\epsilon}$ with $\epsilon\in]0,1[$ be some differentiable function on $[0,1]$ vanishing on $[0,1-\epsilon]$ and such that $|\partial\varphi_{\epsilon}(u)|\leq c/\epsilon$ and $(\varphi_{\epsilon}(1-\epsilon),\varphi_{\epsilon}(1))=(0,1)$ ; for instance we can choose

[TABLE]

In this situation, we find the rather crude uniform estimate

[TABLE]

In the same vein, combining (3.24) with the estimate (3.3) for any $\epsilon\in]0,1[$ and $u\in]s,t[$ we also check the rather crude uniform estimate

[TABLE]

Choosing $u=s+(1-\epsilon)(t-s)$ in the above display we readily check that

[TABLE]

3.6 Integro-differential operators

Let $\mathbb{B}^{\mu}_{s,t}(x_{0},x_{1})$ be the matrix-valued function defined for any $(x_{0},x_{1})\in\mathbb{R}^{2d}$ , $\mu\in P_{2}(\mathbb{R}^{d})$ and any $s\leq t$ by the formulae

[TABLE]

For instance, for the linear model discussed in (2.24) we have

[TABLE]

We also consider the collection of Weyl chambers $[s,t]_{n}$ defined for any $n\geq 1$ by

[TABLE]
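As a quick numerical illustration, the sketch below assumes the usual convention $[s,t]_{n}=\{(u_{1},\ldots,u_{n})\,:\,s\leq u_{1}\leq\ldots\leq u_{n}\leq t\}$ for the chamber defined above. Under that assumption the Lebesgue volume of $[s,t]_{n}$ is $(t-s)^{n}/n!$, which is the reason why integrals over these sets sum up to exponential series. The helper names (`in_weyl_chamber`, `mc_volume`, `exact_volume`) and the constant `lam` are hypothetical.

```python
import math
import random

def in_weyl_chamber(u, s, t):
    """True when s <= u_1 <= ... <= u_n <= t."""
    points = [s, *u, t]
    return all(a <= b for a, b in zip(points, points[1:]))

def mc_volume(n, s, t, n_samples, rng):
    """Monte Carlo volume of the chamber, sampling uniformly in the cube [s, t]^n."""
    hits = 0
    for _ in range(n_samples):
        u = [rng.uniform(s, t) for _ in range(n)]
        hits += in_weyl_chamber(u, s, t)
    return (t - s) ** n * hits / n_samples

def exact_volume(n, s, t):
    return (t - s) ** n / math.factorial(n)

rng = random.Random(1)
s, t, lam = 0.0, 2.0, 0.7
estimate = mc_volume(3, s, t, 100_000, rng)
# Summing lam^n * vol([s, t]_n) over n recovers exp(lam * (t - s)).
series = sum(lam ** n * exact_volume(n, s, t) for n in range(40))
print(estimate, exact_volume(3, s, t))
print(series, math.exp(lam * (t - s)))
```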

We consider the space-time Weyl chambers

[TABLE]

The coordinates of a generic point $(u,y)\in\Delta_{s,t}^{n}$ for some $n\geq 1$ are denoted by

[TABLE]

We also use the convention $u_{0}=s$ and $u_{n+1}=t$ . We consider the measures $\Phi_{s,u}(\mu)$ on $\Delta_{s,t}$ given on every set $\Delta_{s,t}^{n}$ and any $n\geq 1$ by

[TABLE]

with the tensor product measures

[TABLE]

Definition 3.1.

Let $b^{\mu}_{s,u}(x,y)$ be the function defined for any $\mu\in P_{2}(\mathbb{R}^{d})$ , $x\in\mathbb{R}^{d}$ , and any $(u,y)\in\Delta_{s,t}^{n}$ and $n\geq 1$ by the formula

[TABLE]

In the above display the product of matrices is understood as a directed product from $k=1$ to $k=(n-1)$ . For instance, for the linear model discussed in (2.24) we have

[TABLE]

For any $x\in\mathbb{R}^{d}$ , and any $(u,y)\in\Delta_{s,t}^{n}$ and $n\geq 1$ we also set

[TABLE]

Definition 3.2.

For any $\mu_{0},\mu_{1}\in P_{2}(\mathbb{R}^{d})$ and $s\leq t$ we let $Q^{\mu_{1},\mu_{0}}_{s,t}$ be the operator defined on differentiable functions $f$ on $\mathbb{R}^{d}$ by

[TABLE]

with the $(0,1)$ -tensor integral operator ${\cal Q}^{\mu_{1},\mu_{0}}_{s,t}$ defined by the integral formula

[TABLE]

Recall that $b_{t}(x,y)$ is differentiable at any order with uniformly bounded derivatives. Thus, using the estimates (1.15) and (3.4), for any $m,n\geq 0$ , $\mu_{0},\mu_{1}\in P_{m\vee 2}(\mathbb{R}^{d})$ we have

[TABLE]

Definition 3.3.

Let $p^{\mu_{1},\mu_{0}}_{s,t}$ be the function defined for any $s\leq t$ and $x,z\in\mathbb{R}^{d}$ by the formula

[TABLE]

In this notation, we readily check the following proposition.

Proposition 3.4.

The $(0,1)$ -tensor integral operator ${\cal Q}^{\mu_{1},\mu_{0}}_{s,t}$ can be rewritten as follows:

[TABLE]

For instance, for the linear model discussed in (2.24) the function $p_{s,t}^{\mu_{1},\mu_{0}}(x,z)$ defined in (3.33) reduces to

[TABLE]

We check this claim by expanding in (3.33) the exponential series coming from the integration over the set $\Delta_{s,t}$ . A detailed proof of the above formula is provided in the appendix, under the heading Proof of (3.34).

3.7 Some differential formulae

The matrix $\nabla_{y_{0}}b_{s,t}^{\mu}(y_{0},y_{1})$ defined in (3.27) can alternatively be written as follows

[TABLE]

We also have the $(2,1)$ and $(3,1)$ -tensor formulae

[TABLE]

For any $(u,y)\in\Delta_{s,t}^{n}$ with $n\geq 1$ and for any $k\geq 1$ we have the $(k,1)$ -tensor formulae

[TABLE]

We consider the $(n,1)$ -tensor valued function

[TABLE]

and we use the convention

[TABLE]

For instance, for the linear model discussed in (2.24) and (3.34) the above objects reduce to

[TABLE]

In this notation, we have the following proposition.

Proposition 3.5.

For any $n\geq 0$ the $n$ -th differential of the operator $Q^{\mu_{1},\mu_{0}}_{s,t}$ is given by the formula

[TABLE]

with the $(n,1)$ -tensor integral operator given by

[TABLE]

In addition, when condition $(H)$ is satisfied for any $n\geq 1$ we have the exponential estimates

[TABLE]

Proof.

The proof of the first assertion follows from (3.33). More precisely, using (3.33) we have

[TABLE]

On the other hand, by proposition 3.4 we also have

[TABLE]

This ends the proof of the first assertion. When condition $(H)$ is satisfied, for any $x\in\mathbb{R}^{d}$ and $(u,y)\in\Delta_{s,t}^{n}$ we have

[TABLE]

Using (3.4) we also check the uniform estimate

[TABLE]

The end of the proof is now a consequence of (3.2).

Proposition 3.6.

For any $n\geq 0$ any bounded function $f$ on $\mathbb{R}^{d}$ and for any function $\omega$ of the form (3.23) we have the Bismut-Elworthy-Li formula

[TABLE]

In the above display, $\tau^{\mu,\omega}_{u,t}(y)$ stands for the stochastic process defined in (3.22). In addition, when condition $(H)$ is satisfied we have the exponential estimates

[TABLE]

Proof.

The proof of the first assertion is a direct application of the Bismut-Elworthy-Li formula (3.22). More precisely, using (3.22) we have

[TABLE]

The formula (3.40) is now a direct consequence of (3.36).

We check (3.41) combining (3.25) with (3.39). This ends the proof of the proposition.

When $n=1$ we drop the upper index and we write $\left(\mathbb{B}_{s,u}^{\mu},q_{s,t}^{\mu_{1},\mu_{0}}\right)$ instead of $\left(\mathbb{B}_{s,u}^{[1],\mu},q_{s,t}^{[1],\mu_{1},\mu_{0}}\right)$ .

The operators discussed above are indexed by a pair of measures $(\mu_{0},\mu_{1})$ . To simplify notation, when $\mu_{1}=\mu_{0}=\mu$ we suppress one of the indices and we write $(Q^{\mu}_{s,t},{\cal Q}^{[n],\mu}_{s,t})$ and $(p^{\mu}_{s,t},q^{[n],\mu}_{s,t})$ instead of $(Q^{\mu,\mu}_{s,t},{\cal Q}^{[n],\mu,\mu}_{s,t})$ and $(p^{\mu,\mu}_{s,t},q^{[n],\mu,\mu}_{s,t})$ .

4 Tangent processes

The tangent process associated with the diffusion flow $\psi_{s,t}(Y)$ introduced in (1.6) is given for any $U\in\mathbb{H}_{s}(\mathbb{R}^{d})$ by the evolution equation

[TABLE]

In the above display, $\partial B_{t}(X)\in\mbox{\rm Lin}(\mathbb{H}_{t}(\mathbb{R}^{d}),\mathbb{H}_{t}(\mathbb{R}^{d}))$ stands for the Fréchet differential of the drift function $B_{t}$ defined for any $Z\in\mathbb{H}_{t}(\mathbb{R}^{d})$ by

[TABLE]

where $(\overline{X},\overline{Z})$ stands for an independent copy of $(X,Z)$ .

4.1 Spectral estimate

This section is mainly concerned with the proof of theorem 2.1.

For any pair of random variables $Z_{1},Z_{2}\in\mathbb{H}_{t}(\mathbb{R}^{d})$ we have the duality formula

[TABLE]

with the dual operator $\partial B_{t}(X)^{\star}$ defined by the formula

[TABLE]

In the above display, $(\overline{X},\overline{Z}_{1})$ stands for an independent copy of $(X,Z_{1})$ . The symmetric part of $\partial B_{t}(X)$ is given by the formula

[TABLE]

We are now in position to prove theorem 2.1.

The first assertion is a direct consequence of the evolution equation

[TABLE]

Whenever $(H)$ is met we have $\partial B_{t}(X)_{\rm sym}\leq-\lambda_{0}\,I$ for some $\lambda_{0}>0$ . In this situation, the r.h.s. estimate in (2.2) is a direct consequence of (2.1). Given an independent copy $(\overline{X},\overline{Z}_{2})$ of $(X,Z_{2})$ we have

[TABLE]

This yields the log-norm estimate

[TABLE]

The proof of theorem 2.1 is now completed.
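The role of the symmetric part in log-norm estimates of this kind can be seen on a finite-dimensional analogue, which is an assumption made for illustration and not the Hilbert space setting of the theorem. For a linear system $\dot{x}_{t}=A\,x_{t}$ on $\mathbb{R}^{2}$ we have $\partial_{t}\|x_{t}\|^{2}=2\,\langle x_{t},A_{\rm sym}\,x_{t}\rangle$, so that $\|x_{t}\|\leq e^{\mu t}\,\|x_{0}\|$ with $\mu$ the largest eigenvalue of $A_{\rm sym}=(A+A^{\prime})/2$. The sketch below checks this bound along a Runge-Kutta trajectory; the matrix `A` and all function names are hypothetical.

```python
import math

A = [[-2.0, 3.0], [-1.0, -1.5]]   # hypothetical drift matrix, not symmetric

def sym_top_eigenvalue(M):
    """Largest eigenvalue of the symmetric part (M + M^T) / 2 of a 2x2 matrix."""
    a, d = M[0][0], M[1][1]
    b = 0.5 * (M[0][1] + M[1][0])
    return 0.5 * (a + d) + math.sqrt((0.5 * (a - d)) ** 2 + b ** 2)

def apply(M, x):
    return [M[0][0] * x[0] + M[0][1] * x[1], M[1][0] * x[0] + M[1][1] * x[1]]

def rk4_step(M, x, h):
    """One classical Runge-Kutta step for dx/dt = M x."""
    k1 = apply(M, x)
    k2 = apply(M, [x[i] + 0.5 * h * k1[i] for i in range(2)])
    k3 = apply(M, [x[i] + 0.5 * h * k2[i] for i in range(2)])
    k4 = apply(M, [x[i] + h * k3[i] for i in range(2)])
    return [x[i] + h * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6 for i in range(2)]

def norm(x):
    return math.hypot(x[0], x[1])

log_norm = sym_top_eigenvalue(A)   # negative here; plays the role of -lambda_0
x0 = [1.0, 0.5]
x, h, worst_ratio = x0, 1e-3, 0.0
for k in range(1, 3001):
    x = rk4_step(A, x, h)
    bound = math.exp(log_norm * k * h) * norm(x0)
    worst_ratio = max(worst_ratio, norm(x) / bound)
print(log_norm, worst_ratio)
```

A value of `worst_ratio` not exceeding one means that the trajectory stays below the exponential envelope on the whole time grid.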

4.2 Dyson-Phillips expansions

In the further development of this section we shall denote by

[TABLE]

a collection of independent copies of the stochastic flows $(\psi_{s,t},X^{\mu}_{s,t})$ and some given $U,Y\in\mathbb{H}_{s}(\mathbb{R}^{d})$ . To simplify notation, we also set

[TABLE]

We are now in position to state and prove the main result of this section.

Theorem 4.1.

The tangent process $\partial\psi_{s,t}$ is given for any $U\in\mathbb{H}_{s}(\mathbb{R}^{d})$ and any $Y\in\mathbb{H}_{s}(\mathbb{R}^{d})$ with distribution $\mu\in P_{2}(\mathbb{R}^{d})$ by the Dyson-Phillips series

[TABLE]

with the boundary conventions

[TABLE]

Proof.

For any $s\leq u\leq t$ and $x\in\mathbb{R}^{d}$ we have

[TABLE]

and

[TABLE]

In addition, for any $s\leq u\leq t$ and $x_{0},x_{1}\in\mathbb{R}^{d}$ we have

[TABLE]

Combining the above with (4.1) we check that

[TABLE]

In the above display, $\nabla b_{t}\left(\psi_{s,t}(Y),\overline{X}^{\mu}_{s,t}(\,\cdot\,)\right)(\overline{Y})=\nabla h(\overline{Y})$ stands for the gradient of the random function

[TABLE]

Equivalently, we have

[TABLE]

and therefore

[TABLE]

Now, the end of the proof of (4.4) follows a simple induction, thus it is skipped.
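The structure of a Dyson-Phillips series is perhaps easiest to see on a finite-dimensional analogue, which is an assumption made for illustration and not the tangent process of the theorem. For matrices $A$ and $B$, the flow $\Phi_{t}=e^{t(A+B)}$ solves $\Phi_{t}=e^{tA}+\int_{0}^{t}e^{(t-u)A}\,B\,\Phi_{u}\,du$, and iterating this equation gives $\Phi_{t}=\sum_{n\geq 0}\Phi^{(n)}_{t}$ with $\Phi^{(0)}_{t}=e^{tA}$ and $\Phi^{(n+1)}_{t}=\int_{0}^{t}e^{(t-u)A}\,B\,\Phi^{(n)}_{u}\,du$. The sketch below sums the first terms for a diagonal $A$ with a trapezoidal rule and compares with a Taylor series reference; all names and numerical values are hypothetical.

```python
import math

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def axpy(c, X, Y):
    """Return c * X + Y for 2x2 matrices."""
    return [[c * X[i][j] + Y[i][j] for j in range(2)] for i in range(2)]

ZERO = [[0.0, 0.0], [0.0, 0.0]]
IDENTITY = [[1.0, 0.0], [0.0, 1.0]]

def expm_taylor(M, n_terms=40):
    """Reference value of exp(M) from its Taylor series."""
    result, term = IDENTITY, IDENTITY
    for k in range(1, n_terms):
        term = [[v / k for v in row] for row in matmul(term, M)]
        result = axpy(1.0, term, result)
    return result

def dyson_phillips(a, B, t, n_terms=10, n_grid=100):
    """Partial Dyson-Phillips sum for exp(t (A + B)) with A = diag(a)."""
    h = t / n_grid
    exp_A = [[[math.exp(a[0] * k * h), 0.0], [0.0, math.exp(a[1] * k * h)]]
             for k in range(n_grid + 1)]        # exp(k h A) on the time grid
    phi = exp_A                                 # zeroth term: unperturbed flow
    total = phi[-1]
    for _ in range(n_terms):
        new_phi = [ZERO]                        # the integral over [0, 0] vanishes
        for k in range(1, n_grid + 1):
            acc = ZERO
            for j in range(k + 1):              # trapezoidal rule on [0, k h]
                weight = 0.5 * h if j in (0, k) else h
                acc = axpy(weight, matmul(exp_A[k - j], matmul(B, phi[j])), acc)
            new_phi.append(acc)
        phi = new_phi
        total = axpy(1.0, phi[-1], total)
    return total

a, B, t = (-1.0, -2.0), [[0.0, 0.5], [0.3, 0.0]], 1.0
series = dyson_phillips(a, B, t)
A_plus_B = [[(a[0] + B[0][0]) * t, B[0][1] * t],
            [B[1][0] * t, (a[1] + B[1][1]) * t]]
reference = expm_taylor(A_plus_B)
error = max(abs(series[i][j] - reference[i][j])
            for i in range(2) for j in range(2))
print(error)
```

The remaining error comes from the quadrature step and from truncating the series; both can be reduced by increasing `n_grid` and `n_terms`.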

Corollary 4.2.

For any $V\in\mathbb{H}_{t}(\mathbb{R}^{d})$ and for any $Y\in\mathbb{H}_{s}(\mathbb{R}^{d})$ with distribution $\mu\in P_{2}(\mathbb{R}^{d})$ we have

[TABLE]

with the boundary conditions

[TABLE]

4.3 Gradient semigroup analysis

This section is concerned with a gradient semigroup description of the dual of the tangent process.

Definition 4.3.

For any $\mu_{0},\mu_{1}\in P_{2}(\mathbb{R}^{d})$ and $s\leq t$ we let $D_{\mu_{1},\mu_{0}}\phi_{s,t}$ be the operator defined on differentiable functions $f$ on $\mathbb{R}^{d}$ by

[TABLE]

In the above display, $Q^{\mu_{1},\mu_{0}}_{s,t}$ stands for the operator defined in (3.31).

Rewritten in terms of expectation operators we have

[TABLE]

Recall that $b_{t}(x,y)$ is differentiable at any order with uniformly bounded derivatives. Thus, arguing as in the proof of (3.21) and (3.32) for any $m,n\geq 1$ , $\mu_{0},\mu_{1}\in P_{m\vee 2}(\mathbb{R}^{d})$ we have

[TABLE]

In the same vein, we check that

[TABLE]

The proof of the above estimate is rather technical, thus it is housed in the appendix, under the heading Proof of (4.7).

Remark 4.4.

Using the Bismut-Elworthy-Li formula (3.40), we extend the operators $D_{\mu_{1},\mu_{0}}\phi_{s,t}$ with $s<t$ to bounded functions that are not necessarily differentiable.

We also extend the operator $D_{\mu_{1},\mu_{0}}\phi_{s,t}$ to tensor functions $f=(f_{i})_{i\in[n]}$ by considering the tensor function with entries

[TABLE]

In this situation, the function $p^{\mu_{1},\mu_{0}}_{s,t}$ introduced in (3.33) takes the form

[TABLE]

Let $G_{t,\mu_{1}}$ be the collection of integro-differential operators indexed by $\mu_{1}\in P_{2}(\mathbb{R}^{d})$ defined by

[TABLE]

We also set

[TABLE]

In this notation, we have the first order expansion

[TABLE]

Theorem 4.5.

For any $m,n\geq 1$ and any $\mu_{0},\mu_{1}\in P_{m\vee 2}(\mathbb{R}^{d})$ the operator $D_{\mu_{1},\mu_{0}}\phi_{s,t}$ coincides with the evolution semigroup of the integro-differential operator $H_{t,\phi_{s,t}(\mu_{0}),\phi_{s,t}(\mu_{1})}$ ; that is, we have the forward evolution equation

[TABLE]

In addition, for any $s\leq u<t$ we have the backward evolution equation

[TABLE]

Proof.

The proof of the forward equation (4.10) is a direct consequence of the forward evolution equation

[TABLE]

associated with the Markov semigroup $P^{\mu_{0}}_{s,t}$ , thus it is skipped. The semigroup property (2.9) yields

[TABLE]

Combining the above with the forward equation (4.10) we check that

[TABLE]

This implies that

[TABLE]

from which we conclude that

[TABLE]

This yields the backward evolution equation (4.11). This ends the proof of the theorem.

The next proposition is a direct consequence of (4.5) combined with the formulae (3.5) and (3.36).

Proposition 4.6.

We have the commutation formula

[TABLE]

with the $(1,1)$ -tensor integral operator given by the column vector function

[TABLE]

In addition, when condition $(H)$ is satisfied we have

[TABLE]

Remark 4.7.

Following remark 4.4, using the Bismut-Elworthy-Li formula (3.40), we extend the gradient operators $\nabla D_{\mu_{1},\mu_{0}}\phi_{s,t}$ with $s<t$ to measurable and bounded functions. The exponential estimates stated in (4.14) are a direct consequence of the estimates presented in (3.41).

By (4.8) the commutation formula (4.12) is also satisfied for multivariate column functions $f$ . In this situation ${\cal D}_{\mu_{1},\mu_{0}}\phi_{s,t}(\nabla f)$ is a $(d\times d)$ -matrix valued function.

The proof of theorem 2.2 is now a consequence of the estimate (4.14) and the fact that

[TABLE]

More precisely, using (4.9) the above formula implies that

[TABLE]

The operators discussed above are indexed by a pair of measures $(\mu_{0},\mu_{1})$ . To simplify notation, when $\mu_{1}=\mu_{0}=\mu$ we suppress one of the parameters and we write $(D_{\mu}\phi_{s,t},{\cal D}_{\mu}\phi_{s,t})$ instead of $(D_{\mu,\mu}\phi_{s,t},{\cal D}_{\mu,\mu}\phi_{s,t})$ .

Theorem 4.8.

For any $m,n\geq 1$ , any function $f\in{\cal C}^{n}_{m}(\mathbb{R}^{d})$ and any $Y\in\mathbb{H}_{s}(\mathbb{R}^{d})$ with distribution $\mu\in P_{2}(\mathbb{R}^{d})$ we have the gradient formula

[TABLE]

Proof.

Given a smooth function $f$ on $\mathbb{R}^{d}$ we have

[TABLE]

Replacing $V$ by $\nabla f(\psi_{s,t}(Y))$ in (4.4) we check that

[TABLE]

This ends the proof of the theorem.

5 Taylor expansions

This section is mainly concerned with the proof of the first and second order Taylor expansions stated in theorem 2.3 and theorem 2.4. Section 5.1 presents some preliminary differential formulae used in the proof of the theorems.

5.1 Some differential formulae

The commutation formula (4.12) takes the form

[TABLE]

Combining (4.5) with proposition 3.5 and the second order formula (3.7) we also have

[TABLE]

In summary, we have the first and second order differential formulae

[TABLE]

Similar formulae for $\nabla D_{\mu_{0},\mu_{1}}\phi_{s,t}$ and $\nabla^{2}D_{\mu_{0},\mu_{1}}\phi_{s,t}$ can easily be found. In the same vein, using (3.9) we check the third order differential formula

[TABLE]

In addition, when condition $(H)$ is satisfied we have the exponential estimates

[TABLE]

Definition 5.1.

We let $S_{s,t}^{\mu}$ be the operator defined for any differentiable function $f$ on $\mathbb{R}^{d}$ by

[TABLE]

with the $(0,1)$ -tensor integral operator ${\cal S}_{s,t}^{\mu}$ defined by the formula

[TABLE]

Using (4.6) and (4.13) for any $m,n\geq 0$ and $\mu\in P_{m\vee 2}(\mathbb{R}^{d})$ we check that

[TABLE]

We also have the differential formula

[TABLE]

with the matrix valued functions

[TABLE]

When condition $(H)$ is satisfied we also have the exponential estimates

[TABLE]

In addition, using the Bismut-Elworthy-Li extension formulae and the estimates (2.7) and (2.8), for any bounded measurable function $f$ on $\mathbb{R}^{d}$ we check that

[TABLE]

5.2 A first order expansion

This section is mainly concerned with the proof of theorem 2.3. The next technical lemma is pivotal.

Lemma 5.2.

For any $m\geq 1$ for any $\mu_{0},\mu_{1}\in P_{m+1}(\mathbb{R}^{d})$ we have the second order expansion

[TABLE]

Proof.

Combining (4.9) with the backward evolution equation (4.11) we check that

[TABLE]

On the other hand, we have

[TABLE]

Integrating $u$ from $u=s$ to $u=t$ we obtain the formula

[TABLE]

The proof of the lemma is now completed.

Combining the above lemma with (4.7) and (5.5) we check (2.11) with the operator $D^{2}_{\mu_{1},\mu_{0}}\phi_{s,t}$ defined for any $m,n\geq 0$ and $\mu_{0},\mu_{1}\in P_{m+2}(\mathbb{R}^{d})$ by

[TABLE]

Remark 5.3.

The second order term in (2.11) can alternatively be expressed in terms of the Hessian of the semigroup $D^{2}_{\mu_{1},\mu_{0}}\phi_{s,t}$ ; that is, we have that

[TABLE]

with the interpolating path

[TABLE]

In the above display, $(\overline{Y}_{1},\overline{Y}_{0})$ stands for an independent copy of a pair of random variables $(Y_{0},Y_{1})$ with distribution $(\mu_{0},\mu_{1})$ . Also observe that

[TABLE]

with the centered second order operator

[TABLE]

In the above display, $Y_{\epsilon,\overline{\epsilon}}(x_{1},x_{2})$ stands for the interpolating path

[TABLE]

Proposition 5.4.

We have the commutation formula

[TABLE]

In addition, we have the estimate

[TABLE]

Proof.

The proof of the first assertion is a consequence of the commutation formula (4.12). Letting $h=(\nabla\otimes\nabla)g$ we have

[TABLE]

The proof of (5.12) now follows the same arguments as the ones we used in the proof of (4.14), thus it is skipped. This ends the proof of the proposition.

Combining (5.6) with the commutation formula (5.11), for any twice differentiable function $f$ and any $s\leq t$ and $\mu_{0},\mu_{1}\in P_{2}(\mathbb{R}^{d})$ we check that

[TABLE]

with the operators $\mathbb{S}^{[2,k],\mu}_{s,t}$ discussed in (5.6). The proof of (2.12) is a direct consequence of (5.7) and (5.12). The proof of theorem 2.3 is now completed.

5.3 Second order analysis

This short section is mainly concerned with the proof of the first part of theorem 2.4.

Lemma 5.5.

For any $m\geq 1$ and $\mu_{0},\mu_{1}\in P_{m+3}(\mathbb{R}^{d})$ and $s\leq t$ we have the tensor product formula

[TABLE]

for some third order linear operator ${\cal R}_{\mu_{1},\mu_{0}}\phi_{s,t}$ such that

[TABLE]

The proof of the above lemma is rather technical, thus it is housed in the appendix, under the heading Proof of lemma 5.5.

Combining the above lemma with (5.8) we readily check the second order decomposition (2.14) with the remainder linear operator $D^{3}_{\mu_{0},\mu_{1}}\phi_{s,t}$ such that

[TABLE]

This ends the proof of the first part of theorem 2.4. The proof of the second part of the theorem is provided in the appendix, under the heading Proof of the estimate (2.15).

Acknowledgements

The authors are supported by the ANR Quamprocs on quantitative analysis of metastable processes. P. Del Moral is also supported in part from the Chair Stress Test, RISK Management and Financial Steering, led by the French Ecole polytechnique and its Foundation and sponsored by BNP Paribas.

Appendix

Proof of (2.22)

It is easy to check that the first assertion is true for any collection of generators $L_{t,\mu}$ , thus we skip the details. The proof of the second assertion is also a direct consequence of a more general result which is valid for any collection of generators and for not necessarily symmetric functions.

For any $N\geq 2$ and $x=(x^{i})_{1\leq i\leq N}\in(\mathbb{R}^{d})^{N}$ we set

[TABLE]

We extend $L_{t,\mu}$ to functions $F(x^{1},x^{2})$ on $\mathbb{R}^{2d}$ by setting

[TABLE]

For any function $F(x^{1},x^{2})$ on $\mathbb{R}^{2d}$ we have

[TABLE]

with

[TABLE]

This implies that

[TABLE]

Recalling that

[TABLE]

we conclude that

[TABLE]

with the operator

[TABLE]

Observe that

[TABLE]

This yields the formula

[TABLE]

from which we conclude that

[TABLE]

with the function $\Gamma_{L_{t,m(x)}}(F)$ defined for any $y\in\mathbb{R}^{d}$ by

[TABLE]

The above formula readily implies (2.22) as soon as $L_{t,\mu}$ is the collection of generators associated with the stochastic flow defined in (1.1). This ends the proof of (2.22).

Proof of (3.4)

For any given $1\leq m\leq n$ , we denote by $\Pi_{n,m}$ the set of partitions $\pi=\{\pi_{1},\ldots,\pi_{m}\}$ of the set $\{1,\ldots,n\}$ with $m$ blocks $\pi_{i}$ of size $|\pi_{i}|$ , with $i\in\{1,\ldots,m\}$ . We also denote by $\Pi_{n}$ the set of partitions of the set $\{1,\ldots,n\}$ , by $\flat(\pi)$ the number of blocks in a given partition $\pi$ , and by $\Pi^{+}_{n}$ the subset of partitions $\pi$ s.t. $\flat(\pi)>1$ .
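The partition sets introduced above are easy to enumerate for small $n$. The following minimal sketch (function and variable names are hypothetical) lists $\Pi_{n}$, splits it into the sets $\Pi_{n,m}$ according to the number of blocks, extracts $\Pi^{+}_{n}$, and compares the cardinalities with the Stirling numbers of the second kind, which count the partitions of an $n$-element set into $m$ blocks.

```python
def set_partitions(items):
    """Yield every partition of the list `items` as a list of blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        for k in range(len(partition)):     # add `first` to an existing block
            yield partition[:k] + [[first] + partition[k]] + partition[k + 1:]
        yield [[first]] + partition         # or open a new block

def stirling2(n, m):
    """Number of partitions of an n-element set into m blocks."""
    if n == m:
        return 1
    if m == 0 or m > n:
        return 0
    return m * stirling2(n - 1, m) + stirling2(n - 1, m - 1)

n = 4
all_partitions = list(set_partitions(list(range(1, n + 1))))   # Pi_n
by_blocks = {m: [p for p in all_partitions if len(p) == m]     # Pi_{n,m}
             for m in range(1, n + 1)}
plus = [p for p in all_partitions if len(p) > 1]               # Pi_n^+
print(len(all_partitions), {m: len(v) for m, v in by_blocks.items()}, len(plus))
```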

Let $[n]$ be the set of multiple indexes $i=(i_{1},\ldots,i_{n})\in\{1,\ldots,d\}^{n}$ . For any given $i\in[n]$ and any subset $S=\{j_{1},\ldots,j_{s}\}\subset\{1,\ldots,n\}$ we set

[TABLE]

For any $x=(x^{1},\ldots,x^{d})\in\mathbb{R}^{d}$ and any multiple index $i\in[n]$ we write $\partial_{i}$ instead of $\partial_{x^{i_{1}},\ldots x^{i_{n}}}=\partial_{x^{i_{1}}}\ldots\partial_{x^{i_{n}}}$ the $n$ -th partial derivatives w.r.t. the coordinates $(x^{i_{1}},\ldots x^{i_{n}})$ .

Let $f$ and $X$ be a couple of smooth functions from $\mathbb{R}^{d}$ into itself. In this notation for any $i\in[n]$ and $1\leq j\leq d$ we have the multivariate Faà di Bruno derivation formula

[TABLE]

with the $\pi$ -gradient tensor

[TABLE]

We check the above formula by induction w.r.t. the parameter $n$ . In a more compact form, we have checked the following lemma.

Lemma 5.6.

For any $n\geq 1$ we have the Faà di Bruno derivation formula

[TABLE]

Whenever $X(x)$ is a random function we have

[TABLE]

with the collection of integral operators

[TABLE]
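The partition form of the Faà di Bruno formula can be checked by hand in the scalar case, which is an assumption made for illustration ($d=1$, not the multivariate tensor setting of the lemma). For $f=\exp$ all derivatives of $f$ coincide, so the classical formula reduces to $\partial^{n}e^{X(x)}=e^{X(x)}\sum_{\pi\in\Pi_{n}}\prod_{B\in\pi}\partial^{|B|}X(x)$. The sketch below compares this partition sum with the polynomial $P_{n}$ obtained from the recursion $P_{k+1}=\partial P_{k}+P_{k}\,\partial X$, for the hypothetical choice $X(x)=x^{2}+x^{3}$; all function names are hypothetical.

```python
def set_partitions(items):
    """Yield every partition of the list `items` as a list of blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        for k in range(len(partition)):
            yield partition[:k] + [[first] + partition[k]] + partition[k + 1:]
        yield [[first]] + partition

# Polynomials are coefficient lists in increasing degree.
def poly_add(p, q):
    size = max(len(p), len(q))
    return [(p[k] if k < len(p) else 0) + (q[k] if k < len(q) else 0)
            for k in range(size)]

def poly_mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_diff(p):
    return [k * c for k, c in enumerate(p)][1:] or [0]

def poly_eval(p, x):
    value = 0
    for c in reversed(p):                  # Horner scheme
        value = value * x + c
    return value

X = [0, 0, 1, 1]                           # X(x) = x^2 + x^3

def derivative(p, order):
    for _ in range(order):
        p = poly_diff(p)
    return p

def faa_di_bruno_exp(n, x):
    """Sum over partitions of an n-element set of the product of X^(|block|)(x)."""
    total = 0
    for partition in set_partitions(list(range(n))):
        product = 1
        for block in partition:
            product *= poly_eval(derivative(X, len(block)), x)
        total += product
    return total

def direct_exp(n, x):
    """P_n(x), where d^n/dx^n exp(X) = P_n exp(X) and P_{k+1} = P_k' + P_k X'."""
    P, dX = [1], poly_diff(X)
    for _ in range(n):
        P = poly_add(poly_diff(P), poly_mul(P, dX))
    return poly_eval(P, x)

print([(faa_di_bruno_exp(n, 1), direct_exp(n, 1)) for n in range(1, 6)])
```

Since all coefficients are integers, the two columns can be compared exactly, without any floating point tolerance.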

Using the above lemma we also check the stochastic tensor evolution equation

[TABLE]

with

[TABLE]

In a more compact form we have

[TABLE]

This implies that

[TABLE]

Taking the trace in the above display, we check that

[TABLE]

This yields the rather crude estimate

[TABLE]

from which we check that

[TABLE]

The summation in the above display is taken over all indices $l_{1},\ldots,l_{n-1}$ such that $l_{1}+\ldots+l_{n-1}=m$ and $l_{1}+2l_{2}+\ldots+(n-1)l_{n-1}=n$ and $1<m\leq n$ . Assume that (3.4) has been checked up to rank $(n-1)$ . In this case, we have

[TABLE]

This ends the proof of (3.4).

Proof of (3.22) and (3.24)

We recall the backward formula

[TABLE]

A detailed proof of the above formula based on backward stochastic flows can be found in theorem 3.1 in the article [5]. This implies that

[TABLE]

from which we check that

[TABLE]

This yields the formula

[TABLE]

We conclude that

[TABLE]

This ends the proof of (3.22). For any $s\leq u\leq t$ applying (3.22) to the function $P^{\phi_{s,u}(\mu)}_{u,t}(f)$ we have

[TABLE]

This implies that

[TABLE]

Applying (3.22) to the first term we check that

[TABLE]

We conclude that

[TABLE]

This ends the proof of (3.24).

Proof of (3.34)

We have

[TABLE]

Recalling that

[TABLE]

and using the rather well known exponential formulae

[TABLE]

we check that

[TABLE]

from which we find that

[TABLE]

This ends the proof of (3.34).

Proof of (4.7)

We have the tensor product formula

[TABLE]

We also have

[TABLE]

Recall that $b_{t}(x,y)$ is differentiable at any order with uniformly bounded derivatives. Thus all differentials of the above function w.r.t. the coordinate $x$ have uniformly bounded derivatives. On the other hand, the mapping $x\mapsto b_{t}(x,y)$ has at most linear growth. Thus, using the estimates (1.15) and (3.4), for any $m\geq 0$ we check that

[TABLE]

In the same vein, we have the tensor product formula

[TABLE]

with

[TABLE]

Arguing as above and using the estimates (1.15) and (3.4) for any $m\geq 0$ we check that

[TABLE]

Proof of lemma 5.5

Using the decomposition

[TABLE]

which is valid for any $\mu_{0},\mu_{1}\in P_{2}(\mathbb{R}^{d})$ and any $u=(u_{1},\ldots,u_{n})\in[s,t]_{n}$ with $n\geq 1$ , for any function

[TABLE]

we check that

[TABLE]

with the function

[TABLE]

In the above display, $\Upsilon^{\mu_{1},\mu_{0}}_{s,t}$ stands for the tensor product measures

[TABLE]

We also have the tensor product formula

[TABLE]

This yields the decomposition

[TABLE]

with the integral operator

[TABLE]

Arguing as in the proof of (3.21) and (4.6) we check that

[TABLE]

In the same vein, we have

[TABLE]

with

[TABLE]

and

[TABLE]

This yields the formula

[TABLE]

with the integral operator

[TABLE]

Arguing as above, we check that

[TABLE]

Combining the above decompositions we find that

[TABLE]

For any $n\geq 2$ and $m\geq 0$ we have

[TABLE]

We conclude that

[TABLE]

with the operator

[TABLE]

In the above display, ${\cal L}_{s,u,v,t}^{\mu_{0},\mu_{1}}$ stands for the integral operator

[TABLE]

We also check that

[TABLE]

This ends the proof of the lemma.

Proof of the estimate (2.15)

For any $x=(x_{1},x_{2})\in\mathbb{R}^{2d}$ we set $\sigma(x_{1},x_{2}):=(x_{2},x_{1})$ . In this notation, for any matrix valued function $h(x)=(h_{i,j}(x))_{1\leq i,j\leq d}$ we have the tensor product formula

[TABLE]

with the matrix valued functions $\mathbb{I}^{\,\mu_{0}}_{s,u,t}(h)$ and $\mathbb{J}^{\,\mu_{0}}_{s,u,v,t}(h)$ given for any $(u,y)\in\Delta^{n}_{s,t}$ and $(v,z)\in\Delta^{m}_{s,t}$ by the formula

[TABLE]

Using (3.7) we have

[TABLE]

from which we check the formula

[TABLE]

By symmetry arguments, we also have

[TABLE]

Using (3.20) for any differentiable matrix valued function $h(x_{1},x_{2})$ such that $\|h\|\vee\|\nabla_{x_{1}}h\|\leq 1$ we have the uniform estimate

[TABLE]

In the same vein, we have

[TABLE]

Using the gradient and the Hessian estimates (3.2) and (3.3) for any $1\leq k\leq n$ we check that

[TABLE]

Combining the above estimates with (3.38) we check that

[TABLE]

In addition, for any $1\leq k<n$ we have

[TABLE]

We conclude that

[TABLE]

Arguing as above, for any $1\leq k<n$ we have

[TABLE]

In addition, for $k=n$ we have

[TABLE]

This implies that

[TABLE]

On the other hand, we have the decomposition

[TABLE]

with the matrix valued function

[TABLE]

Using the estimates (5.17) and (5.18), for any $(u,y)\in\Delta^{n}_{s,t}$ we check that

[TABLE]

Using the decomposition (5.16) we also check that

[TABLE]

with the matrix valued function

[TABLE]

Using (5.19) we find the uniform estimates

[TABLE]

On the other hand, using (4.5) and (2.5) we have

[TABLE]

Thus, recalling that

[TABLE]

we check that

[TABLE]

This implies that

[TABLE]

with the tensor integral operator

[TABLE]

On the other hand, using (5.10)

[TABLE]

with the interpolating path

[TABLE]

and

[TABLE]

In the above display, $(\overline{Y}^{i}_{1},\overline{Y}^{i}_{0})_{i=1,2,3}$ stands for independent copies of a pair of random variables $(Y_{0},Y_{1})$ with distribution $(\mu_{0},\mu_{1})$ .

Using the commutation formula (3.5) we check that

[TABLE]

Using (5.20) for any differentiable matrix valued function $h(x_{1},x_{2})$ such that $\|h\|\vee\|\nabla_{x_{1}}h\|\leq 1$ and for any $\epsilon\in]0,1[$ we check that

[TABLE]

On the other hand, we have

[TABLE]

with the $\star$ -tensor product

[TABLE]

Using (5.3) we check that

[TABLE]

We conclude that for any function $f\in{\cal C}^{3}(\mathbb{R}^{d})$ s.t. $\sup_{k=1,2,3}\|\nabla^{k}f\|\leq 1$

[TABLE]

The last assertion comes from the formula

[TABLE]

Proof of theorem 2.6

We extend the operators $D^{k}_{\mu_{1},\mu_{0}}\phi_{s,t}$ introduced in theorem 2.4 to tensor functions $f=(f_{i})_{i\in[n]}$ by considering the tensor function with entries

[TABLE]

By theorem 2.4 we have

[TABLE]

with the functions

[TABLE]

We also write $d_{s,t}^{[k],\mu}$ instead of $d_{s,t}^{[k],\mu,\mu}$ . Using (2.12) and (4.14) we check that

[TABLE]

as well as

[TABLE]

Using (2.15) we also have

[TABLE]

On the other hand, we have the second order expansions

[TABLE]

In the same vein, we have

[TABLE]

This implies that

[TABLE]

with the second order remainder term

[TABLE]

and the third order remainder term

[TABLE]

Combining (3.4) with (2.4) and (2.17) for any $k=1,2$ we check the uniform estimate

[TABLE]

We check (2.19) using (5.23) and (5.22).

Using (5.3) we also have the estimate

[TABLE]

Observe that

[TABLE]

with the matrix valued functions

[TABLE]

We also write $d_{s,t}^{[1,1],\mu}$ instead of $d_{s,t}^{[1,1],\mu,\mu}$ . Observe that

[TABLE]

with

[TABLE]

Observe that

[TABLE]

This yields the second order decomposition (2.20) with the remainder term

[TABLE]

The end of the proof is now a consequence of the estimates (5.24), (5.25) and (5.26). The proof of the theorem is completed.

Bibliography

[1] V.M. Alekseev. An estimate for the perturbations of the solutions of ordinary differential equations. II. Vestnik Moskov. Univ. Ser. I Mat. Mech., vol. 3, pp. 3–10 (1961).
[2] L. Ambrosio, N. Gigli. Construction of parallel transport in the Wasserstein space. Methods and Applications of Analysis, vol. 15, no. 1, pp. 1–30 (2008).
[3] L. Ambrosio, N. Gigli, G. Savaré. Gradient flows in metric spaces and in spaces of probability measures. Birkhäuser (2005).
[4] M. Arnaudon, P. Del Moral. A duality formula and a particle Gibbs sampler for continuous time Feynman-Kac measures on path spaces. arXiv:1805.05044 (2018).
[5] M. Arnaudon, P. Del Moral. A variational approach to nonlinear and interacting diffusions. arXiv:1812.04269 (2018). Stochastic Analysis and Applications, DOI: 10.1080/07362994.2019.1609985 (2019).
[6] M. Arnaudon, H. Plank, A. Thalmaier. A Bismut type formula for the Hessian of heat semigroups. C. R. Math. Acad. Sci. Paris, vol. 336, no. 8, pp. 661–666 (2003).
[7] D. Benedetto, E. Caglioti, M. Pulvirenti. A kinetic equation for granular media. RAIRO Modèl. Math. Anal. Numér., vol. 31, no. 5, pp. 615–641 (1997).
[8] D. Benedetto, E. Caglioti, J.A. Carrillo, M. Pulvirenti. A non-Maxwellian steady distribution for one-dimensional granular media. J. Statist. Phys., vol. 91, pp. 979–990 (1998).
