A variational approach to nonlinear and interacting diffusions

Marc Arnaudon (IMB); Pierre Del Moral (CMAP; CQFD)

arXiv:1812.04269·math.PR·January 30, 2019

TL;DR

This paper introduces a new variational calculus framework combining multiple techniques to analyze stability and chaos propagation in nonlinear, interacting diffusions across diverse stochastic models and manifolds.

Contribution

It develops a unified variational approach integrating gradient flows, stochastic interpolations, and spectral theory for nonlinear diffusions, including on manifolds, with new exponential contraction results.

Findings

01

First exponential contraction inequalities for this class of models.

02

Uniform propagation of chaos over time.

03

Applications to fluid mechanics and granular media.

Abstract

The article presents a novel variational calculus to analyze the stability and the propagation of chaos properties of nonlinear and interacting diffusions. This differential methodology combines gradient flow estimates with backward stochastic interpolations, Lyapunov linearization techniques as well as spectral theory. This framework applies to a large class of stochastic models including non homogeneous diffusions, as well as stochastic processes evolving on differentiable manifolds, such as constraint-type embedded manifolds on Euclidian spaces and manifolds equipped with some Riemannian metric. We derive uniform as well as almost sure exponential contraction inequalities at the level of the nonlinear diffusion flow, yielding what seems to be the first result of this type for this class of models. Uniform propagation of chaos properties w.r.t. the time parameter are also provided.…

Equations

$$\partial_{t}X_{s,t}(x)=b_{t}(X_{s,t}(x))\quad\Longrightarrow\quad\partial_{t}\nabla X_{s,t}(x)=\nabla X_{s,t}(x)\,\nabla b_{t}(X_{s,t}(x))\quad\mbox{with}\quad\nabla X_{s,s}(x)=I$$

$$-\int_{s}^{t}\rho(-\nabla b_{u}(X_{s,u}(x)))\,du\leq\log\|\nabla X_{s,t}(x)\|_{2}\leq\int_{s}^{t}\rho(\nabla b_{u}(X_{s,u}(x)))\,du$$

$$\begin{array}{l}\displaystyle X_{s,t}(x)-X_{s,t}(y)=\int_{0}^{1}\langle\nabla X_{s,t}(\epsilon x+(1-\epsilon)y),(x-y)\rangle\,d\epsilon\\ \\ \displaystyle\Longrightarrow\|X_{s,t}(x)-X_{s,t}(y)\|\leq e^{-\lambda(t-s)}\,\|x-y\|\end{array}$$

$$\mathbb{W}_{2}(\eta,\mu)=\inf\,\mathbb{E}\left(\|X-Y\|^{2}\right)^{1/2}$$

$$dX_{s,t}^{\mu}(x)=b_{t}\left(\phi_{s,t}(\mu),X_{s,t}^{\mu}(x)\right)dt+\sigma_{t}\left(\phi_{s,t}(\mu),X_{s,t}^{\mu}(x)\right)dW_{t}$$

$$\phi_{s,t}(\mu)(dy)=\mu P_{s,t}^{\mu}(dy):=\int\mu(dx)\,P_{s,t}^{\mu}(x,dy)\quad\mbox{with}\quad P_{s,t}^{\mu}(x,dy):=\mathbb{P}\left(X_{s,t}^{\mu}(x)\in dy\right)$$

$$b_{t}(\eta,y):=\int\eta(dx)\,b_{t}(x,y)\quad\mbox{and}\quad\sigma_{t}(\eta,y):=\int\eta(dx)\,\sigma_{t}(x,y)$$

$$d\xi_{s,t}^{i}(z)=b_{t}\left(m(\xi_{s,t}^{i}(z)),\xi_{s,t}^{i}(z)\right)dt+\sigma_{t}\left(m(\xi_{s,t}^{i}(z)),\xi_{s,t}^{i}(z)\right)dW_{t}^{i}$$

$$m(\xi_{s,t}^{i}(z)):=\frac{1}{N}\sum_{1\leq j\leq N}\delta_{\xi_{s,t}^{j}(z)}$$

$$r=d\qquad\sigma(x,y)=\sigma_{0}\,I\quad\mbox{and}\quad b(x,y)=-\nabla U(y)-\nabla V(y-x)$$

$$\|X_{s,t}^{\eta}(x)-X_{s,t}^{\mu}(y)\|\leq\|\nabla^{2}V\|_{2}\,(t-s)\,e^{-\lambda(t-s)}\,\mathbb{W}_{2}(\eta,\mu)+e^{-\lambda(t-s)}\,\|x-y\|$$

$$Z_{\epsilon}:=(1-\epsilon)Z_{0}+\epsilon Z_{1}\qquad\mu_{\epsilon}:=\mbox{Law}(Z_{\epsilon})\quad\mbox{and}\quad X_{s,t}^{\epsilon}:=X_{s,t}^{\mu_{\epsilon}}(Z_{\epsilon})$$

$$\left[\phi_{s,t}(\mu_{1})-\phi_{s,t}(\mu_{0})\right](f)=\int_{0}^{1}\partial_{\epsilon}\phi_{s,t}(\mu_{\epsilon})(f)\,d\epsilon$$

$$\partial_{\epsilon}\phi_{s,t}(\mu_{\epsilon})(f):=\mathbb{E}\left(\langle\partial_{\epsilon}X_{s,t}^{\epsilon},\nabla f(X_{s,t}^{\epsilon})\rangle\right)\quad\mbox{s.t.}\quad\left|\partial_{\epsilon}\phi_{s,t}(\mu_{\epsilon})(f)\right|\leq e^{-\lambda(t-s)}\,\|\nabla f\|$$

$$\|\nabla\xi_{s,t}(z)\|_{2}\leq e^{-\lambda(t-s)}$$

$$\begin{array}{l}\displaystyle\nabla X^{\mu}_{s,t}(x):=\left(\nabla X^{1,\mu}_{s,t}(x),\ldots,\nabla X^{d,\mu}_{s,t}(x)\right)\\ \\ \displaystyle\Longrightarrow d\,\nabla X^{\mu}_{s,t}(x)=\nabla X^{\mu}_{s,t}(x)\left[\nabla b_{t}\left(\phi_{s,t}(\mu),X^{\mu}_{s,t}(x)\right)dt+\sum_{1\leq k\leq r}\nabla\sigma_{t,k}\left(\phi_{s,t}(\mu),X^{\mu}_{s,t}(x)\right)dW^{k}_{t}\right]\end{array}$$

$$A_{t}(x,y):=\nabla_{y}b_{t}(x,y)+\nabla_{y}b_{t}(x,y)^{\prime}+\sum_{1\leq k\leq r}\nabla_{y}\sigma_{k,t}(x,y)\,\nabla_{y}\sigma_{k,t}(x,y)^{\prime}\leq-2\lambda_{A}\,I$$

$$(H_{A})\Longrightarrow\mathbb{E}\left(\|\nabla X_{s,t}^{\mu}(x)\|_{2}^{2}\right)^{1/2}\leq\mathbb{E}\left(\|\nabla X_{s,t}^{\mu}(x)\|_{F}^{2}\right)^{1/2}\leq d\,e^{-\lambda_{A}(t-s)}$$

$$(H_{A})\quad\mbox{and}\quad\nabla_{y}\sigma_{k,t}(x,y)=0\Longrightarrow\|\nabla X_{s,t}^{\mu}(x)\|_{2}\leq e^{-\lambda_{A}(t-s)}$$

$$(H_{A})\Longleftrightarrow\nabla^{2}U(y)+\nabla^{2}V(y-x)\geq\lambda_{A}\,I\Longrightarrow\|\nabla X_{s,t}^{\mu}(x)\|_{2}\leq e^{-\lambda_{A}(t-s)}$$

$$\mathbb{E}\left(\|X_{s,t}^{\mu}(x)-X_{s,t}^{\mu}(y)\|^{2}\right)^{1/2}\leq d\,e^{-\lambda_{A}(t-s)}\,\|x-y\|$$

$$\nabla_{y}\sigma_{k,t}=0\Longrightarrow\|X_{s,t}^{\mu}(x)-X_{s,t}^{\mu}(y)\|\leq e^{-\lambda_{A}(t-s)}\,\|x-y\|$$

$$\mathbb{W}_{2}\left(\eta_{0}P_{s,t}^{\mu},\eta_{1}P_{s,t}^{\mu}\right)\leq c\,\exp\left[-\lambda_{A}(t-s)\right]\mathbb{W}_{2}(\eta_{0},\eta_{1})$$

$$B_{t}(z_{1},z_{2}):=\left[\begin{array}{cc}\nabla_{y}b_{t}\left(z_{2},z_{1}\right)&\nabla_{x}b_{t}\left(z_{1},z_{2}\right)\\ \nabla_{x}b_{t}\left(z_{2},z_{1}\right)&\nabla_{y}b_{t}\left(z_{1},z_{2}\right)\end{array}\right]\quad D_{t}:=\sum_{1\leq k\leq r}\left[\begin{array}{cc}\nabla_{x}\sigma_{t,k}\,\nabla_{x}\sigma_{t,k}^{\prime}&\nabla_{x}\sigma_{t,k}\,\nabla_{y}\sigma_{t,k}^{\prime}\\ \nabla_{y}\sigma_{t,k}\,\nabla_{x}\sigma_{t,k}^{\prime}&\nabla_{y}\sigma_{t,k}\,\nabla_{y}\sigma_{t,k}^{\prime}\end{array}\right]$$

$$C_{t}(x,y):=\frac{1}{2}\left[B_{t}(x,y)+B_{t}(x,y)^{\prime}\right]+D_{t}(x,y)\leq-\lambda_{C}\,I$$

$$X_{s,t}^{\epsilon}:=X_{s,t}^{\mu_{\epsilon}}(Z_{\epsilon})\quad\mbox{and}\quad Y_{s,t}^{\epsilon}:=Y_{s,t}^{\mu_{\epsilon}}(\overline{Z}_{\epsilon})$$

$$dY_{s,t}^{\epsilon}=\mathbb{E}_{X}\left[b_{t}(X_{s,t}^{\epsilon},Y_{s,t}^{\epsilon})\right]dt+\mathbb{E}_{X}\left[\sigma_{t}(X_{s,t}^{\epsilon},Y_{s,t}^{\epsilon})\right]d\overline{W}_{t}$$

$$\begin{array}{l}\displaystyle d\left[\partial_{\epsilon}Y^{\epsilon}_{s,t}\right]=\mathbb{E}_{X}\left[\nabla_{x}b_{t}\left(X^{\epsilon}_{s,t},Y^{\epsilon}_{s,t}\right)^{\prime}\partial_{\epsilon}X^{\epsilon}_{s,t}+\nabla_{y}b_{t}\left(X^{\epsilon}_{s,t},Y^{\epsilon}_{s,t}\right)^{\prime}\partial_{\epsilon}Y^{\epsilon}_{s,t}\right]dt\\ \\ \qquad\qquad\displaystyle+\sum_{1\leq k\leq r}\mathbb{E}_{X}\left[\nabla_{x}\sigma_{t,k}\left(X^{\epsilon}_{s,t},Y^{\epsilon}_{s,t}\right)^{\prime}\partial_{\epsilon}X^{\epsilon}_{s,t}+\nabla_{y}\sigma_{t,k}\left(X^{\epsilon}_{s,t},Y^{\epsilon}_{s,t}\right)^{\prime}\partial_{\epsilon}Y^{\epsilon}_{s,t}\right]d\overline{W}^{k}_{t}\end{array}$$

$$\partial_{\epsilon}Y_{s,s}^{\epsilon}=\partial_{\epsilon}\overline{Z}_{\epsilon}=\overline{Z}_{1}-\overline{Z}_{0}$$

$$\partial_{t}\,\mathbb{E}\left[\left\|\partial_{\epsilon}Y^{\epsilon}_{s,t}\right\|^{2}\right]\leq\mathbb{E}\left(\left[\partial_{\epsilon}X^{\epsilon}_{s,t},\partial_{\epsilon}Y^{\epsilon}_{s,t}\right]^{\prime}C_{t}\left(X^{\epsilon}_{s,t},Y^{\epsilon}_{s,t}\right)\left[\begin{array}{c}\partial_{\epsilon}X^{\epsilon}_{s,t}\\ \partial_{\epsilon}Y^{\epsilon}_{s,t}\end{array}\right]\right)$$

Full text

A variational approach to nonlinear and interacting diffusions

M. Arnaudon

Institut de Mathématiques de Bordeaux (IMB), Bordeaux University, France

P. Del Moral

INRIA, Bordeaux Research Center & CMAP, Polytechnique Palaiseau, France

P. Del Moral was supported in part by funding from the Chaire Stress Test, BNP Paribas SFTS and CMAP, Polytechnique Palaiseau, France.

Abstract

The article presents a novel variational calculus to analyze the stability and the propagation of chaos properties of nonlinear and interacting diffusions. This differential methodology combines gradient flow estimates with backward stochastic interpolations, Lyapunov linearization techniques as well as spectral theory. This framework applies to a large class of stochastic models including non homogeneous diffusions, as well as stochastic processes evolving on differentiable manifolds, such as constraint-type embedded manifolds on Euclidian spaces and manifolds equipped with some Riemannian metric. We derive uniform as well as almost sure exponential contraction inequalities at the level of the nonlinear diffusion flow, yielding what seems to be the first result of this type for this class of models. Uniform propagation of chaos properties w.r.t. the time parameter are also provided. Illustrations are provided in the context of a class of gradient flow diffusions arising in fluid mechanics and granular media literature. The extended versions of these nonlinear Langevin-type diffusions on Riemannian manifolds are also discussed.

Keywords : Nonlinear diffusions, mean field particle systems, variational equations, logarithmic norms, gradient flows, contraction inequalities, Wasserstein distance, Riemannian manifolds.

Mathematics Subject Classification : 65C35, 82C80, 58J65, 47J20.

1 Introduction

1.1 Description of the models

We denote by $\|A\|_{2}:=\lambda_{\max}(AA^{\prime})^{1/2}$, resp. $\|A\|_{F}=\mbox{\rm Tr}(AA^{\prime})^{1/2}$ and $\rho(A)=\lambda_{\max}((A+A^{\prime})/2)$, the spectral norm, the Frobenius norm, and the logarithmic norm of some matrix $A$, where $A^{\prime}$ stands for the transpose of $A$, and $\lambda_{\max}(\cdot)$ for the maximal eigenvalue. With a slight abuse of notation, we denote by $I$ the identity $(d\times d)$-matrix, for any $d\geq 1$.
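As a numerical companion to these definitions, the following minimal sketch evaluates the three quantities with NumPy. The function names and the test matrix are illustrative assumptions, not notation from the article.

```python
import numpy as np

def spectral_norm(A):
    # ||A||_2 = lambda_max(A A')^{1/2}
    return float(np.sqrt(np.linalg.eigvalsh(A @ A.T).max()))

def frobenius_norm(A):
    # ||A||_F = Tr(A A')^{1/2}
    return float(np.sqrt(np.trace(A @ A.T)))

def log_norm(A):
    # rho(A) = lambda_max((A + A') / 2); it can be negative, unlike a norm
    return float(np.linalg.eigvalsh((A + A.T) / 2.0).max())

# Hypothetical example matrix (not taken from the article).
A = np.array([[-1.0, 2.0], [0.0, -3.0]])
print(spectral_norm(A), frobenius_norm(A), log_norm(A))
```

For this matrix the logarithmic norm is negative although both norms are positive, which is the feature exploited in the stability estimates.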

Let $b_{t}$ be some time varying differentiable vector field with Jacobian matrix $\nabla b_{t}$ on $\mathbb{R}^{d}$ , for some parameter $d\geq 1$ . Consider the deterministic flow $t\in[s,\infty[\mapsto X_{s,t}(x)$ starting at $X_{s,s}(x)=x$ associated with the evolution equation

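$$\partial_{t}X_{s,t}(x)=b_{t}(X_{s,t}(x))\quad\Longrightarrow\quad\partial_{t}\nabla X_{s,t}(x)=\nabla X_{s,t}(x)\,\nabla b_{t}(X_{s,t}(x))\quad\mbox{with}\quad\nabla X_{s,s}(x)=I\tag{1.1}$$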

The r.h.s. equation is often called the first order variational equation associated with the flow $X_{s,t}(x)$ along the trajectory $X_{s,t}(x)$ . This equation plays a central role in the sensitivity analysis of nonlinear dynamical systems w.r.t. their initial conditions. For instance, the spectral norm of $\nabla X_{s,t}(x)$ can be estimated in terms of the logarithmic norm using the inequalities

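$$-\int_{s}^{t}\rho(-\nabla b_{u}(X_{s,u}(x)))\,du\leq\log\|\nabla X_{s,t}(x)\|_{2}\leq\int_{s}^{t}\rho(\nabla b_{u}(X_{s,u}(x)))\,du\tag{1.2}$$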

A proof of this assertion can be found in [14], see also [27] for extensions to Lipschitz functions on Banach spaces. Whenever $\rho\left(\nabla b_{u}(x)\right)\leq-\lambda$ for some $\lambda>0$ , the r.h.s. estimate in (1.2) readily implies the exponential stability estimate

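$$\begin{array}{l}\displaystyle X_{s,t}(x)-X_{s,t}(y)=\int_{0}^{1}\langle\nabla X_{s,t}(\epsilon x+(1-\epsilon)y),(x-y)\rangle\,d\epsilon\\ \\ \displaystyle\Longrightarrow\|X_{s,t}(x)-X_{s,t}(y)\|\leq e^{-\lambda(t-s)}\,\|x-y\|\end{array}\tag{1.3}$$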
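The exponential stability estimate can be checked numerically on a toy example. The sketch below is only an illustration under stated assumptions: the drift $b(x)=-x-x^{3}$ (componentwise) is a hypothetical choice with $\rho(\nabla b(x))\leq-1$, so that $\lambda=1$, and the flow is approximated by an explicit Euler scheme with a small step.

```python
import numpy as np

def b(x):
    # Hypothetical drift b(x) = -x - x^3 (componentwise); its Jacobian is
    # diag(-1 - 3 x_i^2), hence rho(grad b(x)) <= -1, i.e. lambda = 1.
    return -x - x**3

def flow(x, t, h=1e-3):
    # Explicit Euler approximation of the flow X_{0,t}(x).
    x = np.array(x, dtype=float)
    for _ in range(int(round(t / h))):
        x = x + h * b(x)
    return x

lam, t = 1.0, 2.0
x0, y0 = np.array([1.5, -0.5]), np.array([-1.0, 2.0])
gap = np.linalg.norm(flow(x0, t) - flow(y0, t))
bound = np.exp(-lam * t) * np.linalg.norm(x0 - y0)
print(gap, bound)  # the gap should not exceed the bound
```

With this step size each Euler step contracts the distance by a factor of at most $1-h\leq e^{-h}$, so the discrete scheme inherits the continuous-time bound.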

The linearization technique discussed above is often referred to as the Lyapunov first or indirect method to analyze the stability of nonlinear dynamical systems. For a more thorough discussion on this subject we refer to the pioneering work by Lyapunov [24], as well as to chapter 4 in the more recent monograph by Khalil [23].

The main objective of this article is to extend these results to nonlinear diffusions and their mean field particle interpretations on Euclidean spaces as well as on differentiable manifolds. The differential analysis of conventional diffusions w.r.t. initial conditions is also one of the stepping stones of Bismut and Malliavin calculus. This framework is mainly designed to study the existence and the properties of smooth probability densities in terms of the differential properties of the diffusion semigroup. For a more thorough discussion on this subject we refer to [13, 30], and references therein.

The relevant mathematical apparatus for the description and the variational analysis of stochastic processes on manifolds being technically more sophisticated than conventional differential calculus, this introduction only discusses nonlinear and interacting diffusions on Euclidean spaces. The extended versions of these models on Riemannian manifolds are discussed in some detail in section 3.2, as well as in section 4.3.

Let ${\cal P}_{2}(\mathbb{R}^{d})$ be the set of Borel probability measures on $\mathbb{R}^{d}$ with finite second absolute moment, equipped with the $2$ -Wasserstein distance given by

$$\mathbb{W}_{2}(\eta,\mu)=\inf\,\mathbb{E}\left(\|X-Y\|^{2}\right)^{1/2}$$

In the above display, the infimum is taken over all pairs of random variables $(X,Y)$ with respective distributions $\eta$ and $\mu\in{\cal P}_{2}(\mathbb{R}^{d})$; and $\|X-Y\|$ stands for the Euclidean distance between $X$ and $Y$ on $\mathbb{R}^{d}$.
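In dimension one the infimum over couplings is attained by the monotone coupling, which makes the distance easy to compute between two empirical measures with the same number of equally weighted atoms. The following sketch relies on that classical fact; the function name `w2_empirical_1d` is an illustrative assumption.

```python
import numpy as np

def w2_empirical_1d(xs, ys):
    # For two empirical measures on the real line with the same number of
    # equally weighted atoms, pairing the sorted samples is an optimal coupling.
    xs = np.sort(np.asarray(xs, dtype=float))
    ys = np.sort(np.asarray(ys, dtype=float))
    if xs.shape != ys.shape:
        raise ValueError("samples must have the same size")
    return float(np.sqrt(np.mean((xs - ys) ** 2)))

rng = np.random.default_rng(0)
sample = rng.normal(size=1000)
# A translation by c moves the empirical measure at distance |c|.
print(w2_empirical_1d(sample, sample + 3.0))
```

In higher dimensions no such shortcut is available and one has to solve an optimal transport problem instead.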

Also let $b_{t}$ and $\sigma_{t}$ be differentiable functions from $\mathbb{R}^{2d}$ into $\mathbb{R}^{d}$ and $\mathbb{R}^{d\times r}$, for some $r\geq 1$; and let $W_{t}$ be an $r$-dimensional Brownian motion. For any $\mu\in{\cal P}_{2}(\mathbb{R}^{d})$ and any time horizon $s\geq 0$ we denote by $X_{s,t}^{\mu}(x)$ the stochastic flow defined for any $t\in[s,\infty[$ and any starting point $x\in\mathbb{R}^{d}$ by the McKean-Vlasov diffusion

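$$dX_{s,t}^{\mu}(x)=b_{t}\left(\phi_{s,t}(\mu),X_{s,t}^{\mu}(x)\right)dt+\sigma_{t}\left(\phi_{s,t}(\mu),X_{s,t}^{\mu}(x)\right)dW_{t}\tag{1.4}$$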

In the above display, $\phi_{s,t}$ stands for the evolution semigroup

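$$\phi_{s,t}(\mu)(dy)=\mu P_{s,t}^{\mu}(dy):=\int\mu(dx)\,P_{s,t}^{\mu}(x,dy)\quad\mbox{with}\quad P_{s,t}^{\mu}(x,dy):=\mathbb{P}\left(X_{s,t}^{\mu}(x)\in dy\right)$$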

We further assume that the mean field drift and diffusion functions are given by

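$$b_{t}(\eta,y):=\int\eta(dx)\,b_{t}(x,y)\quad\mbox{and}\quad\sigma_{t}(\eta,y):=\int\eta(dx)\,\sigma_{t}(x,y)$$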

We shall assume that the nonlinear diffusion flow (1.4) is well defined. For instance, the existence of this flow is ensured as soon as $b_{t}$ and $\sigma_{t}$ are Lipschitz, see for instance [18, 22].

The mean field particle system associated with (1.4) is defined by the stochastic flow $\xi_{s,t}(z)=(\xi_{s,t}^{i}(z))_{1\leq i\leq N}$ of a system of $N$ interacting diffusions

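$$d\xi_{s,t}^{i}(z)=b_{t}\left(m(\xi_{s,t}^{i}(z)),\xi_{s,t}^{i}(z)\right)dt+\sigma_{t}\left(m(\xi_{s,t}^{i}(z)),\xi_{s,t}^{i}(z)\right)dW_{t}^{i}\tag{1.5}$$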

with the empirical measures

$$m(\xi_{s,t}^{i}(z)):=\frac{1}{N}\sum_{1\leq j\leq N}\delta_{\xi_{s,t}^{j}(z)}$$

In the above displayed formulae, $\xi_{s,s}(z)=z=(z^{i})_{1\leq i\leq N}\in(\mathbb{R}^{d})^{N}$ stands for the initial configuration and $W^{i}_{t}$ are $N$ independent copies of $W_{t}$.
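The particle system can be simulated with a standard Euler-Maruyama scheme in which the mean field terms are averages over the current empirical measure. The sketch below is a minimal illustration, assuming scalar particles ($d=r=1$) and time homogeneous coefficients; the function name `simulate_particles` and the example drift and diffusion functions are hypothetical choices.

```python
import numpy as np

def simulate_particles(z, b, sigma, t_grid, rng):
    # z: initial configuration of N scalar particles (d = r = 1 for brevity).
    # b(x, y), sigma(x, y): pairwise drift and diffusion functions (vectorised).
    xi = np.array(z, dtype=float)
    for t0, t1 in zip(t_grid[:-1], t_grid[1:]):
        h = t1 - t0
        x, y = xi[None, :], xi[:, None]       # x: interacting particle j, y: particle i
        drift = b(x, y).mean(axis=1)          # average of b(xi^j, xi^i) over j
        diff = sigma(x, y).mean(axis=1)       # average of sigma(xi^j, xi^i) over j
        dW = rng.normal(scale=np.sqrt(h), size=xi.shape)  # independent increments
        xi = xi + drift * h + diff * dW
    return xi

# Hypothetical example: b(x, y) = -U'(y) - V'(y - x) with U(y) = y^2/2 and
# V(v) = v^2/4, and a constant diffusion coefficient 0.3.
b = lambda x, y: -y - 0.5 * (y - x)
sigma = lambda x, y: 0.3 + 0.0 * (x + y)
rng = np.random.default_rng(1)
xi_T = simulate_particles(rng.normal(size=200), b, sigma,
                          np.linspace(0.0, 5.0, 501), rng)
print(xi_T.mean(), xi_T.std())
```

The pairwise evaluation costs $O(N^{2})$ operations per step; for specific drifts, such as the quadratic one above, the average can be computed in $O(N)$.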

1.2 Statement of some main results and article organisation

To motivate this study, the variational calculus developed in the article is illustrated with the following example

$$r=d\qquad\sigma(x,y)=\sigma_{0}\,I\quad\mbox{and}\quad b(x,y)=-\nabla U(y)-\nabla V(y-x)\tag{1.6}$$

for some $\sigma_{0}>0$, some confinement type potential function $U$ (a.k.a. the exterior potential) and some interaction potential function $V$. This class of nonlinear diffusions and the corresponding particle interpretations were introduced by H. P. McKean in [28, 29]. The extended versions of these Langevin-type nonlinear diffusions on Riemannian manifolds are discussed at the end of section 3.2 as well as in section 4.3.

Nonlinear diffusions (1.4) with constant diffusion and gradient-type drifts (1.6) arise in fluid mechanics, and more particularly in the modeling of granular flows [6, 7, 35, 42]. In this context, $\phi_{s,t}$ represents the evolution semigroup of the velocity of a diffusive particle interacting with the distribution of the particles around its location and following some confinement exterior potential. In this interpretation, the mean field particle model (1.5) can be seen as a particle-type representation of the granular flow.

In the last two decades, the analysis of the long time behavior of this particular class of gradient type flow diffusions has been developed in various directions:

The first articles on the long time behavior of these models are the two articles by Tamura [33, 34]. The study of the stability properties of one-dimensional models was initiated in [4, 5] as well as in [6], see also [9, 11, 35].

Since this period, several sophisticated probabilistic techniques have been developed to analyze the long time behavior of these Langevin-type nonlinear diffusions, including log-Sobolev functional inequalities [25, 26], entropy dissipation [10, 15], as well as gradient flows in Wasserstein metric spaces and optimal transportation inequalities [8, 10, 12, 31], combining the functional $\Gamma_{2}$ Bakry-Emery method [3], with the Otto-Villani approach [32]. The long time self-stabilizing behavior of this class of processes in multi-wells landscapes has also been developed by J. Tugaut in a series of articles [36, 37, 39, 40, 41]. For a more thorough discussion on this subject we refer to the recent article [17], and the references therein.

Unfortunately, most of the probabilistic techniques discussed above only apply to gradient flow type diffusions of the form (1.6). The variational calculus developed in the present article is not restricted to this class of gradient-type nonlinear models. Nevertheless, because of their importance in practice this introduction illustrates some of our main results in this context.

Firstly, and rather surprisingly, the variational methodology developed in the present article applies directly to gradient flow models of the form (1.6), considerably simplifying both their stability analysis and the convergence analysis of their mean field particle interpretations.

This framework also allows one to relax unnecessary technical conditions such as the symmetry of the interaction potential function, or the invariance of the center of mass, currently used in the literature on this subject (see for instance [33], as well as section 2 in [10], and section 1 in the more recent article [8]). It also allows one to derive uniform as well as almost sure exponential stability inequalities at the level of the nonlinear diffusion flow. For instance, when $V$ is an even convex function with bounded Hessian $\|\nabla^{2}V\|_{2}:=\sup_{x}\|\nabla^{2}V(x)\|_{2}<\infty$, and when $\nabla^{2}U\geq\lambda\,I$ for some $\lambda>0$, we have the almost sure estimates

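$$\|X_{s,t}^{\eta}(x)-X_{s,t}^{\mu}(y)\|\leq\|\nabla^{2}V\|_{2}\,(t-s)\,e^{-\lambda(t-s)}\,\mathbb{W}_{2}(\eta,\mu)+e^{-\lambda(t-s)}\,\|x-y\|\tag{1.7}$$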

The above estimate is also met for odd interaction potentials, as soon as $\nabla^{2}U(y)+\nabla^{2}V(y-x)\geq\lambda\,I$. In the above display, it is implicitly assumed that the stochastic flows are driven by the same Brownian motion.

These almost sure inequalities are a direct consequence of the contraction inequality (2.6), the remark (2.15) and the almost sure estimates stated in corollary 3.2.

To the best of our knowledge, the almost sure exponential decays (1.7) are the first results of this type for this class of nonlinear gradient flow diffusions.

Consider a pair of random variables $(Z_{0},Z_{1})$ with distributions $(\mu_{0},\mu_{1})$ on $\mathbb{R}^{d}$ and set

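$$Z_{\epsilon}:=(1-\epsilon)Z_{0}+\epsilon Z_{1}\qquad\mu_{\epsilon}:=\mbox{Law}(Z_{\epsilon})\quad\mbox{and}\quad X_{s,t}^{\epsilon}:=X_{s,t}^{\mu_{\epsilon}}(Z_{\epsilon})\tag{1.8}$$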

Under the assumptions on the potential functions discussed above, for any differentiable function $f$ on $\mathbb{R}^{d}$ with bounded gradient we have the first order differential formula

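$$\left[\phi_{s,t}(\mu_{1})-\phi_{s,t}(\mu_{0})\right](f)=\int_{0}^{1}\partial_{\epsilon}\phi_{s,t}(\mu_{\epsilon})(f)\,d\epsilon\tag{1.9}$$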

with the linear differential operator

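$$\partial_{\epsilon}\phi_{s,t}(\mu_{\epsilon})(f):=\mathbb{E}\left(\langle\partial_{\epsilon}X_{s,t}^{\epsilon},\nabla f(X_{s,t}^{\epsilon})\rangle\right)\quad\mbox{s.t.}\quad\left|\partial_{\epsilon}\phi_{s,t}(\mu_{\epsilon})(f)\right|\leq e^{-\lambda(t-s)}\,\|\nabla f\|$$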

For a more precise statement we refer to theorem 2.2. Almost sure and uniform estimates of the first order differential maps $\epsilon\mapsto\partial_{\epsilon}X^{\epsilon}_{s,t}$ are also provided in theorem 2.3.

Section 4.1 also presents a differential calculus to estimate the gradient $\nabla\xi_{s,t}(z)$ of the stochastic flow $\xi_{s,t}(z)$ of the interacting particle model (1.5). Under the assumptions on the potential functions discussed above, we shall prove the following uniform spectral norm estimate

$$\|\nabla\xi_{s,t}(z)\|_{2}\leq e^{-\lambda(t-s)}$$

The above result is a direct consequence of theorem 4.1. The above estimate ensures that the $N$-particle model converges exponentially fast to its invariant measure with some exponential decay rate that does not depend on the number of particles. The latter property can also be checked using more sophisticated logarithmic Sobolev inequalities [25]. To the best of our knowledge, the almost sure exponential decays stated above are the first results of this type for this class of interacting Langevin-type diffusions.
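A uniform bound on the spectral norm of $\nabla\xi_{s,t}(z)$ implies that two copies of the particle system driven by the same Brownian motions contract at rate $\lambda$, whatever the number of particles. The sketch below illustrates this synchronous coupling on an Euler discretisation, under the assumption of quadratic potentials $U(y)=\lambda y^{2}/2$ and $V(v)=\kappa v^{2}/2$ in dimension one; the function name `coupled_gap` and all parameter values are hypothetical.

```python
import numpy as np

def coupled_gap(z, z_bar, lam=1.0, kappa=0.5, sigma0=0.3, T=3.0, h=1e-2, seed=0):
    # Hypothetical potentials U(y) = lam*y^2/2 and V(v) = kappa*v^2/2 (d = 1),
    # so the mean field drift of particle i is -lam*p_i - kappa*(p_i - mean(p)).
    rng = np.random.default_rng(seed)
    xi, xi_bar = np.array(z, dtype=float), np.array(z_bar, dtype=float)
    drift = lambda p: -lam * p - kappa * (p - p.mean())
    for _ in range(int(round(T / h))):
        dW = rng.normal(scale=np.sqrt(h), size=xi.shape)  # shared noise
        xi = xi + h * drift(xi) + sigma0 * dW
        xi_bar = xi_bar + h * drift(xi_bar) + sigma0 * dW
    return float(np.linalg.norm(xi - xi_bar))

rng = np.random.default_rng(42)
z, z_bar = rng.normal(size=50), rng.normal(size=50) + 1.0
gap = coupled_gap(z, z_bar)
bound = np.exp(-1.0 * 3.0) * np.linalg.norm(z - z_bar)
print(gap, bound)  # the gap should not exceed the bound
```

Because the noise is additive and shared, it cancels in the difference of the two systems, and the remaining linear recursion contracts each step by a factor of at most $1-h\lambda\leq e^{-h\lambda}$.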

Section 4.2 also provides a natural differential calculus to derive quantitative and uniform propagation of chaos estimates for nonlinear diffusions of the form (1.5). Applying these results to interacting Langevin-type diffusions, without further work we recover the uniform estimates stated in theorem 1.2 in [25].

We emphasize that the differential calculus presented in this article allows one to consider nonlinear diffusions evolving in differentiable manifolds. This should not come as a surprise since our framework takes into account the variations of the diffusion matrices associated with these stochastic models, which encapsulate the Riemannian structure of the manifold.

We illustrate these comments at the end of section 2.2 with a rather detailed discussion of an elementary nonlinear geometric-type diffusion. The manifold version of (1.9) is also provided in theorem 3.14.

We also underline that the variational calculus on differentiable manifolds developed in section 3.2 provides another view and additional results for the diffusions in $\mathbb{R}^{d}$ endowed, when possible, with the Riemannian metric under which these diffusions are Brownian motions with drift. In this context, different types of synchronous coupling lead to gradient flow estimates where gradients of the diffusion functions are replaced by Ricci curvatures.

Quantitative propagation of chaos estimates of mean field particle systems on Riemannian manifolds are provided in section 4.3. Special attention is paid to derive uniform estimates w.r.t. the time horizon.

2 Nonlinear diffusion semigroups

2.1 Some gradient flow estimates

This section presents some basic properties of the first variational equation associated with the nonlinear diffusion (1.4). Let $\sigma_{k,t}$ be the $k$ -th column vector of $\sigma_{t}$ , and let $\nabla_{u}b_{t}(x,y)$ and $\nabla_{u}\sigma_{k,t}(x,y)$ be the gradient of the functions $b_{t}(x,y)$ and $\sigma_{k,t}(x,y)$ w.r.t. the coordinate $u\in\{x,y\}$ . We also let $X^{i,\mu}_{s,t}(x)$ be the $i$ -th coordinate of the column vector $X^{\mu}_{s,t}(x)$ . The Jacobian $\nabla X^{\mu}_{s,t}(x)$ of the diffusion flow $X^{\mu}_{s,t}(x)$ is given by the gradient $(d\times d)$ -matrix

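$$\begin{array}{l}\displaystyle\nabla X^{\mu}_{s,t}(x):=\left(\nabla X^{1,\mu}_{s,t}(x),\ldots,\nabla X^{d,\mu}_{s,t}(x)\right)\\ \\ \displaystyle\Longrightarrow d\,\nabla X^{\mu}_{s,t}(x)=\nabla X^{\mu}_{s,t}(x)\left[\nabla b_{t}\left(\phi_{s,t}(\mu),X^{\mu}_{s,t}(x)\right)dt+\sum_{1\leq k\leq r}\nabla\sigma_{t,k}\left(\phi_{s,t}(\mu),X^{\mu}_{s,t}(x)\right)dW^{k}_{t}\right]\end{array}$$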

Consider the regularity condition stated below:

$(H_{A})$: *There exists some $\lambda_{A}\in\mathbb{R}$ such that for any $x,y\in\mathbb{R}^{d}$ and $t\geq 0$ we have*

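$$A_{t}(x,y):=\nabla_{y}b_{t}(x,y)+\nabla_{y}b_{t}(x,y)^{\prime}+\sum_{1\leq k\leq r}\nabla_{y}\sigma_{k,t}(x,y)\,\nabla_{y}\sigma_{k,t}(x,y)^{\prime}\leq-2\lambda_{A}\,I$$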
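At a given point $(t,x,y)$ the best constant in this matrix inequality is $-\lambda_{\max}(A_{t}(x,y))/2$. The following minimal sketch computes it from the gradient matrices; the function name `lambda_A` and the example matrices are illustrative assumptions, and in practice the value has to be minimised over all $(t,x,y)$ to obtain the constant in $(H_{A})$.

```python
import numpy as np

def lambda_A(grad_y_b, grad_y_sigmas=()):
    # A = grad_y b + (grad_y b)' + sum_k grad_y sigma_k (grad_y sigma_k)'
    A = grad_y_b + grad_y_b.T
    for g in grad_y_sigmas:
        A = A + g @ g.T
    # A <= -2 lambda_A I holds with the best constant -lambda_max(A) / 2.
    return float(-np.linalg.eigvalsh(A).max() / 2.0)

I2 = np.eye(2)
# Hypothetical gradients evaluated at one point (not taken from the article).
print(lambda_A(-2.0 * I2))               # no state dependence in the diffusion
print(lambda_A(-2.0 * I2, [0.5 * I2]))   # a state-dependent diffusion lowers the rate
```

The second call shows how the quadratic term in the gradients of the diffusion columns reduces the admissible rate.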

This spectral condition produces several gradient estimates. For instance, we have the following uniform estimate

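$$(H_{A})\Longrightarrow\mathbb{E}\left(\|\nabla X_{s,t}^{\mu}(x)\|_{2}^{2}\right)^{1/2}\leq\mathbb{E}\left(\|\nabla X_{s,t}^{\mu}(x)\|_{F}^{2}\right)^{1/2}\leq d\,e^{-\lambda_{A}(t-s)}\tag{2.2}$$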

In addition, we have the almost sure estimate

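$$(H_{A})\quad\mbox{and}\quad\nabla_{y}\sigma_{k,t}(x,y)=0\Longrightarrow\|\nabla X_{s,t}^{\mu}(x)\|_{2}\leq e^{-\lambda_{A}(t-s)}\tag{2.3}$$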

The proofs of the above assertions are provided in the appendix (see the proof of (2.2) and (2.3)). For the nonlinear Langevin diffusion discussed in (1.6) we have

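$$(H_{A})\Longleftrightarrow\nabla^{2}U(y)+\nabla^{2}V(y-x)\geq\lambda_{A}\,I\Longrightarrow\|\nabla X_{s,t}^{\mu}(x)\|_{2}\leq e^{-\lambda_{A}(t-s)}$$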

Arguing as in (1.3) we readily check the following proposition.

Proposition 2.1.

Assume $(H_{A})$ is satisfied. In this situation, we have

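$$\mathbb{E}\left(\|X_{s,t}^{\mu}(x)-X_{s,t}^{\mu}(y)\|^{2}\right)^{1/2}\leq d\,e^{-\lambda_{A}(t-s)}\,\|x-y\|$$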

In addition, we have the almost sure estimate

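$$\nabla_{y}\sigma_{k,t}=0\Longrightarrow\|X_{s,t}^{\mu}(x)-X_{s,t}^{\mu}(y)\|\leq e^{-\lambda_{A}(t-s)}\,\|x-y\|\tag{2.6}$$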

Whenever $\lambda_{A}>0$ the above estimates ensure that the transition semigroup $P^{\mu}_{s,t}$ is exponentially stable, in the sense that

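$$\mathbb{W}_{2}\left(\eta_{0}P_{s,t}^{\mu},\eta_{1}P_{s,t}^{\mu}\right)\leq c\,\exp\left[-\lambda_{A}(t-s)\right]\mathbb{W}_{2}(\eta_{0},\eta_{1})$$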

These contraction inequalities quantify the stability of the stochastic flow $X^{\mu}_{s,t}(x)$ w.r.t. the initial state $x$, but they do not give any information on the stability properties of the nonlinear semigroup $\phi_{s,t}(\mu)$ w.r.t. the initial measure $\mu$.

2.2 A first order differential calculus

This section presents a natural first order differential calculus to analyze the stability properties of the nonlinear semigroup $\phi_{s,t}(\mu)$ . Consider the matrices

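$$B_{t}(z_{1},z_{2}):=\left[\begin{array}{cc}\nabla_{y}b_{t}\left(z_{2},z_{1}\right)&\nabla_{x}b_{t}\left(z_{1},z_{2}\right)\\ \nabla_{x}b_{t}\left(z_{2},z_{1}\right)&\nabla_{y}b_{t}\left(z_{1},z_{2}\right)\end{array}\right]\quad D_{t}:=\sum_{1\leq k\leq r}\left[\begin{array}{cc}\nabla_{x}\sigma_{t,k}\,\nabla_{x}\sigma_{t,k}^{\prime}&\nabla_{x}\sigma_{t,k}\,\nabla_{y}\sigma_{t,k}^{\prime}\\ \nabla_{y}\sigma_{t,k}\,\nabla_{x}\sigma_{t,k}^{\prime}&\nabla_{y}\sigma_{t,k}\,\nabla_{y}\sigma_{t,k}^{\prime}\end{array}\right]$$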

In this notation, our second regularity condition takes the following form:

$(H_{C})$: *There exists some $\lambda_{C}\in\mathbb{R}$ such that for any $x,y\in\mathbb{R}^{d}$ and $t\geq 0$ we have*

$$C_{t}(x,y):=\frac{1}{2}\left[B_{t}(x,y)+B_{t}(x,y)^{\prime}\right]+D_{t}(x,y)\leq-\lambda_{C}\,I$$
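For the Langevin-type drift $b(x,y)=-\nabla U(y)-\nabla V(y-x)$ with constant diffusion we have $D_{t}=0$, $\nabla_{y}b=-\nabla^{2}U-\nabla^{2}V$ and $\nabla_{x}b=\nabla^{2}V$, so the block matrix can be assembled explicitly. The sketch below does this under the additional assumption of quadratic potentials (constant Hessians); the function names and numerical values are hypothetical.

```python
import numpy as np

def C_langevin(hess_U, hess_V):
    # Quadratic potentials are assumed, so both Hessians are constant matrices:
    # grad_y b = -(hess_U + hess_V), grad_x b = hess_V, and D_t = 0.
    gy, gx = -(hess_U + hess_V), hess_V
    B = np.block([[gy, gx], [gx, gy]])
    return (B + B.T) / 2.0

def lambda_C(C):
    # C <= -lambda_C I holds with the best constant -lambda_max(C).
    return float(-np.linalg.eigvalsh(C).max())

# Hypothetical example in dimension d = 2: U(y) = (y_1^2 + 2 y_2^2)/2, V(v) = |v|^2/4.
hess_U = np.diag([1.0, 2.0])
hess_V = 0.5 * np.eye(2)
print(lambda_C(C_langevin(hess_U, hess_V)))
```

In this example the rate is governed by the smallest eigenvalue of $\nabla^{2}U$: the convex interaction potential does not degrade it.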

Let $Z_{\epsilon}$ be the collection of random variables with distribution $\mu_{\epsilon}$ defined in (1.8). We also consider a couple of independent stochastic flows

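$$X_{s,t}^{\epsilon}:=X_{s,t}^{\mu_{\epsilon}}(Z_{\epsilon})\quad\mbox{and}\quad Y_{s,t}^{\epsilon}:=Y_{s,t}^{\mu_{\epsilon}}(\overline{Z}_{\epsilon})\tag{2.10}$$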

driven by independent Brownian motions, say $W_{t}=(W^{k}_{t})_{1\leq k\leq d}$ and $\overline{W}_{t}=(\overline{W}^{k}_{t})_{1\leq k\leq d}$ , and starting from a couple of independent random variables $Z_{\epsilon}$ and $\overline{Z}_{\epsilon}$ with the same law.

In the further development of this section, we denote by ${\mathbb{E}}_{X}(\cdot)$ the expectation operator w.r.t. the Brownian motion $W_{t}=({W}^{k}_{t})_{1\leq k\leq d}$ and the random variable $Z_{\epsilon}$. In this notation, we have

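$$dY_{s,t}^{\epsilon}=\mathbb{E}_{X}\left[b_{t}(X_{s,t}^{\epsilon},Y_{s,t}^{\epsilon})\right]dt+\mathbb{E}_{X}\left[\sigma_{t}(X_{s,t}^{\epsilon},Y_{s,t}^{\epsilon})\right]d\overline{W}_{t}$$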

This implies that

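$$\begin{array}{l}\displaystyle d\left[\partial_{\epsilon}Y^{\epsilon}_{s,t}\right]=\mathbb{E}_{X}\left[\nabla_{x}b_{t}\left(X^{\epsilon}_{s,t},Y^{\epsilon}_{s,t}\right)^{\prime}\partial_{\epsilon}X^{\epsilon}_{s,t}+\nabla_{y}b_{t}\left(X^{\epsilon}_{s,t},Y^{\epsilon}_{s,t}\right)^{\prime}\partial_{\epsilon}Y^{\epsilon}_{s,t}\right]dt\\ \\ \qquad\qquad\displaystyle+\sum_{1\leq k\leq r}\mathbb{E}_{X}\left[\nabla_{x}\sigma_{t,k}\left(X^{\epsilon}_{s,t},Y^{\epsilon}_{s,t}\right)^{\prime}\partial_{\epsilon}X^{\epsilon}_{s,t}+\nabla_{y}\sigma_{t,k}\left(X^{\epsilon}_{s,t},Y^{\epsilon}_{s,t}\right)^{\prime}\partial_{\epsilon}Y^{\epsilon}_{s,t}\right]d\overline{W}^{k}_{t}\end{array}$$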

with the initial condition

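$$\partial_{\epsilon}Y_{s,s}^{\epsilon}=\partial_{\epsilon}\overline{Z}_{\epsilon}=\overline{Z}_{1}-\overline{Z}_{0}$$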

A simple calculation yields the following estimate

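$$\partial_{t}\,\mathbb{E}\left[\left\|\partial_{\epsilon}Y^{\epsilon}_{s,t}\right\|^{2}\right]\leq\mathbb{E}\left(\left[\partial_{\epsilon}X^{\epsilon}_{s,t},\partial_{\epsilon}Y^{\epsilon}_{s,t}\right]^{\prime}C_{t}\left(X^{\epsilon}_{s,t},Y^{\epsilon}_{s,t}\right)\left[\begin{array}{c}\partial_{\epsilon}X^{\epsilon}_{s,t}\\ \partial_{\epsilon}Y^{\epsilon}_{s,t}\end{array}\right]\right)$$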

The inequality in the above display can be turned into an equality when $D_{t}=0$ . Also note that

[TABLE]

Let ${\cal C}^{1}_{b}(\mathbb{R}^{d})$ be the set of differentiable functions on $\mathbb{R}^{d}$ with bounded derivative. A direct consequence of the fundamental theorem of calculus yields the following theorem.

Theorem 2.2.

For any $s\leq t$ and any $f\in{\cal C}^{1}_{b}(\mathbb{R}^{d})$ and $\mu_{0},\mu_{1}\in{\cal P}_{2}(\mathbb{R}^{d})$ we have the first order differential formula (1.9). In addition, we have the exponential contraction inequality

[TABLE]

When $\lambda_{C}>0$, the above theorem provides an alternative and rather elementary proof of the exponential asymptotic stability of time varying McKean-Vlasov diffusions with not necessarily homogeneous diffusion functions. To the best of our knowledge this stability property is the first result of this type for this general class of nonlinear diffusions.

For the Langevin-type diffusion discussed in (1.6) we have $D_{t}=0$ and the matrix $C_{t}$ reduces to

[TABLE]

When $V$ is odd we have

[TABLE]

In the opposite direction, if $V$ is even and convex then we have

[TABLE]

As expected, explicit formulae are available for linear and Gaussian models. For instance, when

[TABLE]

the diffusion flow $X_{s,t}^{\mu}(x)\in\mathbb{R}^{d}$ is linear w.r.t. $\mu$ and given for any $x\in\mathbb{R}^{d}$ by the formula

[TABLE]

In the above display, $e(x)=x$ stands for the identity function on $\mathbb{R}^{d}$ . In this context, the process $X^{\epsilon}_{s,t}$ defined in (2.10) is also given by the formula

[TABLE]

This yields the rather crude estimate

[TABLE]

Up to some constant, this shows that the r.h.s. Wasserstein contraction estimate in (2.13) is met with $-\lambda_{C}=\rho(A_{1}+A_{2})\vee\rho(A_{2})$ . Applying Coppel’s inequality (cf. Proposition 3 in [14]) we can also choose $-\lambda_{C}=\left[\varsigma(A_{1}+A_{2})\vee\varsigma(A_{2})\right](1-\delta)$ for any $0<\delta<1$ , where $\varsigma(A):=\max_{i}{\left\{\mbox{\rm Re}[\lambda_{i}(A)]\right\}}\leq\rho(A)$ stands for the spectral abscissa of a square matrix $A$ .
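The gap between the spectral abscissa and the logarithmic norm can be substantial for non-normal matrices, which is why the two choices of $\lambda_{C}$ above may differ. The following sketch compares the two quantities on a hypothetical matrix that is not taken from the article.

```python
import numpy as np

def log_norm(A):
    # rho(A) = lambda_max((A + A') / 2)
    return float(np.linalg.eigvalsh((A + A.T) / 2.0).max())

def spectral_abscissa(A):
    # varsigma(A) = max_i Re(lambda_i(A))
    return float(np.linalg.eigvals(A).real.max())

# Hypothetical non-normal matrix: both eigenvalues are negative,
# yet the logarithmic norm is positive.
A = np.array([[-1.0, 4.0], [0.0, -2.0]])
print(spectral_abscissa(A), log_norm(A))
```

Here the linear flow is asymptotically stable although the logarithmic norm bound alone does not detect it; an estimate in terms of the spectral abscissa recovers a decay rate, at the price of a multiplicative constant.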

It may happen that the stochastic flow (1.4) remains in some domain $S\subset\mathbb{R}^{d}$. The simplest model we have in mind is the geometric diffusion on $S=[0,\infty[$ associated with the parameters

[TABLE]

In this situation, the diffusion flow $X_{s,t}^{\mu}(x)\in S$ is nonlinear w.r.t. $\mu$ and given for any $x\in S$ by

[TABLE]

with the function $\psi_{t}$ defined by

[TABLE]

In the above display, we have used the convention $\theta_{0}(t)=t$ . In this context, the process $X^{\epsilon}_{s,t}$ defined in (2.10) is also given by the formula

[TABLE]

Assume that $a_{1}<0$ is chosen so that $|a_{1}|>\sigma_{0}^{2}/2$ . In this situation, for any $x,y\in S$ we have

[TABLE]

as well as

[TABLE]

This yields the estimate

[TABLE]

Up to some constant, this shows that the r.h.s. Wasserstein contraction estimate in (2.13) is met with $\lambda_{C}=|a_{1}|-\sigma_{0}^{2}/2$ .

The analysis of nonlinear diffusions on more general differentiable manifolds is based on more sophisticated differential techniques. The extension of the variational calculus developed above to this class of stochastic processes on manifolds is provided in section 3.2.

We end this section with some illustrations of our results on time homogeneous models $(b_{t},\sigma_{t})=(b,\sigma)$ satisfying condition $(H_{C})$. We set $\phi_{t}:=\phi_{0,t}$, and $P^{\mu}_{t}:=P^{\mu}_{0,t}$. By theorem 2.2, there exists a unique invariant measure

[TABLE]

For the nonlinear Langevin diffusion discussed in (1.6) condition $(H_{C})$ is met when (2.14) or (2.15) are satisfied. In this context, $X^{\pi}_{t}:=X^{\pi}_{0,t}$ is a conventional Langevin diffusion given by the time homogeneous stochastic differential equation

[TABLE]

In this situation, the unique invariant measure of $X^{\pi}_{t}$ is given by

[TABLE]

In the above display, $dx$ stands for the Lebesgue measure on $\mathbb{R}^{d}$ . In this case the measure $\pi=\phi_{t}(\pi)=\pi P^{\pi}_{t}$ is the unique solution of the equation $\pi=\varpi(\pi)$ . We underline that the uniqueness of the invariant measure is not ensured for double-well confinement potential functions and too small noise. Further details on this subject including a description of the invariant measures for small noise can be found in the series of articles [19, 20, 21].

Whenever $(H_{C})$ is met, we also have the uniform moment estimates

[TABLE]

In the same vein, when $(H_{A})$ and $(H_{C})$ are met we have

[TABLE]

for some finite constant $c$ . The last assertion comes from the fact that

[TABLE]

2.3 Some almost sure estimates

We fix the parameters $\epsilon$ and some given time horizon $s\geq 0$ , and we set $y_{t}:=\partial_{\epsilon}Y^{\epsilon}_{s,t}$ , for any $t\in[s,\infty[$ , with the process $Y^{\epsilon}_{s,t}$ defined in (2.11). Also consider the processes

[TABLE]

with the collection of processes

[TABLE]

In this notation, the evolution equation (2.11) reduces to

[TABLE]

Let $t\in[s,\infty[\mapsto{\cal E}_{t}$ be the solution of the matrix evolution equation

[TABLE]

In this notation, we readily check that

[TABLE]

Whenever condition $(H_{A})$ is met, for any given $u\geq 0$ and any $t\in[u,\infty[$ we have

[TABLE]

This shows that

[TABLE]

In addition, when $\nabla_{x}b_{t}$ is uniformly bounded, $\nabla_{x}\sigma_{k,t}=0$ and $(H_{C})$ is met, using (2.12) we have the almost sure estimate

[TABLE]

with the uniform spectral norm

[TABLE]

We summarize the above discussion with the following theorem.

Theorem 2.3.

Assume that $\nabla_{x}b_{t}$ is uniformly bounded, $\nabla_{x}\sigma_{k,t}=0=\nabla_{y}\sigma_{k,t}$ and conditions $(H_{A})$ and $(H_{C})$ are met. In this situation, we have the almost sure estimate

[TABLE]

with the process $X^{\epsilon}_{s,t}$ defined in (1.8) and the parameter $\lambda:=\lambda_{A}\wedge\lambda_{C}$ .

3 Some extensions

3.1 A backward variational formula

The stochastic transition semigroup associated with the flow $X^{\mu}_{s,t}(x)$ is defined for any measurable function $f$ on $\mathbb{R}^{d}$ by the formula

[TABLE]

For any twice differentiable function $f$ we have the gradient and the Hessian formulae

[TABLE]

In the above display, $\nabla^{2}X^{\mu}_{s,t}(x)$ stands for the tensor function

[TABLE]

Also recall that the infinitesimal generator $L_{t,\phi_{s,t}(\mu)}$ of the stochastic flow (1.4) is given for any twice differentiable function $f$ by the second order operator

[TABLE]

The next theorem is an extension of a theorem by Da Prato-Menaldi-Tubaro [16] to nonlinear diffusions.

Theorem 3.1.

Assume that $b_{t}(x,y)$ and $\sigma_{t}(x,y)$ are Lipschitz functions w.r.t. the parameters $(t,x,y)$ . In this situation, for any $\mu\in{\cal P}_{2}(\mathbb{R}^{d})$ we have

[TABLE]

where $\widehat{d}\,W_{u}$ stands for the backward integration notation, so that the r.h.s. term in the above formula is a square integrable backward martingale.

The proof of the above formula follows the elegant stochastic backward variational analysis developed in [16]. A sketch of the proof is provided in the appendix (see the proof of (3.1)).

We further assume that $\nabla_{x}\sigma_{k,t}(x,y)=0$ . In this situation, using the backward formula (3.1) we check the stochastic interpolation formula

[TABLE]

Equivalently, we have

[TABLE]

Combining (2.2) and (2.3) with (2.13) we obtain the following corollary.

Corollary 3.2.

Assume the conditions of theorem 3.1 are satisfied and we have $\nabla_{x}\sigma_{k,t}=0$ and $\|\nabla_{x}b_{t}(x,y)\|_{2}\leq c$ , for some constant $c<\infty$ . Also assume that $(H_{A})$ and $(H_{C})$ are met for some parameters $\lambda_{A}$ and $\lambda_{C}$ . In this situation we have the exponential decay estimates

[TABLE]

In addition, when $\nabla_{y}\sigma_{k,t}=0$ we have the uniform and almost sure estimates

[TABLE]

3.2 Diffusions on smooth manifolds

This section is concerned with the extension of our results to nonlinear diffusions on Riemannian manifolds. Let us begin with the necessary general facts about nonlinear diffusions in manifolds. Our presentation will be made as similar as possible to the one in Euclidean space. For this, we will need Itô differentials of manifold valued diffusions, parallel translation, and covariant differentials of tangent bundle valued semimartingales.

Let $M$ be a smooth manifold of dimension $d$ . Stratonovich calculus is similar on $M$ and on $\mathbb{R}^{d}$ . So we are able to deal with Stratonovich SDE’s of the type

[TABLE]

where for $y\in M$

[TABLE]

$W_{t}$ is an $\mathbb{R}^{m}$-valued Brownian motion and $\sigma(y)$ is a linear map $\mathbb{R}^{m}\to T_{y}M$. For simplicity $\sigma$ will not depend on time; the time-dependent case can also be treated, and we refer to [1] for this extension as well as for the details of the constructions below.

The only situation we will be interested in is when for all $y\in M$ the map

[TABLE]

is a linear diffeomorphism. In this situation a scalar product can be defined in $T_{y}^{\ast}M$ and then in $T_{y}M$ , leading to a Riemannian structure on $M$ . The scalar product in $T_{y}^{\ast}M$ is

[TABLE]

and the scalar product in $T_{y}M$ is

[TABLE]

Associated to the metric $g$ is the Levi-Civita connection $\nabla$, which will be used to define parallel transport, Itô equations and Itô covariant differentials. Recall that the parallel transport along a continuous $M$-valued semimartingale $X$ is the linear map $//_{t}:T_{X_{0}}M\to T_{X_{t}}M$ which satisfies $//_{0}={\rm Id}$ and the Stratonovich SDE $\nabla_{\circ dX_{t}}//_{t}=0$. It is the natural extension of parallel transport along smooth paths, and it is an isometry. Parallel translation allows us to anti-develop $X_{t}$ in $T_{X_{0}}M$ with the Stratonovich integral

[TABLE]

The process ${\cal A}(X)$ takes its values in the vector space $T_{X_{0}}M$; it has an Itô differential $d{\cal A}(X)_{t}$, which allows us to define the Itô differential of $X_{t}$

[TABLE]

This Itô differential is formally a vector which can be expressed in local coordinates as

[TABLE]

The next object to consider is the Itô covariant derivative $DU_{t}$ of a $T_{X_{t}}M$-valued continuous semimartingale $U_{t}$:

[TABLE]

easily defined from the fact that $//_{t}^{-1}U_{t}$ is vector valued. From the isometry property of parallel translation we easily get the formula for $V_{t}$ another $T_{X_{t}}M$ -valued semimartingale and $\langle\cdot,\cdot\rangle:=g$ ,

[TABLE]

Defining $\displaystyle b_{t}(x,y):=b^{S}_{t}(x,y)+\frac{1}{2}\sum_{k=1}^{m}\nabla\sigma_{k}(\sigma_{k}(y))$ (where for two vector fields $A,B$, $\nabla A(B(y))$ denotes the covariant derivative of $A$ in the direction $B(y)$), it is well known that the Stratonovich SDE (3.3) is equivalent to the Itô SDE

[TABLE]

A remarkable fact concerning this equation is that, whenever it exists, a solution to equation (3.9) is a diffusion with nonlinear generator $L_{t,\phi_{s,t}(\mu)}$, where

[TABLE]

So we can consider that the starting point of our study is SDE (3.9) in a Riemannian manifold $(M,g)$ .
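In the flat one-dimensional case $M=\mathbb{R}$ with a single vector field ($m=1$), the covariant derivative is the ordinary derivative and the Stratonovich-to-Itô drift correction recalled above reads $b=b^{S}+\frac{1}{2}\sigma^{\prime}\sigma$. The sketch below evaluates this correction with a central finite difference. The function names and the finite-difference step are illustrative assumptions, and the mean field argument $x$ is dropped for brevity.

```python
def ito_drift(b_strat, sigma, y, h=1e-5):
    """Ito drift b(y) = b_S(y) + 0.5 * sigma'(y) * sigma(y) on the real line.

    sigma'(y) is approximated by a central finite difference of step h
    (an illustrative choice, not part of the original text).
    """
    d_sigma = (sigma(y + h) - sigma(y - h)) / (2.0 * h)
    return b_strat(y) + 0.5 * d_sigma * sigma(y)


# Stratonovich equation dY = Y o dW (no drift, sigma(y) = y):
# the equivalent Ito equation is dY = 0.5 * Y dt + Y dW.
drift_at_2 = ito_drift(lambda y: 0.0, lambda y: y, 2.0)

# With a constant sigma the correction vanishes and both drifts coincide.
drift_constant_sigma = ito_drift(lambda y: -y, lambda y: 3.0, 1.0)
print(drift_at_2, drift_constant_sigma)
```

On a curved manifold the ordinary derivative has to be replaced by the covariant derivative of $\sigma_{k}$ in the direction $\sigma_{k}(y)$, which this flat example does not capture.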

Let us adapt the regularity conditions $(H_{A})$ and $(H_{C})$ :

Define $A^{g}_{t}(x,y):=\nabla_{y}b_{t}(x,y)+\nabla_{y}b_{t}(x,y)^{\prime}$, where $\nabla_{y}b_{t}(x,y)$ is the covariant derivative with respect to the variable $y$, a linear map from $T_{y}M$ into itself, and $\nabla_{y}b_{t}(x,y)^{\prime}$ is its adjoint with respect to the Riemannian metric.

$(H^{g}_{A})$ : There exists some $\lambda_{A}^{g}\in\mathbb{R}$ such that for any $x,y\in M$ and $t\geq 0$ we have

[TABLE]

where $\rm Ric$ is the Ricci curvature tensor of $M$ .

Let $B_{t}^{g}$ be as in (2.8) with gradient replaced by covariant derivative.

Define $C_{t}^{g}(x,y):=\frac{1}{2}\left[B^{g}_{t}\left(x,y\right)+B^{g}_{t}\left(x,y\right)^{\prime}\right]$ .

$(H^{g}_{C})$ : There exists some $\lambda_{C}^{g}\in\mathbb{R}$ such that for any $x,y\in M$ and $t\geq 0$ we have

[TABLE]

where $g_{M\times M}(x,y)$ , ${\rm Ric}_{M\times M}(x,y)$ are the product metric and Ricci curvature on $M\times M$ .

Theorem 3.3.

We have the exponential expansion or contraction inequalities

[TABLE]

for some finite constant $c$ . In addition, we have

[TABLE]

Remark:

The results of Theorem 3.3 still hold when $\sigma=\sigma_{t}$ and $g=g_{t}$ depend on time, one just has to replace in $(H^{g}_{A})$ $\rm Ric$ by ${\rm Ric}-\dot{g}$ and in $(H^{g}_{C})$ $\rm Ric_{M\times M}$ by ${\rm Ric_{M\times M}}-\dot{g}_{M\times M}$ .

Proof.

The proof of the first estimate is similar to the proof of Theorem 4.1 in [1] (where time dependent metrics are considered), so we omit it. The proof of the second one combines that proof with the one of Theorem 2.2 in the present article. Let us go into the details.

Let $Z_{0}$, $Z_{1}$ be two random variables with values in $M$ such that $(Z_{0},Z_{1})$ minimizes $\mathbb{E}[d^{2}(Z_{0},Z_{1})]$ under the condition that $Z_{0}$ has law $\mu_{0}$ and $Z_{1}$ has law $\mu_{1}$. For all $\omega$, let $\epsilon\mapsto Z_{\epsilon}(\omega)$ be a geodesic between $Z_{0}(\omega)$ and $Z_{1}(\omega)$.

As in the proof of Theorem 2.2, let $Y_{s,s}^{\mu_{0}}(x)=x$ and $t\in[s,\infty[\mapsto Y_{s,t}^{\mu_{0}}(x)$ solve the equation

[TABLE]

where $\bar{W}_{t}$ is a $\mathbb{R}^{m}$ valued Brownian motion independent of $W_{t}$ . Let $(\bar{Z}_{\epsilon})_{\epsilon\in[0,1]}$ be independent of $(Z_{\epsilon})_{\epsilon\in[0,1]}$ with the same law, $Y_{s,s}^{\epsilon}=\bar{Z}_{\epsilon}$ and $Y_{s,t}^{\epsilon}$ the solution to the Itô SDE

[TABLE]

where $\epsilon\mapsto//_{s,t}^{\,0,\epsilon}(\omega)$ is the parallel transport along the path $\epsilon\mapsto Y_{s,t}^{\epsilon}(\omega)$. Notice that $Y_{s,t}^{0}\equiv Y_{s,t}^{\mu_{0}}(\bar{Z}_{0})$.

Equation (3.15) is not an SDE on the manifold $M$; it is an SDE on $C^{1}$ $M$-valued paths. Existence of solutions has been established in [1]. The processes $t\mapsto Y_{s,t}^{\epsilon}$ are obtained from one another by infinitesimal synchronous coupling, and this is the only construction for which a.s. the paths $\epsilon\mapsto Y_{s,t}^{\epsilon}(\omega)$ have finite variation. Moreover, the derivatives of these paths satisfy

[TABLE]

where ${\rm Ric}^{\sharp}(u)$ is the vector such that $\langle{\rm Ric}^{\sharp}(u),v\rangle={\rm Ric}(u,v)$ . The advantage of this construction is that the above covariant derivative has finite variation, and this implies

[TABLE]

Then the proof is similar to the one of Theorem 2.2:

[TABLE]

This implies that

[TABLE]

On the other hand, we have

[TABLE]

This ends the proof of the theorem.

An important example of nonlinear diffusions in manifolds is again given by Langevin diffusions, defined as in (3.9), with now

[TABLE]

where $U$ is a potential function, $\rho$ is the Riemannian distance associated to the metric $g$, $\rho_{x}$ is the distance to $x$ and $F:\mathbb{R}_{+}\to\mathbb{R}$ is a $C^{2}$ function. A sufficient condition for $b_{t}(x,y)$ defined by (3.17) to be well defined and smooth is that the derivative of $F$ vanishes at the origin and the support of $F$ is included in $[0,\imath(M))$, where $\imath(M)$ denotes the injectivity radius of $M$. But smoothness of $b_{t}(x,y)$ is not a necessary condition for defining nonlinear diffusions.

We find that for $u,v\in T_{y}M$ ,

[TABLE]

In this context, condition $(H^{g}_{A})$ reduces to

[TABLE]

If for instance $M$ is simply connected with nonpositive curvature (which implies that the distance function $\rho$ is convex), and $F$ is nondecreasing, a sufficient condition is

[TABLE]

The computation of $B_{t}$ reveals that it is symmetric, and that for $(u,v)\in T_{x}M\times T_{y}M$ ,

[TABLE]

In this context condition $(H^{g}_{C})$ reduces to

[TABLE]

where $U^{\oplus 2}(x,y)=U(x)+U(y)$ . Here again, when $M$ is simply connected with nonpositive curvature, $F$ is convex and nondecreasing, the above condition is met as soon as

[TABLE]

4 Mean field interacting diffusions

4.1 Stability properties

The interacting diffusion flow $\xi_{s,t}^{j}(z)=(\xi_{s,t}^{j,k}(z))_{1\leq k\leq d}\in\mathbb{R}^{d}$ presented in (1.5) can be rewritten as

[TABLE]

with the drift and the diffusion functions defined for any $z=(z_{1},\ldots,z_{N})\in(\mathbb{R}^{d})^{N}$ with $z_{i}=(z_{i}^{l})_{1\leq l\leq d}\in\mathbb{R}^{d}$ by the formulae

[TABLE]

For any differentiable function ${\cal H}:z\in(\mathbb{R}^{d})^{N}\mapsto{\cal H}(z)\in(\mathbb{R}^{d})^{N}$ and any $1\leq i,j\leq N$ and $1\leq l,k\leq d$ we consider the gradient blocks

[TABLE]

In this notation, for any $i\not=j$ we have

[TABLE]

and the diagonal term

[TABLE]

Using the composition rule

[TABLE]

we check that

[TABLE]

$({\cal H}_{{\cal A}})$ : There exists some $\lambda_{{\cal A}}\in\mathbb{R}$ such that for any $z\in(\mathbb{R}^{d})^{N}$ and $t\geq 0$ we have

[TABLE]

This spectral condition produces several gradient estimates. For instance, arguing as in (2.6) we have the following theorem.

Theorem 4.1.

Assume condition $({\cal H}_{{\cal A}})$ is satisfied. In this situation we have the uniform exponential decay estimates

[TABLE]

In addition, when $\nabla{\cal G}_{t,\alpha}(z)=0$ we have the uniform almost sure exponential decay estimate

[TABLE]

The proof of the above theorem is provided in the appendix (see the proof of theorem 4.1).

For the nonlinear Langevin diffusion discussed in (1.6) we have $\nabla{\cal G}_{t,\alpha}(z)=0$ and

[TABLE]

In this situation we have

[TABLE]

with the matrix ${\cal E}_{t}(z)$ with block entries

[TABLE]

When $V$ is odd we have

[TABLE]

When $V$ is even and convex we have ${\cal E}_{t}(z)\geq 0$ and therefore

[TABLE]

In this situation, we also have

[TABLE]

Last but not least, whenever $\nabla V(0)=0$ we have

[TABLE]

Note that $\nabla V(0)=0$ holds when $V$ is even. In this situation, the diffusion $\xi_{t}$ reduces to a conventional Langevin diffusion

[TABLE]

In this context, the stationary measure of the particle model $\xi_{t}$ is given by the Gibbs measure

[TABLE]

4.2 Propagation of chaos properties

For any differentiable function $g(x,y)$ from $\mathbb{R}^{2d}$ into $\mathbb{R}^{d}$ we let $\nabla_{u}g(x,y)$ be the gradient matrices w.r.t. the coordinate $u\in\{x,y\}$ , and we set

[TABLE]

We extend matrix-valued functions $G:z\in\mathbb{R}^{k}\mapsto G(z)\in\mathbb{R}^{d\times d}$ to the product space $\mathbb{R}^{2k}$ by setting

[TABLE]

We also consider the mapping $\delta:\mathbb{R}^{d}\to\mathbb{R}^{d}\times\mathbb{R}^{d}$, $x\mapsto(x,x)$, and for any $x,\overline{x}\in\mathbb{R}^{d}$ we set

[TABLE]

Let ${\cal B}_{t}(z,\overline{z})$ and ${\cal D}_{t}(z,\overline{z})$ be the functions defined for any $z=(x,y)$ and $\overline{z}:=(\overline{x},\overline{y})\in\mathbb{R}^{2d}$ by

[TABLE]

The matrices ${\cal B}^{(i)}_{t}(z,\overline{z})$ in the above display are given by

[TABLE]

and the matrices ${\cal D}^{(i)}_{t}(z,\overline{z})$ are given by

[TABLE]

Consider the following regularity condition:

$({\cal H}_{{\cal C}})$ : There exists some $\lambda_{{\cal C}}\in\mathbb{R}$ such that for any $z,\overline{z}\in\mathbb{R}^{2d}$ and $t\geq 0$ we have

[TABLE]

Let $\zeta_{0}=(\zeta_{0}^{i})_{1\leq i\leq N}$ be $N$ independent copies of a random variable with distribution $\mu$ on $\mathbb{R}^{d}$ . Let $\xi_{t}:=\xi_{0,t}(\zeta_{0})$ and consider the diffusion processes $\zeta_{t}=(\zeta_{t}^{i})_{1\leq i\leq N}$ defined as $\xi_{t}$ by replacing the occupation measures $m(\xi_{t})$ by the distributions $\mu_{t}=\phi_{t}(\mu):=\phi_{0,t}(\mu)$ ; that is, for any $1\leq i\leq N$ we have

[TABLE]

Theorem 4.2.

Assume condition $({\cal H}_{{\cal C}})$ is satisfied. In this situation, for any $\epsilon>0$ and any distribution $\mu$ on $\mathbb{R}^{d}$ we have

[TABLE]

with the parameters

[TABLE]

Proof.

We set $S_{t}:=\mathbb{E}\left(\|\xi_{t}^{1}-\zeta_{t}^{1}\|^{2}\right)$ . Using the decomposition

[TABLE]

we check that

[TABLE]

with $\Sigma_{t}$ and $\Gamma_{t}$ defined by

[TABLE]

Applying the Cauchy-Schwarz inequality we find that

[TABLE]

with

[TABLE]

To estimate the term $\Sigma_{t}$ we observe that

[TABLE]

On the other hand, for any differentiable function $g$ from $\mathbb{R}^{2d}$ into $\mathbb{R}^{d}$ , and for any $z=(x,y)$ and $\overline{z}=(\overline{x},\overline{y})\in\mathbb{R}^{2d}$ we have the first order decomposition

[TABLE]

with the matrix

[TABLE]

By symmetry arguments, this implies that

[TABLE]

In the same vein, we have

[TABLE]

This yields the estimate

[TABLE]

To estimate the term $I_{t}$ we use the decomposition

[TABLE]

Also notice that

[TABLE]

We also have

[TABLE]

This implies that

[TABLE]

from which we check that

[TABLE]

Combining the above decompositions we check that

[TABLE]

Combining the above estimate with (4.13) we find that

[TABLE]

from which we conclude that

[TABLE]

Applying the Cauchy-Schwarz inequality twice we check the estimate

[TABLE]

On the other hand, we have

[TABLE]

This implies that

[TABLE]

Recalling that $2ab\leq\epsilon a^{2}+b^{2}/\epsilon$ for any $\epsilon>0$ and $a,b\in\mathbb{R}$ , we check that

[TABLE]

This ends the proof of the theorem.

We end this section with some comments on the regularity condition $({\cal H}_{{\cal C}})$ .

For the nonlinear Langevin diffusion discussed in (1.6) we have ${\cal D}_{t}(z,\overline{z})=0$ and

[TABLE]

In this context, we have

[TABLE]

Also observe that for any $z\in\mathbb{R}^{2d}$ we have the decomposition

[TABLE]

with the matrices

[TABLE]

In the above display $C_{t}(z)$ stands for the matrix defined in (2.9), $B^{(1)}_{t}(z)$ and $D^{(1)}_{t}(z)$ stand for the matrices defined for any $z=(x,y)\in\mathbb{R}^{2d}$ by

[TABLE]

Consider the following regularity condition:

$\left({\cal H}_{{\cal C}^{(1)}}\right)$ : There exists some $\lambda_{{\cal C}^{(1)}}\in\mathbb{R}$ such that for any $(x,y)\in\mathbb{R}^{2d}$ and $t\geq 0$ we have

[TABLE]

Assume that $\left({\cal H}_{{\cal C}^{(1)}}\right)$ is met. Using the fact that $\mathbb{E}(\Sigma^{\prime})\mathbb{E}(\Sigma)\leq\mathbb{E}(\Sigma^{\prime}\Sigma)$ , for any random matrix $\Sigma$ , we check that

[TABLE]

Several uniform estimates can be derived combining (4.12) with the moments estimates (2.17). For instance, suppose we are given a time homogeneous model $(b_{t},\sigma_{t})=(b,\sigma)$ , for some functions $(b,\sigma)$ with uniformly bounded first order derivatives. Also assume $\left({\cal H}_{{\cal C}^{(1)}}\right)$ is met for some $\lambda_{{\cal C}^{(1)}}>0$ . In this context, the moments estimates (2.17) ensure that

[TABLE]

for some constant $c(\mu)$ whose value only depends on the measure $\mu$. Choosing $\epsilon=\lambda_{{\cal C}}/2$ in (4.12) we readily check that

[TABLE]
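The uniform propagation of chaos estimates of this section can be illustrated numerically on a toy model. The sketch below uses an Euler-Maruyama discretization of a hypothetical one-dimensional model with drift $\int b(x,y)\,\mu(dx)=-y-(y-\int x\,\mu(dx))$ and unit diffusion. This drift, the time step, the initial law $N(1,1)$ and the function name are assumptions chosen for simplicity and are not taken from the paper. The particles $\xi^{i}$ and the independent copies $\zeta^{i}$ share the same initial conditions and the same Brownian increments, and the mean square gap between $\xi^{1}_{t}$ and $\zeta^{1}_{t}$ is expected to shrink as $N$ grows.

```python
import random


def mean_square_gap(n_particles, n_reps, t_end=2.0, dt=0.01, seed=1):
    """Average over n_reps runs of |xi_T^1 - zeta_T^1|^2 under synchronous coupling.

    Hypothetical 1D model: drift -y - (y - m) and unit diffusion, where m is the
    empirical mean for the particles xi and the mean of mu_t for the copies zeta.
    """
    rng = random.Random(seed)
    n_steps = int(round(t_end / dt))
    sqrt_dt = dt ** 0.5
    total = 0.0
    for _ in range(n_reps):
        zeta = [rng.gauss(1.0, 1.0) for _ in range(n_particles)]
        xi = list(zeta)  # same initial condition
        mean_mu = 1.0    # mean of mu_t, started at E[zeta_0]
        for _ in range(n_steps):
            mean_xi = sum(xi) / n_particles  # mean of the occupation measure
            for i in range(n_particles):
                dw = rng.gauss(0.0, sqrt_dt)  # Brownian increment shared by xi and zeta
                xi[i] += (-xi[i] - (xi[i] - mean_xi)) * dt + dw
                zeta[i] += (-zeta[i] - (zeta[i] - mean_mu)) * dt + dw
            mean_mu += -mean_mu * dt  # Euler step for d/dt E[zeta_t] = -E[zeta_t]
        total += (xi[0] - zeta[0]) ** 2
    return total / n_reps


gap_small = mean_square_gap(n_particles=10, n_reps=20)
gap_large = mean_square_gap(n_particles=500, n_reps=5)
print(gap_small, gap_large)
```

In this linear toy model the gap is driven only by the fluctuations of the empirical mean around the mean of $\mu_{t}$, so it scales like $1/N$; the simulation is a sanity check of that scaling and not a verification of the constants in (4.12).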

4.3 Propagation of chaos in manifolds

Our aim is to state an analogue of Theorem 4.2 in a Riemannian manifold $(M,g)$. We use the notation of Section 3.2. Let us denote by $\rho$ the Riemannian distance in $M$. Now $\zeta_{0}=(\zeta_{0}^{i})_{1\leq i\leq N}$ are independent copies of a random variable with distribution $\mu$ on $M$. For $1\leq i\leq N$ the diffusions $\zeta_{s,t}^{i}(x)$ satisfy the Itô SDE

[TABLE]

with $\sigma(y):\mathbb{R}^{m}\to T_{y}M$ linear, $\sigma\sigma^{\ast}=g^{\ast}$ , and $(W_{t}^{i}),1\leq i\leq N$ independent $\mathbb{R}^{m}$ -valued Brownian motions independent of $\zeta_{0}$ . Denote $\mu_{t}:=\phi_{0,t}(\mu)$ , $\zeta_{t}:=\zeta_{0,t}(\zeta_{0})$ . The diffusions $\zeta_{t}^{i}$ are independent and identically distributed, with law $\mu_{t}$ at time $t$ . Define an approximation of $\zeta_{t}$ with the Markov process $\xi_{t}=(\xi_{t}^{i})_{1\leq i\leq N}$ satisfying $\xi_{0}=\zeta_{0}$ and for all $i$ ,

[TABLE]

where for $x,y\in M$ , $//_{x,y}$ denotes parallel translation along the minimal geodesic from $x$ to $y$ . It is well-known that such an equation has a solution, which realizes the coupling by parallel translation of martingale parts of $\zeta_{t}^{i}$ and $\xi_{t}^{i}$ (see e.g. [2] or [43]). The only difficulty is when $\xi_{t}^{i}$ is in the cutlocus of $\zeta_{t}^{i}$ , but this difficulty can be overcome by constructing approximations of the solutions which are decoupled in an $\epsilon$ -neighbourhood of the cutlocus, and by letting then $\epsilon$ tend to [math]. However the solution obtained is not strong. Anyway, since $//_{\zeta_{t}^{i},\xi_{t}^{i}}$ is an isometry and the $W_{t}^{i}$ are independent, the process $\xi_{t}$ is a Brownian motion in $M^{N}$ with drift $(b_{t}(m(\xi_{t}),\xi^{i}_{t}))_{1\leq i\leq N}$ , so it is a diffusion process. Moreover independent $\mathbb{R}^{m}$ valued Brownian motions $w_{t}^{i}$ can be found such that

[TABLE]

they satisfy

[TABLE]

for some “complementary” martingale $m_{t}^{i}$ .

The important fact about this construction is that the distance $\rho^{2}(\zeta_{t}^{i},\xi_{t}^{i})$ has finite variation. More precisely, letting for $x,y\in M$ with $y$ not belonging to the cutlocus of $x$ , $s\mapsto\gamma(x,y)(s)$ the geodesic from $x$ to $y$ in time $1$ and $\overrightarrow{xy}=\dot{\gamma}(x,y)(0)$ we have

[TABLE]

In the above display $L_{t}$ stands for a nondecreasing process which increases only when $\xi_{t}^{1}$ is in the cutlocus of $\zeta_{t}^{1}$ , and $I$ is the index map defined for $x,y\in M$ , and $y\not\in{\rm Cut(x)}$ , by

[TABLE]

where $\varphi$ is a unit speed geodesic from $x$ to $y$ started at time [math], $(J_{i}(0))_{1\leq i\leq d-1}$ is an orthonormal basis of $\dot{\varphi}(0)^{\perp}$ , $J_{i}(\rho(x,y))=//_{x,y}J_{i}(0)$ and $s\mapsto J(s)$ is a Jacobi field along $s\mapsto\varphi(s)$ (see e.g. [2]). It is well known that when ${\rm Ric}_{M}\geq\kappa$ then $I(x,y)\leq\bar{I}(\rho(x,y),\kappa)$ where $\bar{I}(\rho(x,y),\kappa)$ is the same quantity computed in a constant curvature manifold, for two points at the same distance. Moreover we have the explicit values

[TABLE]

In any case, $\bar{I}(\rho,\kappa)\leq-\kappa\rho$ , so we obtain as a general result that when ${\rm Ric}_{M}\geq\kappa$

[TABLE]

So we have

[TABLE]

Define similarly to the previous section for a Riemannian manifold $M$ and a map $G:M\times M\to TM$ such that $G(x,y)\in T_{y}M$ : for $z=(x,y)$ , $\bar{z}=(\bar{x},\bar{y})$ elements of $M\times M$

[TABLE]

Also define

[TABLE]

where $\delta:M\to M\times M$ , $x\mapsto(x,x)$ , and set

[TABLE]

Consider the following regularity condition:

$({\cal H}_{{\cal C}}^{g})$ : There exists some $\lambda_{{\cal C}}\in\mathbb{R}$ such that for any $z,\overline{z}\in M\times M$ and $t\geq 0$ we have

[TABLE]

Theorem 4.3.

Assume that the Ricci curvatures of $M$ are bounded below by $\kappa\in\mathbb{R}$ and that the condition $({\cal H}_{{\cal C}}^{g})$ is satisfied. Then

[TABLE]

with the parameter $\beta_{t}(\mu)$ defined as in Theorem 4.2.

Remark:

The result of Theorem 4.3 extends to the case when $\sigma=\sigma_{t}$ and $g=g_{t}$ depend on time, if we replace the bound below of the Ricci curvatures by the assumption that ${\rm Ric}_{M}-\dot{g}\geq\kappa g$ .

Proof.

The proof is completely similar to the one of Theorem 4.2, thus it is only sketched. Letting $S_{t}:=\mathbb{E}\left[\rho^{2}(\zeta_{t}^{1},\xi_{t}^{1})\right]$ we arrive at

[TABLE]

where

[TABLE]

and

[TABLE]

which leads to

[TABLE]

so letting $s_{t}=\sqrt{S_{t}}$ we get

[TABLE]

This ends the proof of (4.32).

Let us investigate condition (4.31) for the Langevin diffusion with drift (3.17), namely

[TABLE]

We need the additional assumption $\partial F(0)=0$ . In this situation, the computation of $I_{t}$ in (4.34) yields the formula

[TABLE]

where we denoted $\overrightarrow{\zeta_{t}^{1}\xi_{t}^{1}}=\dot{\gamma}(\zeta_{t}^{1},\xi_{t}^{1})(0)$ , leading to the condition $({\cal H}_{{\cal C}}^{g})$ : for all $z,\bar{z}\in M\times M$ ,

[TABLE]

This condition is met for instance when for all $z\in M\times M$ ,

[TABLE]

Appendix

Proof of (2.2) and (2.3)

After some calculations we check that

[TABLE]

with the matrix valued martingale

[TABLE]

and

[TABLE]

In the above display, we have used the fact that $\mathbb{E}(\Sigma^{\prime})\mathbb{E}(\Sigma)\leq\mathbb{E}(\Sigma^{\prime}\Sigma)$ , for any random matrix $\Sigma$ . The end of the proof of (2.2) and (2.3) is now clear.
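The matrix inequality $\mathbb{E}(\Sigma^{\prime})\mathbb{E}(\Sigma)\leq\mathbb{E}(\Sigma^{\prime}\Sigma)$ holds in the sense of symmetric matrices, since the difference is the expectation of $(\Sigma-\mathbb{E}(\Sigma))^{\prime}(\Sigma-\mathbb{E}(\Sigma))$. The sketch below checks this on an empirical measure of $2\times 2$ Gaussian matrices. The sampling law, the sample size and the helper name are arbitrary illustrative choices.

```python
import random


def second_moment_gap(samples):
    """Return E(S'S) - E(S)'E(S) as a 2x2 list, E being the average over the samples."""
    n = len(samples)

    def gram(m):
        # m' m for a 2x2 matrix m
        return [[sum(m[k][i] * m[k][j] for k in range(2)) for j in range(2)]
                for i in range(2)]

    mean = [[sum(s[i][j] for s in samples) / n for j in range(2)] for i in range(2)]
    grams = [gram(s) for s in samples]
    mean_gram = [[sum(g[i][j] for g in grams) / n for j in range(2)] for i in range(2)]
    gram_mean = gram(mean)
    return [[mean_gram[i][j] - gram_mean[i][j] for j in range(2)] for i in range(2)]


rng = random.Random(0)
samples = [[[rng.gauss(0.0, 1.0) for _ in range(2)] for _ in range(2)]
           for _ in range(200)]
gap = second_moment_gap(samples)

# A symmetric 2x2 matrix is positive semi-definite iff its trace and determinant are >= 0.
trace = gap[0][0] + gap[1][1]
det = gap[0][0] * gap[1][1] - gap[0][1] * gap[1][0]
print(trace, det)
```

Both the trace and the determinant of the difference come out nonnegative up to rounding, as expected for a positive semi-definite matrix.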

Proof of (3.1)

For any time mesh $s_{k}\leq s_{k+1}$ with $s_{0}=s$ and $s_{n}=t$, and with $h:=\max{|s_{k}-s_{k-1}|}$, we have

[TABLE]

Also observe that

[TABLE]

with the random fields

[TABLE]

Using elementary manipulations, for any $0\leq h\leq 1$ we check that

[TABLE]

for some finite constants $c$ and $c_{n}$. Recalling that $(t,x,y)\mapsto b_{t}(x,y)$ and $(t,x,y)\mapsto\sigma_{t}(x,y)$ are Lipschitz functions we check the almost sure convergence

[TABLE]

Using the Taylor expansion

[TABLE]

we check that

[TABLE]

Rearranging the terms we find that

[TABLE]

with the remainder term

[TABLE]

This yields the decomposition

[TABLE]

with the remainder term

[TABLE]

On the other hand, we have

[TABLE]

This implies that

[TABLE]

We end the proof of (3.1) by letting the time step $h\rightarrow 0$ .

Proof of theorem 4.1

Observe that

[TABLE]

This implies that

[TABLE]

This ends the proof of (4.1). The proof of (4.4) and (4.5) comes from the formula

[TABLE]

with the martingale

[TABLE]

defined in terms of the diffusion processes

[TABLE]

The end of the proof of (4.4) and (4.5) follows the same lines of arguments as the proof of (2.2) and (2.3), thus it is skipped. This ends the proof of the theorem.

Bibliography

[1] M. Arnaudon, A. Coulibaly, A. Thalmaier. Horizontal diffusions in $C^{1}$ path space. Séminaire de Probabilités XLIII, pp. 73–94, Lecture Notes in Mathematics 2006, Springer (2010).
[2] M. Arnaudon, A. Thalmaier, F.Y. Wang. Harnack inequality and heat kernel estimates on manifolds with curvature unbounded below. Bull. Sci. Math., vol. 130, pp. 223–233 (2006).
[3] D. Bakry, M. Emery. Diffusions hypercontractives. In Séminaire de probabilités, XIX, 1983/84, pp. 177–206. Lect. Notes in Math. 1123, Springer (1985).
[4] S. Benachour, B. Roynette, D. Talay, P. Vallois. Nonlinear self-stabilizing processes, part I: Existence, invariant probability, propagation of chaos. Stochastic processes and their applications, vol. 75, no. 2, pp. 173–201 (1998).
[5] S. Benachour, B. Roynette, P. Vallois. Nonlinear self-stabilizing processes, part II: Convergence to invariant probability. Stochastic processes and their applications, vol. 75, no. 2, pp. 203–224 (1998).
[6] D. Benedetto, E. Caglioti, M. Pulvirenti. A kinetic equation for granular media. RAIRO Modèl. Math. Anal. Numér., vol. 31, no. 5, pp. 615–641 (1997).
[7] D. Benedetto, E. Caglioti, J. A. Carrillo, M. Pulvirenti. A non-Maxwellian steady distribution for one-dimensional granular media. J. Statist. Phys., vol. 91, pp. 979–990 (1998).
[8] F. Bolley, I. Gentil, A. Guillin. Uniform convergence to equilibrium for granular media. Archive for Rational Mechanics and Analysis, vol. 208, no. 2, pp. 429–445 (2013).
