Backward It{\^o}-Ventzell and stochastic interpolation formulae

Pierre del Moral (ASTRAL); Sumeetpal Sidhu Singh

arXiv:1906.09145·math.PR·May 5, 2021

Backward It{\^o}-Ventzell and stochastic interpolation formulae

Pierre del Moral (ASTRAL), Sumeetpal Sidhu Singh

PDF

TL;DR

This paper introduces a new backward Itô-Ventzell formula and extends stochastic interpolation formulas to stochastic flows, providing spectral conditions for uniform estimates and applications in diffusion perturbation and approximation theories.

Contribution

The paper presents a novel backward Itô-Ventzell formula and extends stochastic interpolation formulas to stochastic flows, with spectral conditions for uniform flow difference estimates.

Findings

01

New backward Itô-Ventzell formula introduced

02

Spectral conditions enable simple proofs of flow estimates

03

Applications demonstrated in diffusion perturbation and approximation

Abstract

We present a novel backward It{\^o}-Ventzell formula and an extension of the Aleeksev-Gr\"obner interpolating formula to stochastic flows. We also present some natural spectral conditions that yield direct and simple proofs of time uniform estimates of the difference between the two stochastic flows when their drift and diffusion functions are not the same, yielding what seems to be the first results of this type for this class of anticipative models. We illustrate the impact of these results in the context of diffusion perturbation theory, interacting diffusions and discrete time approximations

Equations707

d X_{s, t} (x) = b_{t} (X_{s, t} (x)) d t + σ_{t} (X_{s, t} (x)) d W_{t}

d X_{s, t} (x) = b_{t} (X_{s, t} (x)) d t + σ_{t} (X_{s, t} (x)) d W_{t}

\sigma_{t}(x)=\Sigma_{t}\quad\mbox{\rm and}\quad\overline{\sigma}_{t}(x)=\overline{\Sigma}_{t}\quad\mbox{\rm for some matrices $\Sigma_{t}$ and $\overline{\Sigma}_{t}$.}\quad

\sigma_{t}(x)=\Sigma_{t}\quad\mbox{\rm and}\quad\overline{\sigma}_{t}(x)=\overline{\Sigma}_{t}\quad\mbox{\rm for some matrices $\Sigma_{t}$ and $\overline{\Sigma}_{t}$.}\quad

P_{s, t} (f) (x) := E (f (X_{s, t} (x))) \mbox and \overline{P}_{s, t} (f) (x) := E (f (\overline{X}_{s, t} (x)))

P_{s, t} (f) (x) := E (f (X_{s, t} (x))) \mbox and \overline{P}_{s, t} (f) (x) := E (f (\overline{X}_{s, t} (x)))

Δ a_{t} := a_{t} - \overline{a}_{t} Δ b_{t} := b_{t} - \overline{b}_{t} \mbox and Δ σ_{t} = σ_{t} - \overline{σ}_{t}

Δ a_{t} := a_{t} - \overline{a}_{t} Δ b_{t} := b_{t} - \overline{b}_{t} \mbox and Δ σ_{t} = σ_{t} - \overline{σ}_{t}

\left\{\begin{array}[]{rcl}\displaystyle Y_{s,t}&=&\displaystyle y+\int_{s}^{t}~{}B_{s,u}~{}du+\int_{s}^{t}~{}\Sigma_{s,u}~{}dW_{u}\\ \displaystyle F_{s,t}(x)&=&\displaystyle F(x)+\int_{s}^{t}~{}G_{u,t}(x)~{}du+\int_{s}^{t}~{}H_{u,t}(x)~{}dW_{u}\end{array}\right.

\left\{\begin{array}[]{rcl}\displaystyle Y_{s,t}&=&\displaystyle y+\int_{s}^{t}~{}B_{s,u}~{}du+\int_{s}^{t}~{}\Sigma_{s,u}~{}dW_{u}\\ \displaystyle F_{s,t}(x)&=&\displaystyle F(x)+\int_{s}^{t}~{}G_{u,t}(x)~{}du+\int_{s}^{t}~{}H_{u,t}(x)~{}dW_{u}\end{array}\right.

\begin{array}[]{l}\displaystyle F_{v,t}(Y_{s,v})-F_{u,t}(Y_{s,u})=\int_{u}^{v}(\nabla F_{r,t}(Y_{s,r})^{\prime}~{}B_{s,r}+\frac{1}{2}~{}\nabla^{2}F_{r,t}(Y_{s,r})^{\prime}~{}\Sigma_{s,r}\Sigma_{s,r}^{\prime}-G_{r,t}(Y_{s,r}))~{}dr\\ \\ \hskip 142.26378pt+\displaystyle\int_{u}^{v}~{}\left(\nabla F_{r,t}(Y_{s,r})^{\prime}~{}\Sigma_{s,r}-H_{r,t}(Y_{s,r})\right)~{}dW_{r}\end{array}

\begin{array}[]{l}\displaystyle F_{v,t}(Y_{s,v})-F_{u,t}(Y_{s,u})=\int_{u}^{v}(\nabla F_{r,t}(Y_{s,r})^{\prime}~{}B_{s,r}+\frac{1}{2}~{}\nabla^{2}F_{r,t}(Y_{s,r})^{\prime}~{}\Sigma_{s,r}\Sigma_{s,r}^{\prime}-G_{r,t}(Y_{s,r}))~{}dr\\ \\ \hskip 142.26378pt+\displaystyle\int_{u}^{v}~{}\left(\nabla F_{r,t}(Y_{s,r})^{\prime}~{}\Sigma_{s,r}-H_{r,t}(Y_{s,r})\right)~{}dW_{r}\end{array}

\begin{array}[]{l}X_{s,t}(x)-X_{s-h,t}(x)=X_{s,t}(x)-(X_{s,t}\circ X_{s-h,s})(x)\\ \\ \simeq X_{s,t}(x)-X_{s,t}\left(x+b_{s}(x)~{}h+\sigma_{s}(x)~{}\left(W_{s}-W_{s-h}\right)\right)\\ \\ \displaystyle\simeq-\left[\left(\nabla X_{s,t}(x)^{\prime}~{}b_{s}(x)+\frac{1}{2}~{}\nabla^{2}X_{s,t}(x)^{\prime}~{}a_{s}(x)\right)~{}h+\nabla X_{s,t}(x)^{\prime}\sigma_{s}(x)~{}(W_{s}-W_{s-h})\right]\end{array}

\begin{array}[]{l}X_{s,t}(x)-X_{s-h,t}(x)=X_{s,t}(x)-(X_{s,t}\circ X_{s-h,s})(x)\\ \\ \simeq X_{s,t}(x)-X_{s,t}\left(x+b_{s}(x)~{}h+\sigma_{s}(x)~{}\left(W_{s}-W_{s-h}\right)\right)\\ \\ \displaystyle\simeq-\left[\left(\nabla X_{s,t}(x)^{\prime}~{}b_{s}(x)+\frac{1}{2}~{}\nabla^{2}X_{s,t}(x)^{\prime}~{}a_{s}(x)\right)~{}h+\nabla X_{s,t}(x)^{\prime}\sigma_{s}(x)~{}(W_{s}-W_{s-h})\right]\end{array}

\begin{array}[]{l}d_{s}X_{s,t}(x)=-\left[\left(\nabla X_{s,t}(x)^{\prime}~{}b_{s}(x)+\frac{1}{2}~{}\nabla^{2}X_{s,t}(x)^{\prime}~{}a_{s}(x)\right)~{}ds+\nabla X_{s,t}(x)^{\prime}\sigma_{s}(x)~{}dW_{s}\right]\end{array}

\begin{array}[]{l}d_{s}X_{s,t}(x)=-\left[\left(\nabla X_{s,t}(x)^{\prime}~{}b_{s}(x)+\frac{1}{2}~{}\nabla^{2}X_{s,t}(x)^{\prime}~{}a_{s}(x)\right)~{}ds+\nabla X_{s,t}(x)^{\prime}\sigma_{s}(x)~{}dW_{s}\right]\end{array}

\begin{array}[]{l}X_{u+h,t}\circ\overline{X}_{s,u+h}-X_{u,t}\circ\overline{X}_{s,u}\\ \\ =(X_{u+h,t}-X_{u,t})\circ\overline{X}_{s,u}+\left(X_{u+h,t}\circ\overline{X}_{s,u+h}-X_{u+h,t}\circ\overline{X}_{s,u}\right)\end{array}

\begin{array}[]{l}X_{u+h,t}\circ\overline{X}_{s,u+h}-X_{u,t}\circ\overline{X}_{s,u}\\ \\ =(X_{u+h,t}-X_{u,t})\circ\overline{X}_{s,u}+\left(X_{u+h,t}\circ\overline{X}_{s,u+h}-X_{u+h,t}\circ\overline{X}_{s,u}\right)\end{array}

\begin{array}[]{l}X_{u+h,t}\left(\,\overline{X}_{s,u}(x)+\left(\overline{X}_{s,u+h}(x)-\overline{X}_{s,u}(x)\right)\right)-X_{u+h,t}(\overline{X}_{s,u}(x))\\ \\ \displaystyle\simeq\left(\nabla X_{u+h,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}(\overline{X}_{s,u+h}(x)-\overline{X}_{s,u}(x))+\frac{1}{2}\,\left(\nabla^{2}X_{u+h,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}\overline{a}_{u}(\overline{X}_{s,u}(x))~{}h\end{array}

\begin{array}[]{l}X_{u+h,t}\left(\,\overline{X}_{s,u}(x)+\left(\overline{X}_{s,u+h}(x)-\overline{X}_{s,u}(x)\right)\right)-X_{u+h,t}(\overline{X}_{s,u}(x))\\ \\ \displaystyle\simeq\left(\nabla X_{u+h,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}(\overline{X}_{s,u+h}(x)-\overline{X}_{s,u}(x))+\frac{1}{2}\,\left(\nabla^{2}X_{u+h,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}\overline{a}_{u}(\overline{X}_{s,u}(x))~{}h\end{array}

\begin{array}[]{l}d_{u}\left(X_{u,t}\circ\overline{X}_{s,u}\right)(x)\\ \\ \displaystyle=\left(d_{u}X_{u,t}\right)(\overline{X}_{s,u}(x))+\left(\nabla X_{u,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}d_{u}\overline{X}_{s,u}(x)+\frac{1}{2}\,\left(\nabla^{2}X_{u,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}\overline{a}_{u}(\overline{X}_{s,u}(x))~{}du\end{array}

\begin{array}[]{l}d_{u}\left(X_{u,t}\circ\overline{X}_{s,u}\right)(x)\\ \\ \displaystyle=\left(d_{u}X_{u,t}\right)(\overline{X}_{s,u}(x))+\left(\nabla X_{u,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}d_{u}\overline{X}_{s,u}(x)+\frac{1}{2}\,\left(\nabla^{2}X_{u,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}\overline{a}_{u}(\overline{X}_{s,u}(x))~{}du\end{array}

X_{s, t} (x) - \overline{X}_{s, t} (x) = T_{s, t} (Δ a, Δ b) (x) + S_{s, t} (Δ σ) (x)

X_{s, t} (x) - \overline{X}_{s, t} (x) = T_{s, t} (Δ a, Δ b) (x) + S_{s, t} (Δ σ) (x)

\begin{array}[]{l}\displaystyle T_{s,t}(\Delta a,\Delta b)(x)\\ \\ \displaystyle:=\int_{s}^{t}\left[\left(\nabla X_{u,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}\Delta b_{u}(\overline{X}_{s,u}(x))+\frac{1}{2}~{}\left(\nabla^{2}X_{u,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}\Delta a_{u}(\overline{X}_{s,u}(x))\right]~{}du\end{array}

\begin{array}[]{l}\displaystyle T_{s,t}(\Delta a,\Delta b)(x)\\ \\ \displaystyle:=\int_{s}^{t}\left[\left(\nabla X_{u,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}\Delta b_{u}(\overline{X}_{s,u}(x))+\frac{1}{2}~{}\left(\nabla^{2}X_{u,t}\right)(\overline{X}_{s,u}(x))^{\prime}~{}\Delta a_{u}(\overline{X}_{s,u}(x))\right]~{}du\end{array}

S_{s, t} (Δ σ) (x) := \int_{s}^{t} (\nabla X_{u, t}) (\overline{X}_{s, u} (x))^{'} Δ σ_{u} (\overline{X}_{s, u} (x)) d W_{u}

S_{s, t} (Δ σ) (x) := \int_{s}^{t} (\nabla X_{u, t}) (\overline{X}_{s, u} (x))^{'} Δ σ_{u} (\overline{X}_{s, u} (x)) d W_{u}

Z^{s, t} : u \in [s, t] \mapsto Z_{u}^{s, t} := X_{u, t} \circ \overline{X}_{s, u} ⟹ Z_{s}^{s, t} - Z_{t}^{s, t} = X_{s, t} - \overline{X}_{s, t}

Z^{s, t} : u \in [s, t] \mapsto Z_{u}^{s, t} := X_{u, t} \circ \overline{X}_{s, u} ⟹ Z_{s}^{s, t} - Z_{t}^{s, t} = X_{s, t} - \overline{X}_{s, t}

(F_{s, t} (x), Y_{s, t} (y))

(F_{s, t} (x), Y_{s, t} (y))

G_{u, t} (x)

\nabla b_{t}+(\nabla b_{t})^{\prime}\leq-2\lambda~{}I\quad\mbox{\rm and}\quad\nabla\overline{b}_{t}+(\nabla\overline{b}_{t})^{\prime}\leq-2\overline{\lambda}~{}I\quad\mbox{\rm for some $\lambda\wedge\overline{\lambda}>0,$}

\nabla b_{t}+(\nabla b_{t})^{\prime}\leq-2\lambda~{}I\quad\mbox{\rm and}\quad\nabla\overline{b}_{t}+(\nabla\overline{b}_{t})^{\prime}\leq-2\overline{\lambda}~{}I\quad\mbox{\rm for some $\lambda\wedge\overline{\lambda}>0,$}

\partial_{t} \nabla X_{s, t} (x) = \nabla X_{s, t} (x) B_{t}^{'}

\partial_{t} \nabla X_{s, t} (x) = \nabla X_{s, t} (x) B_{t}^{'}

∣ ∣ ∣ f (x) ∣ ∣ ∣_{n} := s \geq 0 sup t \geq s sup E (∥ f_{t} (\overline{X}_{s, t} (x)) ∥^{n})^{1/ n}

∣ ∣ ∣ f (x) ∣ ∣ ∣_{n} := s \geq 0 sup t \geq s sup E (∥ f_{t} (\overline{X}_{s, t} (x)) ∥^{n})^{1/ n}

\begin{array}[]{l}\displaystyle\mathbb{E}\left[\|X_{s,t}(x)-\overline{X}_{s,t}(x)\|^{n}\right]^{1/n}\\ \\ \displaystyle\leq\kappa_{\delta,n}~{}\left({\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\Delta a(x)\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{2n/(1+\delta)}+{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\Delta b(x)\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{2n/(1+\delta)}+{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\Delta\sigma(x)\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{2n/\delta}~{}(1\vee\|x\|)\right)\end{array}

\begin{array}[]{l}\displaystyle\mathbb{E}\left[\|X_{s,t}(x)-\overline{X}_{s,t}(x)\|^{n}\right]^{1/n}\\ \\ \displaystyle\leq\kappa_{\delta,n}~{}\left({\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\Delta a(x)\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{2n/(1+\delta)}+{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\Delta b(x)\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{2n/(1+\delta)}+{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\Delta\sigma(x)\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{2n/\delta}~{}(1\vee\|x\|)\right)\end{array}

(\ref T 2 - in t r o) ⟹ \forall n \geq 2 E [∥ X_{s, t} (x) - \overline{X}_{s, t} (x) ∥^{n}]^{1/ n} \leq κ_{n} (∣ ∣ ∣ Δ b (x) ∣ ∣ ∣_{n} + ∥Σ - \overline{Σ} ∥)

(\ref T 2 - in t r o) ⟹ \forall n \geq 2 E [∥ X_{s, t} (x) - \overline{X}_{s, t} (x) ∥^{n}]^{1/ n} \leq κ_{n} (∣ ∣ ∣ Δ b (x) ∣ ∣ ∣_{n} + ∥Σ - \overline{Σ} ∥)

X_{s, t} (x) - \overline{X}_{s, t} (x) = \int_{s}^{t} (\nabla X_{u, t}) (\overline{X}_{s, u} (x))^{'} Δ b_{u} (\overline{X}_{s, u} (x)) d u

X_{s, t} (x) - \overline{X}_{s, t} (x) = \int_{s}^{t} (\nabla X_{u, t}) (\overline{X}_{s, u} (x))^{'} Δ b_{u} (\overline{X}_{s, u} (x)) d u

x sup E (∥ (\nabla X_{u, t}) (x) ∥_{2}^{n})^{1/ n} \leq κ_{n} exp (- λ (n) (t - u)) \mbox forsome λ (n) > 0

x sup E (∥ (\nabla X_{u, t}) (x) ∥_{2}^{n})^{1/ n} \leq κ_{n} exp (- λ (n) (t - u)) \mbox forsome λ (n) > 0

E [∥ X_{s, t} (x) - \overline{X}_{s, t} (x) ∥^{n}]^{1/ n}

E [∥ X_{s, t} (x) - \overline{X}_{s, t} (x) ∥^{n}]^{1/ n}

E [∥ X_{s, t} (x) - \overline{X}_{s, t} (x) ∥^{n}]^{1/ n} \leq κ_{n} \int_{s}^{t} exp (- λ (n) (t - u)) d u s \leq u sup E [∥Δ b_{u} (\overline{X}_{s, u} (x)) ∥^{n}]^{1/ n} .

E [∥ X_{s, t} (x) - \overline{X}_{s, t} (x) ∥^{n}]^{1/ n} \leq κ_{n} \int_{s}^{t} exp (- λ (n) (t - u)) d u s \leq u sup E [∥Δ b_{u} (\overline{X}_{s, u} (x)) ∥^{n}]^{1/ n} .

d X_{s, t}^{h} (x) = Y_{s, t}^{h} (x) d t + Σ d W_{t} \mbox with Y_{s, t}^{h} (x) := b (X_{s, s + k h}^{h} (x))

d X_{s, t}^{h} (x) = Y_{s, t}^{h} (x) d t + Σ d W_{t} \mbox with Y_{s, t}^{h} (x) := b (X_{s, s + k h}^{h} (x))

X_{s, t}^{h} (x) - X_{s, t} (x) = \int_{s}^{t} (\nabla X_{u, t}) (X_{s, u}^{h} (x))^{'} [Y_{s, u}^{h} (x) - b (X_{s, u}^{h} (x))] d u .

X_{s, t}^{h} (x) - X_{s, t} (x) = \int_{s}^{t} (\nabla X_{u, t}) (X_{s, u}^{h} (x))^{'} [Y_{s, u}^{h} (x) - b (X_{s, u}^{h} (x))] d u .

\nabla b + (\nabla b)^{'} \leq - 2 λ I ∥\nabla b ∥ := x sup ∥\nabla b (x) ∥ < \infty \mbox and ⟨ x, b (x)⟩ \leq - β ∥ x ∥^{2}

\nabla b + (\nabla b)^{'} \leq - 2 λ I ∥\nabla b ∥ := x sup ∥\nabla b (x) ∥ < \infty \mbox and ⟨ x, b (x)⟩ \leq - β ∥ x ∥^{2}

E (∥ X_{s, t}^{h} (x) - X_{s, t} (x) ∥^{n})^{1/ n} \leq ∥\nabla b ∥ ([∥ b (0) ∥ + m_{n} (x) ∥\nabla b ∥] h + σ h) / λ

E (∥ X_{s, t}^{h} (x) - X_{s, t} (x) ∥^{n})^{1/ n} \leq ∥\nabla b ∥ ([∥ b (0) ∥ + m_{n} (x) ∥\nabla b ∥] h + σ h) / λ

\forall (i, j) \in [p_{1}] \times [p_{2}] (A B)_{i, j} = k \in [q] \sum A_{i, k} B_{k, j} \mbox and B_{j, k}^{'} := B_{k, j} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Backward Itô-Ventzell and stochastic interpolation formulae

P. Del Moral P. Del Moral was supported in part from the Chair Stress Test, RISK Management and Financial Steering, led by the French Ecole polytechnique and its Foundation and sponsored by BNP Paribas, and by the ANR Quamprocs on quantitative analysis of metastable processes.

Authors declaration of interests: none INRIA, Bordeaux Research Center & CMAP, Polytechnique Palaiseau, France

S. S. Singh

Department of Engineering, University of Cambridge, United Kingdom.

Abstract

We present a novel backward Itô-Ventzell formula and an extension of the Alekseev-Gröbner interpolating formula to stochastic flows. We also present some natural spectral conditions that yield direct and simple proofs of time uniform estimates of the difference between the two stochastic flows when their drift and diffusion functions are not the same, yielding what seems to be the first results of this type for this class of anticipative models. We illustrate the impact of these results in the context of diffusion perturbation theory, interacting diffusions and discrete time approximations.

Keywords : Stochastic flows, variational equations, tangent and Hessian processes, perturbation semigroups, backward Itô-Ventzell formula, Alekseev-Gröbner lemma, Skorohod stochastic integral, two-sided stochastic integration, Malliavin differential, Bismut-Elworthy-Li formulae.

Mathematics Subject Classification : 47D07, 93E15, 60H07.

1 Introduction

Let $b_{t}(x)$ be a vector-valued function from $\mathbb{R}^{d}$ into $\mathbb{R}^{d}$ and $\sigma_{t}(x)=[\sigma_{t,1}(x),\ldots,\sigma_{t,r}(x)]$ be a matrix-valued function from $\mathbb{R}^{d}$ into $\mathbb{R}^{d\times r}$ , for some parameters $d,r\geq 1$ . Both functions will be assumed to be differentiable. Let $W_{t}$ be an $r$ -dimensional Brownian motion and denote by ${\cal W}_{s,t}$ the $\sigma$ -field generated by the increments $(W_{u}-W_{v})$ of the Brownian motion, with $u,v\in[s,t]$ .

For any time horizon $s\geq 0$ we denote by $X_{s,t}(x)$ the stochastic flow defined for any $t\in[s,\infty[$ and any starting point $X_{s,s}(x)=x\in\mathbb{R}^{d}$ by the stochastic differential equation

[TABLE]

We assume that $x\mapsto b_{t}(x)$ and $x\mapsto\sigma_{t}(x)$ have continuous and uniformly bounded derivatives up to the third order. This condition is clearly met for linear Gaussian models as well as for the geometric Brownian motion. This condition ensures that the stochastic flow $x\mapsto X_{s,t}(x)$ is a twice differentiable function of the initialisation $x$ . In addition, all absolute moments of the flow and the ones of its first and second order derivatives exists for any time horizon. As it is well known, dynamical systems and hence stochastic models involving drift functions with quadratic growth require additional regularity conditions to ensure non explosion of the solution in finite time. It is also implicitly assumed that all functions $(b_{t},\sigma_{t})$ are smooth functions w.r.t. the time parameter. The present article develop several constructive and stochastic analysis tools including Bismut-Elworthy-Li formulae, stochastic semigroup perturbation formulae, extended two-sided stochastic integration, Malliavin calculus, gradient and Hessian semigroup processes estimates. We are also looking for useful quantitative and time uniform estimates which are valid under a single set of easily checked conditions that only depend on the parameters of the model. Various techniques presented in the article and many results can be separately and readily extended to more general models with weaker and abstract custom assumptions that depend on the different quantities to handle.

Let $\overline{X}_{s,t}(x)$ be the stochastic flow associated with a stochastic differential equation defined as (1.1) by replacing $(b_{t},\sigma_{t})$ by some drift and diffusion functions $(\overline{b}_{t},\overline{\sigma}_{t})$ with the same regularity properties. Constant diffusion functions $(\sigma_{t},\overline{\sigma}_{t})$ are defined by

[TABLE]

In this context, we will assume that $\Sigma_{t}$ and $\overline{\Sigma}_{t}$ are uniformly bounded w.r.t. the time horizon.

The Markov transition semigroups associated with the flows $X_{s,t}(x)$ and $\overline{X}_{s,t}(x)$ are defined for any measurable function $f$ on $\mathbb{R}^{d}$ by the formula

[TABLE]

In this paper we derive equations for the differences $(X_{s,t}-\overline{X}_{s,t})$ and $(P_{s,t}-\overline{P}_{s,t})$ in terms of the difference of their corresponding drifts and diffusion functions,

[TABLE]

where $a_{t}(x):=\sigma_{t}(x)~{}\sigma_{t}(x)^{\prime}$ and $\overline{a}_{t}(x):=\overline{\sigma}_{t}(x)\,\overline{\sigma}_{t}^{\prime}(x)$ . In some applications the functions $\overline{b}_{t}=b_{t}-\Delta b_{t}$ and $\overline{\sigma}_{t}=\sigma_{t}-\Delta\sigma_{t}$ can be interpreted as a local perturbation of the drift and the diffusion of the stochastic flow ${X}_{s,t}$ .

We also address the problem of finding time-uniform estimates for the difference between the stochastic flows $X_{s,t}$ and $\overline{X}_{s,t}$ and their corresponding Markov transition kernels $P_{s,t}$ and $\overline{P}_{s,t}$ .

These important questions arise in a variety of domains including stochastic perturbation theory as well as in the stability and the qualitative theory of stochastic systems. Classical analytic estimates on the difference between the stochastic flows driven by different drift and diffusion functions are often much too large for most diffusion processes of practical interest. In some instances none of the diffusion flows are stable. In this context, any local perturbation of the stochastic model propagates so that any global error estimate eventually tends to $\infty$ as the time horizon $t\rightarrow\infty$ .

Whenever one of the stochastic flows is stable, classical perturbation bounds combining Lipschitz type inequalities with Gronwall lemma [8, 25] yield exceedingly pessimistic global estimates that grows exponentially fast w.r.t. the time horizon. Notice that an exponential type estimate of the form $e^{\lambda t}$ for some parameter $\lambda>0$ and some time horizon $t$ s.t. $\lambda\,t\geq 199$ would induce an error bound larger than the estimated number $10^{86}$ of elementary particles of matters in the visible universe. As mentioned in [29] in the context of Euler scheme type approximations of deterministic dynamical systems, one may encounter situations where $\lambda=10^{8}$ and $t=10^{2}$ and the resulting exponential bounds are clearly impractical from a numerical perspective.

The statement of the main results of the article are presented in section 1.1:

i.

Section 1.1.1 presents a novel generalized backward Itô-Ventzell formula (cf. theorem 1.1). The Itô-Ventzell is a very important formula, arguably as useful as the Itô’s change of variable, but surprisingly the backward Itô-Ventzell presented in this work has never been studied before. Theorem 1.1 can be seen as a new generalized backward version of the generalized Itô-Ventzell formula presented in [41]. 2. ii.

In section 1.1.2 we apply the backward Itô-Ventzell formula to derive a forward-backward stochastic perturbation formula that expresses the difference between the stochastic flows $X_{s,t}$ and $\overline{X}_{s,t}$ in terms of first and second order derivatives of the flows, which we call the tangent and Hessian processes respectively, with respect to the space parameter (cf. theorem 1.2). 3. iii.

Section 1.1.2 also provides a novel forward-backward Itô type differential formula for interpolating stochastic diffusion flows (cf. the change of variable formula (1.9)). 4. iv.

In the beginning of section 1.1.2 we present a discrete time approach based on the pivotal interpolating telescoping sum formula (4.2). This interpolating stochastic semigroup technique can be seen as an extension to stochastic flows of the stochastic perturbation analysis developed in [22, 18, 20, 21] and in [3, 5, 11] in the context of discrete time models, matrix and nonlinear interacting processes (see also [4, 5]). For a more thorough discussion on these models, we refer to section 1.2. This approach allows to derive a stochastic interpolation formula (1.10) with a fluctuation term (1.12) defined by an extended two-sided stochastic integral. 5. v.

Section 1.1.3 presents some natural spectral conditions on the gradients of $b_{t}(x),\sigma_{t}(x),\overline{b}_{t}(x)$ and $\overline{\sigma}_{t}(x)$ that allows us to derive in a direct way a series of realistic uniform estimates with respect to the time horizon.

The rest of the article is organized as follows:

Section 3 provides some basic tools associated with the first and second variational equations associated with a diffusion flow. We also present some quantitative estimates of the tangent and the Hessian processes. For a more thorough discussion on stochastic flows and their differentiability properties we refer to [14, 32, 40].

Section 4 is mainly concerned with the forward-backward stochastic interpolation formula (1.10) stated in theorem 1.2. Two approaches are presented: The first one discussed in section 4.1 is based on an extension of the two-sided stochastic calculus introduced by Pardoux and Protter in [43] to stochastic interpolation flows. The second one discussed in section 4.2 is based on the generalized backward Itô-Ventzell formula. This section also discusses a multivariate Skorohod-Alekseev-Gröbner formula. Apart from more complex and sophisticated tensor notation, the quantitative stochastic analysis of these multivariate formulae follows the same arguments as the ones used in the proof of theorem 1.3. Thus, we have chosen to concentrate this introduction on stochastic flows.

Some extensions of the stochastic interpolation formula (1.10) are discussed in section 4.4.

Section 5 is dedicated to the analysis of the Skorohod fluctuation process introduced in (1.12).

Section 6 is dedicated to the analysis of an extended version of two-sided stochastic integrals and a generalized backward Itô-Ventzell formula.

Section 7 presents some illustrations of the forward-backward interpolation formulae discussed in the present article in the context of diffusion perturbation theory, interacting diffusions and discrete time approximations.

The technical proofs of some results are housed in the appendix.

1.1 Statement of some main results

1.1.1 A backward Itô-Ventzell formula

We represent the gradient of a real valued function of several variables as a column vector while the gradient and the Hessian of a (column) vector valued function as tensors of type $(1,1)$ and $(2,1)$ , see for instance (2.2) and (2.3); in more layman terms a $(1,1)$ tensor is a matrix while the $(2,1)$ tensor can be visualized as a “row of matrices” $[A_{1},\ldots,A_{n}]$ where the entries $A_{i}$ are matrices of a common dimension. We also use the tensor product and the transpose operator defined in (2.1), see also (2.4).

We denote by $D_{t}$ the Malliavin derivative from some dense domain $\mathbb{D}_{2,1}\subset\mathbb{L}_{2}(\Omega)$ into the space $\mathbb{L}_{2}(\Omega\times\mathbb{R}_{+};\mathbb{R}^{r})$ . For multivariate $d$ -column vector random variables $F$ with entries $F^{j}$ , we use the same rules as for the gradient and $D_{t}F$ is the $(r,p)$ -matrix with entries $(D_{t}F)_{i,j}:=D_{t}^{i}F^{j}$ . For $(p\times q)$ -matrices $F$ with entries $F^{j}_{k}$ we let $D_{t}F$ be the tensor with entries $(D_{t}F)_{i,j,k}=D^{i}_{t}F^{j}_{k}$ .

For a more thorough discussion on Malliavin derivatives and Skorohod integration we refer to section 2.3.

Let $F$ be some function from $\mathbb{R}^{p}$ into $\mathbb{R}^{q}$ , and let $y\in\mathbb{R}^{p}$ be some given state, for some $p,q\geq 1$ . Suppose we are given a forward $p$ -dimensional continuous semi-martingale $Y_{s,t}$ and a backward random field $F_{s,t}$ from $\mathbb{R}^{p}$ into $\mathbb{R}^{q}$ with a column-vector type canonical representation of the following form:

[TABLE]

for some ${\cal W}_{s,t}$ -adapted functions $B_{s,t},G_{s,t},H_{s,t},\Sigma_{s,t}$ with appropriate dimensions and satisfying the following conditions:

$(H_{1})$ : The functions $F_{s,t}$ , $G_{u,t}$ and $H_{u,t}$ as well as $\nabla H_{u,t}$ , $\nabla^{2}F_{u,t}$ and the derivatives $D_{v}\nabla F_{u,t}$ and $D_{v}H_{u,t}$ are continuous w.r.t. the state and the time variables for any given $\omega\in\Omega$ .

$(H_{2})$ * The function $G_{u,t},\nabla H_{u,t},\nabla^{2}F_{u,t}$ , and the derivatives $D_{v}H_{u,t},D_{v}\nabla F_{u,t}$ have at most polynomial growth w.r.t. the state variable, uniformly with respect to $\omega\in\Omega$ .*

$(H_{3})$ * The processes $B_{s,u},\Sigma_{s,u}$ as well as $D_{v}\Sigma_{s,u}$ are continuous and have moments of any order.

In this notation, the first main result of this article is the following theorem.

Theorem 1.1.

Assume conditions $(H_{i})_{i=1,2,3}$ are satisfied. In this situation, for any $s\leq u\leq v\leq t$ we have the generalized backward Itô-Ventzell formula

[TABLE]

The stochastic anticipating integral in the r.h.s. of 1.5 is understood as a Skorohod stochastic integral.

The above theorem can be seen as the backward version of the generalized Itô-Ventzell formula presented in [41, 42]. The proof of the above theorem is provided in section 6.2 (see theorem 6.3).

Conventional forward and backward Itô stochastic integrals are particular instances of the two-sided stochastic integrals introduced by Pardoux and Protter in [43]. The terminology " two-sided " coined by the authors in [43] comes from the fact that the integrand of the Skorohod integral depend on the past as well as on the future of the history generated by the Brownian motion.

The stochastic anticipating integral in the r.h.s. of (1.5) involves a backward random field and a forward semimartingale, thus it is tempting to interpret this integral as a two sided integral. Unfortunately, this class of integrands are not considered in the construction of the two-sided stochastic integrals defined in [43]. In section 4.1 and section 6.1 we shall present an extended version of the two-sided stochastic integrals introduced in [43] that applies to integrands defined as a compositions of backward and forward stochastic flows. This extended version applies to backward stochastic flows but it doesn’t encapsulate more general backward random fields. We believe more general extensions of the two-sided integrals can be developed but it is out of the scope of this article to develop a theory on generalized two-sided stochastic integrals. We finally mention that all two-sided stochastic integrals discussed in this article are particular instances of Skorohod integrals

1.1.2 A stochastic flow interpolation formula

The diffusion flow (1.1) is defined in term of a column vector with twice continuously differentiable entries. For $h\simeq 0$ we use the backward approximation:

[TABLE]

In the above display, $X_{s,t}\circ X_{s-h,s}$ stands for the composition of the mappings $X_{s,t}$ and $X_{s-h,s}$ .

The above approximations are rigorously justified in section 4.1 and lead to the backward stochastic flow evolution equation:

[TABLE]

In the above display, $d_{s}X^{i}_{s,t}(x)$ represents the change in $X^{i}_{s,t}(x)$ w.r.t. the variable $s$ .

In the same vein, for any $s<u<t$ we have the interpolating semigroup decompositions

[TABLE]

as well as the forward approximations

[TABLE]

The above approximations are rigorously justified in section 4.1 and lead to the forward-backward stochastic interpolation equation

[TABLE]

The discrete time version of the forward-backward stochastic formula in the above display reduces to the telescoping sum formula (4.2) and the second order Taylor expansions discussed in section 4.1. We already mention that (4.2) can be interpreted as a discrete time version of the Alekseev-Gröbner lemma [1, 24]. The terminology forward-backward comes from the forward and backward nature of (1.9) and the telescoping sum formula (4.2).

Also notice that (1.7) can also be deduced formally from (1.9) by replacing $\overline{X}_{s,u}$ by the stochastic flow ${X}_{s,u}$ in (1.9), and then letting $s=u$ .

This yields the following interpolation theorem.

Theorem 1.2.

We have the forward-backward stochastic interpolation formula

[TABLE]

with the stochastic process

[TABLE]

and the fluctuation term given by the Skorohod stochastic integral

[TABLE]

The fluctuation term in the above display can also be seen as the extended two-sided stochastic integral defined in (4.3) (see also proposition 6.2).

These interpolation formulae combine the backward evolution (1.7) with the conventional forward evolution of the perturbed flow.

The proof of the interpolation formula (1.10) is provided in section 4.

We will present two different approaches: The first one presented in section 4.1 is rather elementary and very intuitive. It combines the conventional Itô-type discrete time approximations of stochastic integrals discussed above with the two-sided stochastic integration calculus introduced in [43]. Using this approximation technique the fluctuation term is defined by the extended two-sided stochastic integral defined in (4.3). In this interpretation, the equation (1.10) can be seen as an extended version of the Itô-type change rule formula stated in theorem 6.1 in the article [43] to the interpolating flow

[TABLE]

Roughly speaking, the increments of the interpolating path are decomposed into two parts:

One comes from the backward increments of the flow $u\mapsto X_{u,t}$ given the past values of the stochastic flow $\overline{X}_{s,u}$ . The other one comes from the conventional Itô increments of $u\mapsto\overline{X}_{s,u}$ given the future values of the stochastic flow $X_{u,t}$ .

The second approach discussed in section 4.2 is based on the generalized backward Itô-Ventzell formula stated in theorem 1.1. More precisely we also recover (1.10) from (1.5) by choosing

[TABLE]

and letting $(u,v)=(s,t)$ in (1.5). The regularity conditions on the drift and the diffusion function ensure that conditions $(H_{i})_{i}$ with $i=1,2,3$ stated in section 1.1.1 are satisfied.

We emphasize that the backward diffusion flow discussed in (1.7) and (4.1) is essential to apply theorem 1.1. Section 4.2 also provides a multivariate version of (1.10).

The interpolation formula (1.10) with a fluctuation term given by the Skorohod stochastic integral (1.12) can be seen as a Alekseev-Gröbner formula of Skorohod type.

In this context, the integrability of the fluctuation term and any quantitative type estimates require a refined analysis of the Malliavin derivatives of the integrand. Under our regularity conditions the stochastic flows $X_{s,t}(x)$ and $\overline{X}_{s,t}(x)$ are Holder-continuous w.r.t. the time parameters as well as twice differentiable w.r.t. the space variables, with almost sure uniformly bounded first and second order derivatives. In addition, for any $n\geq 1$ all the $n$ -absolute moments of the stochastic flows are finite with at most linear growth w.r.t. the initial values. These properties ensure that the Skorohod stochastic integral (1.12) is well defined and they allow to derive several quantitative estimates. Section 5 provides a refined of the fluctuation term; see for instance theorem 5.2.

When $\sigma_{t}=0$ the flow $X_{s,t}(x)$ is deterministic so that the Skorohod fluctuation term (1.12) reduces to the traditional Itô stochastic integral. In this context, quantitative estimates of the fluctuation term are obtained combining Burkholder-Davis-Gundy inequalities with the generalized Minkowski inequality. The resulting interpolation formula (1.10) can be seen as a Alekseev-Gröbner formula of Itô-type.

To distinguish these two classes of models, the interpolation formulae (1.10) associated with the case $\sigma_{t}=0$ will be called an Itô-Alekseev-Gröbner formula; the one associated with the case $\Delta\sigma_{t}\not=0$ will be called a Skorohod-Alekseev-Gröbner formula.

1.1.3 Uniform estimates w.r.t. the time horizon

The final objective of this article is to derive uniform estimates w.r.t. the time parameter. Our methodology is mainly based on two different types of regularity conditions to be defined and discussed in detail in section 2.2:

$\bullet$ The first is a technical condition that ensures that the $n$ -absolute moments of the flows $X_{s,t}$ and $\overline{X}_{s,t}$ are uniformly bounded w.r.t. the time horizon; we call this condition $(M)_{n}$ .

$\bullet$ The second is a spectral condition on the gradient of the drift and diffusion matrices of the stochastic flows, which we call condition $(T)_{n}$ . Without going into details, we state one usual case of interest: for constant diffusion functions (1.2) the spectral condition $(T)_{n}$ is met for any $n\geq 2$ as soon as the following log-norm conditions are met

[TABLE]

To motivate the above condition consider a linear drift function of the form $b_{t}(x)=B_{t}~{}x$ and $\sigma=0$ . In this case the tangent process $\nabla X_{s,t}(x)$ satisfies a time-varying deterministic linear dynamical system

[TABLE]

The asymptotic behavior of this process cannot be characterized by the statistical properties of the spectral abscissa of the matrices $B_{t}$ . Indeed, unstable semigroups associated with time-varying (deterministic) matrices $B_{t}$ with negative eigenvalues are exemplified in [15, 49]. Conversely, stable semigroups with $B_{t}$ having positive eigenvalues are given by Wu in [49]. In contrast, the uniform log-norm condition (1.14) provides a readily verifiable condition.

To describe with some precision the second main result of the article, we need to introduce some additional terminology. When there is no ambiguity, we denote by $\|\mbox{\LARGE.}\|$ any (equivalent) norm on some finite dimensional vector space. For some multivariate function $f_{t}(x)$ , for $(t,x)\in[0,\infty)\times\mathbb{R}^{d}$ , let $\|f(x)\|:=\sup_{t}\|f_{t}(x)\|$ and the uniform norm be $\|f\|:=\sup_{t,x}\|f_{t}(x)\|$ . For any $n\geq 1$ we also set

[TABLE]

We denote by $\kappa_{n}$ and $\kappa_{\delta,n}$ some constants that depend on some parameters $n$ and $(\delta,n)$ but do not depend on the time horizon, nor on the space variable.

In this notation, the second main result of the article takes basically the following form.

Theorem 1.3.

Assume conditions $(M)_{2n/\delta}$ and $(T)_{2n/(1-\delta)}$ are satisfied for some parameters $n\geq 2$ and $\delta\in]0,1[$ . In this situation, we have the time-uniform estimates

[TABLE]

For constant diffusion functions (1.2), the estimate simplifies to

[TABLE]

The estimates (1.16) come from (7.5) and (5.9). A more detailed proof is provided in the appendix, on page Proof of (1.16). The estimates (1.17) are direct consequences of (2.17) and (5.11).

When $\sigma_{t}=\overline{\sigma}_{t}$ the Skorohod term is indeed absent and (1.10) reduces to

[TABLE]

We recover the interpolation formula for nonlinear stochastic flows presented in section 3.1 in the article [3]. In this context the analysis of $\mathbb{L}_{n}$ -errors will proceed via two-step procedure. In section 3.1 we will derive the exponential bound

[TABLE]

Using the Minkowski integral inequality in (1.18) yields

[TABLE]

A further conditioning argument and the above exponential bound on the tangent process yields

[TABLE]

Replacing the term outside the time integral with ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\Delta b(x)\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{n}$ yields the stated result in (1.16) excluding the terms representing the difference in the diffusions.

We illustrate one use of theorem 1.2 in the context of analyzing the error in discretising the diffusion $X_{s,t}(x)$ for some initial time point $s\geq 0$ . Let $h>0$ denote the discretisation interval size and for any $t\in[s+kh,s+(k+1)h[$ let

[TABLE]

for a fixed diffusion matrix $\sigma_{t}(x)=\Sigma$ . Here $X_{s,t}^{h}(x)$ is the discretisation of $X_{s,t}(x)$ with resolution $h$ . Note that that the drift at time $t$ is not a function of the instantaneous value of $X_{s,t}^{h}(x)$ , at time $t$ , but rather the value it took at the largest discrete time-point before $t$ . In section 4.4 we discuss how the formula in (1.10) also applies in this context and establish that

[TABLE]

This comparison result when combined with the regularity assumptions (1.19) yields the moment bound below.

Proposition 1.4.

Assume that

[TABLE]

for some $\lambda>0$ , $\beta>0.$ In this situation, for any $n\geq 1$ we have the uniform estimates

[TABLE]

where $\widehat{m}_{n}(x)\leq\kappa_{n}~{}(1+\|x\|)$ .

Proposition 1.4 is proved in section 7.3. To apply proposition 1.4 to a Langevin diffusion with a convex potential $U(x)$ , the drift would be $b_{t}(x)=-\nabla U(x)$ and the corresponding assumptions on $U(x)$ are typical.

1.2 Comments and comparisons with existing literature

The interpolation formula (1.10) can be interpreted as an extension of Alekseev-Gröbner lemma [1, 24, 30] as well as an extended version of the variation-of-constant and related Gronwall type lemma [8, 25] to diffusion processes. In this connection we underline that the forward-backward formula (1.10) differs from the stochastic Gronwall lemma presented in [45] based on particular classes of stochastic linear inequalities that doesn’t involve Skorohod type integrals.

The forward-backward interpolation formula (1.10) can also be seen as an extension of theorem 6.1 in [43] on two-sided stochastic integrals to diffusion flows. This interpolation formula can also be interpreted as a backward version of the generalized Itô-Ventzell formula presented in [41] (see also theorem 3.2.11 in [37]).

Stochastic interpolation formulae of the form (1.10) and their discrete time version discussed in (4.2) are not really new. To describe their origins, it is worth to mention that the stochastic perturbations may come from auxiliary random sources, uncertainty propagations, as well as time discretization schemes and mean field type particle fluctuations.

The pivotal interpolating telescoping sum formula (4.2) and the second order forward-backward perturbation semigroup methodology discussed in the present article can also be found in chapter 7 in [18] for discrete time models as well as in the series of articles [20, 21, 22] published at the beginning of the 2000s, see also chapter 10 in [19]. In this context, the random perturbations come from the fluctuations of a genetic type particle interpretation of nonlinear Feynman-Kac semigroups.

The more recent articles [9, 10, 11] also provide a series of backward-forward interpolation formulae of the same form as (1.10) for stochastic matrix Riccati diffusion flows arising in data assimilation theory (cf. for instance theorem 1.3 in [11] as well as section 2.2 in [10] and the proof of theorem 2.3 in [9]). In this context, the random perturbations come from the fluctuations of a mean field particle interpretation of a class of nonlinear diffusions equipped with an interacting sample covariance matrix functional.

We underline that the Itô-Alekseev-Gröbner formula (4.6) discussed in [11] is an extension of the interpolation formula (1.10) to stochastic diffusion flows in matrix spaces. In this context the unperturbed model is given by the flow of a deterministic matrix Riccati differential equation and the random perturbations are described by matrix-valued diffusion martingales. The corresponding Itô-Alekseev-Gröbner formulae can be seen as a matrix version of theorem 1.2 in the present article when $\sigma=0$ . These stochastic interpolation formulae were used in [11] to quantify the fluctuation of the stochastic flow around the limiting deterministic Riccati equation, at any order. We will briefly discuss the analog of these Taylor type expansions in section 7.1 in the context of Euclidian diffusions.

The forward-backward perturbation methodology discussed in the present article has also been used in [3, 5] in the context of nonlinear diffusions and their mean field type interacting particle interpretations, see for instance section 2.3 in [5]. In this context, the random perturbations come from the fluctuations of a mean field particle interpretation of a class of nonlinear diffusions. The extended version of the Itô-Alekseev-Gröbner formula (1.18) to nonlinear diffusions is also discussed in section 3.1 in the article [3]. In this situation, the time varying drift and diffusion functions of the stochastic flows depend on some possibly different nonlinear measure valued semigroups which may start from two possibly different initial distributions. For a more thorough discussion on this class of nonlinear diffusions, we refer to the Itô-Alekseev-Gröbner formula (3.2) and corollary 3.2 in the article [3]. These Itô-Alekseev-Gröbner formulae correspond to theorem 1.2 in the present article when $\sigma=0$ .

The interpolating stochastic semigroup techniques discussed in the present article are also applied to mean field particle systems and deterministic nonlinear measure valued semigroups. In this context, the process $X_{s,t}$ is given a deterministic measure-valued process and $\overline{X}_{s,t}$ represents the evolution of the particle density profiles associated with an approximating mean field particle interpretation of $X_{s,t}$ . For instance, the article [4] is concerned with interacting jumps models on path spaces, the second article [5] discusses the propagation of chaos properties of mean field type interacting diffusions. The stochastic interpolation formulae discussed in [4, 5] correspond to the case (1.10) with $\sigma=0$ and or $\overline{\sigma}\not=\sigma$ (see for instance the interpolation formula (3.5), theorem 2.6, theorem 2.7 and the interpolating telescoping sum in section 1.2 in [5])

In the series of articles discussed above, as in (1.9) the central common idea is to analyse the evolution of the interpolating process (1.13) between a given process $X_{s,t}$ and some stochastic flow $\overline{X}_{s,t}$ with an extra level of randomness. In discrete time settings, the differential interpolation formula (1.9) can also recasted in terms of a telescoping sum of the same form as (4.2) combined with a second order Taylor expansion reflecting the differences between a stochastic semigroup and its perturbations, see for instance chapter 7 in [18].

In most of the application domains discussed above, this second order stochastic perturbation methodology has been developed to quantify uniformly w.r.t. the time horizon the propagations of some stochastic perturbations entering in some deterministic and stable reference or unperturbed process. In the context of Euclidian diffusions, this corresponds to the situation where the diffusion function $\sigma=0$ (the case $\overline{\sigma}=0$ can be treated by symmetry arguments). The Itô-Alekseev-Gröbner type formulae discussed in section 3.1 in the article [3] correspond to theorem 1.2 in the present article when $\sigma=\overline{\sigma}$ .

The present article can be seen as a natural extension of the second order perturbation methodology developed in the above referenced articles to diffusion type perturbed processes when $\sigma\not=\overline{\sigma}$ .

To the best of our knowledge, the first article considering the case $\sigma\not=\overline{\sigma}$ with $\sigma\not=0$ and $\overline{\sigma}\not=0$ is the independent work of Hudde-Hutzenthaler-Jentzen-Mazzonetto [27]. In this article, the authors discuss an Itô-Alekseev-Gröbner formula for abstract diffusion perturbation models of the form (4.11). Here again, as in the list of referenced articles discussed above, the common central idea is to use discrete time approximations and combine the pivotal interpolating telescoping sum formulae (4.2) with a second order Taylor expansion. Besides this fact and in contrast with our analysis, the fluctuation term (1.12) discussed in [27] cannot be interpreted in terms of the extended two-sided stochastic integral defined in (4.3) (see also proposition 6.2) but only in terms of a Skorohod stochastic integral. The study [27] is also based on a series of particularly chosen and custom regularity conditions. For instance, the authors assume that the abstract diffusion perturbation models are chosen so that the Skorohod fluctuation term exists without providing any quantitative type estimate. This work is also not connected to the two-sided stochastic integration calculus developed by Pardoux and Protter in [43] nor to any type of backward Itô-Ventzell formula.

We feel that our approach is more direct and intuitive as it relies on an extended version of Itô’s change rule formula (1.9) to interpolating stochastic flows. It also allows to interpret the fluctuation term (1.12) as an extended two-sided stochastic integral.

In section 5 in the present article, we will also see that any quantitative analysis requires to estimate the absolute moments of the Malliavin derivatives of the stochastic integrands of the Brownian motion arising in the Skorohod fluctuation term. In our framework, these Malliavin derivatives depend on the gradient of both of the diffusion functions $(\sigma,\overline{\sigma})$ as well as on the tangent process of the perturbed diffusion flow. The quantitative analysis developed in 5 can be extended without difficulties to abstract diffusion perturbation models satisfying appropriate differentiability and integrability conditions.

The article [27] also presents an application to tamed Euler type discrete time approximations of a stochastic van-der-Pol process introduced in [47], simplifying the analysis provided in an earlier work [28]. In this situation, we underline that the Skorohod fluctuation term is null so that the resulting Alekseev-Gröbner type formula resumes to the simple and elementary case discussed in (1.18) and in the article [3]. As expected for this class of "unstable processes", the authors recast a series of $\mathbb{L}_{2}$ -estimates discussed in [28] into a series of estimates that grow exponentially fast with respect to the time horizon.

In contrast with the present work, the above article doesn’t discuss any quantitative uniform estimates w.r.t. the time horizon. The analysis presented in [27] is mainly concerned with the proof of a Skorohod-Alekseev-Gröbner type formula for abstract diffusion perturbation models and it doesn’t apply to derive any type of estimates to general diffusion perturbation models without adding regularity conditions.

Besides its elegance the forward-backward interpolation formula (1.10) is clearly of rather poor mathematical and numerical interest without a better understanding of the variational processes and the Skorohod fluctuation term (1.12). A crucial problem is to avoid exceedingly pessimistic exponential estimates that grow exponentially fast w.r.t. the time horizon.

One advantage of the second order perturbation methodology developed in the present article is that it takes advantage of the stability properties of the tangent and the Hessian flow in the estimation of Skorohod fluctuation term and this sharpen analysis of the difference between stochastic flows. Our main contribution is to develop a refined analysis of these variational processes and the Skorohod fluctuation terms. We also deduce several uniform perturbation propagation estimates with respect to the time horizon, yielding what seems to be the first results of this type for this class of models.

The forward-backward stochastic interpolation formula (1.10) can also be extended to more general classes of stochastic flows on abstract state spaces. For instance the recent article [30] provides a deterministic first order version of (1.10) on abstract Banach spaces. The stochastic perturbation analysis developed in the series of articles [4, 5, 9, 10, 11, 20, 21, 22] and the books [18, 19] is applied to matrix-valued diffusions and measure valued processes, including mean field type interacting diffusions and Feynman-Kac type interacting jumps models.

The stability properties of these abstract models discussed above depend on the problem at hand. To focus on the main ideas without clouding the article with unnecessary technical details and sophisticated mathematical tools based on abstract ad hoc regularity conditions we have chosen to concentrate the article on diffusion flows on Euclidian spaces with simple and easily checked regularity conditions.

2 Preliminary results

2.1 Some basic notation

With a slight abuse of notation, we denote by $I$ the identity $(d\times d)$ -matrix, for any $d\geq 1$ . We also denote by $\|\mbox{\LARGE.}\|$ any (equivalent) norm on a finite dimensional vector space over $\mathbb{R}$ . All vectors are column vectors by default.

We introduce some matrix notation needed from the onset.

We denote by $\mbox{\rm Tr}(A)$ , $\|A\|_{2}:=\lambda_{\tiny max}(AA^{\prime})^{1/2}=\lambda_{\tiny max}(A^{\prime}A)^{1/2}$ , resp. $\|A\|_{F}=\mbox{\rm Tr}(AA^{\prime})^{1/2}$ and $\rho(A)=\lambda_{\tiny max}((A+A^{\prime})/2)$ the trace, the spectral norm, the Frobenius norm, and the logarithmic norm of some matrix $A$ . $A^{\prime}$ is the transpose of $A$ and $\lambda_{\tiny max}(\mbox{\LARGE.})$ the largest eigenvalue. The spectral norm is sub-multiplicative or $\|AB\|_{2}\leq\|A\|_{2}\|B\|_{2}$ and compatible with the Euclidean norm for vectors, by that we mean for a vector $x$ we have $\|Ax\|\leq\|A\|_{2}\|x\|$ .

Let $[n]$ be the set of $n$ multiple indexes $i=(i_{1},\ldots,i_{n})\in{\cal I}^{n}$ over some finite set ${\cal I}$ . We denote by $(A_{i,j})_{(i,j)\in[p]\times[q]}$ the entries of a $(p,q)$ -tensor $A$ with index set ${\cal I}$ for $[p]$ and $\mathcal{J}$ for $[q]$ . For the sake of brevity, the index sets will be implicitly defined through the context.

For a given $(p_{1},q)$ -tensor $A$ and a given $(q,p_{2})$ tensor $B$ , $AB$ and $B^{\prime}$ is a $(p_{1},p_{2})$ -tensor resp. a $(p_{2},q)$ -tensor with entries given by

[TABLE]

The symmetric part $A_{\tiny sym}$ of a $(p,p)$ -tensor is the $(p,p)$ -tensor $A_{\tiny sym}$ with entries

[TABLE]

We consider the Frobenius inner product given for any $(p,q)$ -tensors $A$ and $B$ by

[TABLE]

For any $(p,q)$ -tensors $A$ and $B$ we also check the Cauchy-Schwartz inequality

[TABLE]

For any tensors $A,B$ with appropriate dimensions we have the inequality

[TABLE]

Given some tensor valued function $T:(t,x)\mapsto T_{t}(x)$ we also set

[TABLE]

Given some smooth function $h(x)$ from $\mathbb{R}^{p}$ into $\mathbb{R}^{q}$ we denote by

[TABLE]

the gradient $(p,q)$ -matrix associated with the column vector-valued function $h=(h^{i})_{1\leq i\leq q}$ . Building on this notation: let $b:\mathbb{R}^{n}\rightarrow\mathbb{R}^{p}$ and let the mapping $x\rightarrow G(x)=h(b(x))$ . Then $\nabla G(x)=\nabla b(x)\times\nabla h(b(x))$ . Let

[TABLE]

The Hessian $H=\nabla^{2}h$ associated with the function $h=(h^{i})_{1\leq i\leq q}$ is a $(2,1)$ -tensor where $H_{(i,j),k}=(\nabla^{2}h^{k})_{i,j}=\partial_{x_{i},x_{j}}h^{k}$ . In this notation we can compactly represent the second order term of the Taylor expansion of the the vector valued function $h$ . For a vector $y=(y_{1},\ldots,y_{p})^{\prime}$

[TABLE]

where we have regarded the matrix $yy^{\prime}$ as the $(2,1)$ -tensor $Y$ with $Y_{(i,j),1}=y_{i}y_{j}$ .

In the same vein, in terms of the tensor product (2.1), for any pair of column vector-valued function $h=(h^{k})_{1\leq k\leq q}$ and $b=(b^{i})_{1\leq i\leq p}$ and any matrix function $a=(a^{i,j})_{1\leq i,j\leq p}$ from $\mathbb{R}^{p}$ into $\mathbb{R}^{q}$ , for any parameter $1\leq k\leq q$ we also have

[TABLE]

In a more compact form, the above formula takes the form

[TABLE]

For any $n\geq 1$ we let ${\cal P}_{n}(\mathbb{R}^{d})$ be the convex set of probability measures $\mu_{1},\mu_{2}$ on $\mathbb{R}^{d}$ with absolute $n$ -th moment and equipped with the Wasserstein distance of order $n$ denoted by

[TABLE]

In the above display the infimum is taken over all pair or random variables $(X_{1},X_{2})$ with marginal distributions $(\mu_{1},\mu_{2})$ . The stochastic transition semigroups associated with the flows $X_{s,t}(x)$ and $\overline{X}_{s,t}(x)$ are defined for any measurable function $f$ on $\mathbb{R}^{d}$ by the formulae

[TABLE]

Given some column vector-valued function $f=(f^{i})_{1\leq i\leq p}$ , let $\mathbb{P}_{s,t}(f)$ and $P_{s,t}(f)$ denote the column vector-valued functions with entries $\mathbb{P}_{s,t}(f^{i})$ and $P_{s,t}(f^{i})$ . Building on the tensor notation, let $\mathbb{P}_{s,t}(\nabla f)$ and $\mathbb{P}_{s,t}(\nabla^{2}f)$ respectively denote the $(1,1)$ and $(2,1)$ -tensor valued functions with entries

[TABLE]

We also consider the random $(2,1)$ and $(2,2)$ -tensors given by

[TABLE]

Throughout the rest of the article, unless otherwise stated $\kappa,\kappa_{\epsilon},\kappa_{n},\kappa_{n,\epsilon}$ denote constants whose values may vary from line to line but only depend on the parameters in their subscripts, i.e. $n\geq 0$ and $\epsilon>0$ , as well as on the parameters of the model; that is, on the drift and diffusion functions. We also use the letters $c,c_{\epsilon},c_{n},c_{n,\epsilon}$ to denote universal constants. Importantly these contants do not depend on the time horizon. We also consider the uniform log-norm parameters

[TABLE]

and the parameters ${\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}(b,\sigma)$ defined by

[TABLE]

2.2 Regularity conditions and some preliminary results

We consider two different types of regularity conditions ( ${\cal M}$ )n and $({\cal T})_{n}$ , indexed by some parameter $n\in[2,\infty[$ , for the diffusion $(b_{t},\sigma_{t})$ .

$({\cal M})_{n}$

There exists some parameter $\kappa_{n}\geq 0$ such that for any $x\in\mathbb{R}^{d}$ we have

[TABLE]

$({\cal T})_{n}$

There exists some parameter $\lambda_{A}>0$ such that

[TABLE]

where $\sigma_{k,t}$ denotes the $k$ -th column of $\sigma_{t}.$ In addition, the following condition is satisfied

[TABLE]

We now define the corresponding assumptions for the diffusion $(\overline{b}_{t},\overline{\sigma}_{t})$ .

$(\overline{{\cal M}})_{n}$

The regularity condition defined as in $({\cal M})_{n}$ for the diffusion $(\overline{b}_{t},\overline{\sigma}_{t})$ .

$(\overline{{\cal T}})_{n}$

Let $\overline{A}_{t}$ be the symmetric matrix defined as $A_{t}$ in (2.7) when $({b}_{t},{\sigma}_{t})=(\overline{b}_{t},\overline{\sigma}_{t})$ . Assume there exists some $\lambda_{\overline{A}}>0$ such that $\overline{A}_{t}\leq-2\lambda_{\overline{A}}~{}I$ . Furthermore, assume $\lambda_{\overline{A}}(n)>0$ where $\lambda_{\overline{A}}(n)$ is defined as $\lambda_{A}(n)$ when $(\lambda_{A},{\sigma}_{t})=(\lambda_{\overline{A}},\overline{\sigma}_{t})$ .

$(M)_{n}$

We write $(M)_{n}$ when both conditions $({{\cal M}})_{n}$ and $(\overline{{\cal M}})_{n}$ are satisfied.

$(T)_{n}$

Both conditions $({{\cal T}})_{n}$ and $(\overline{{\cal T}})_{n}$ are met, and let

[TABLE]

In practice, the uniform moment condition $({\cal M})_{n}$ is often checked using Lyapunov techniques. For example we can use the following polynomial growth condition.

$({\cal P})_{n}$

There exists some parameters $\alpha_{i},\beta_{i}\geq 0$ with $i=0,1,2$ such that for any $t\geq 0$ and any $x\in\mathbb{R}^{d}$ we have

[TABLE]

for some norm $\|\sigma_{t}(x)\|$ of the matrix-valued diffusion function. In addition, we have

[TABLE]

Lemma 2.1.

For any $n\geq 2$ we have

[TABLE]

The proof of the above assertion follows standard stochastic calculations, thus it is housed in the appendix, on page Proof of (2.10).

For one-dimensional geometric Brownian motions the condition $({\cal P})_{n}$ is a sufficient and necessary condition for the existence of uniformly bounded absolute $n$ -moments. In this case $({\cal T})_{n}$ coincides with $({\cal P})_{n}$ by setting

[TABLE]

Whenever condition $(M)_{n}$ is met for some $n\geq 2$ , we also check the uniform estimates

[TABLE]

with the same parameter $\kappa_{n}$ as the one associated with the condition $(M)_{n}$ .

Recalling that the functions $(b_{t},\overline{b}_{t})$ and $(\sigma_{t},\overline{\sigma}_{t})$ have at most linear growth, with the $\mathbb{L}_{n}$ -norms ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\mbox{\LARGE.}\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{n}$ introduced in (1.15) we also have that

[TABLE]

To give more insight where these assumptions will be used, we now briefly state the stability results that stem from them. Condition $({\cal T})_{n}$ ensures that the exponential decays of the absolute and uniform $n$ -moments of the tangent and the Hessian processes; that is, when $({\cal T})_{n}$ is met for some $n\geq 2$ we have that

[TABLE]

A more precise statement is provided in proposition 3.2 and proposition 3.10. These uniform estimates clearly imply, via a conditioning argument, that for any $n\geq 2$ and $s\leq u\leq t$ we have

[TABLE]

with the same parameters $(\kappa_{n},\lambda(n))$ as in (2.13).

The case $\nabla\sigma=0$ will also serve a useful purpose, for example in analysing the error of a numerical implementation as in proposition 1.4. For instance whenever $({\cal T})_{2}$ is met we have the almost sure and uniform gradient estimates

[TABLE]

In addition, we have the almost sure and uniform Hessian estimates

[TABLE]

A proof of the above estimates is provided in the beginning of section 3.1 and section 3.2. In this situation, whenever $({\cal T})_{2}$ is met we have

[TABLE]

In the above display, $T_{s,t}(\Delta a,\Delta b)(x)$ stands for the stochastic process discussed in (1.11), and $\kappa$ stands for some finite constant that doesn’t depend on the parameter $n$ . For instance, for a Langevin diffusion associated with some convex potential function $U$ we have $b=-\nabla U$ and $\nabla\sigma=0$ . Then assuming

[TABLE]

where the almost sure tangent and Hessian bounds follow from (2.15) and (2.16) respectively.

In practice, it is often easier to work with $a_{t}(x)=\sigma_{t}(x)\sigma_{t}(x)^{\prime}$ than $\sigma_{t}(x)$ and we now discuss some ways of estimating $\Delta\sigma_{t}(x)=\sigma_{t}(x)-\overline{\sigma}_{t}(x)$ in terms of $\Delta a_{t}(x)=a_{t}(x)-\overline{a}_{t}(x)$ and in the reverse direction. The latter is straightforward:

[TABLE]

To estimate $\Delta\sigma_{t}$ in terms of $\Delta a_{t}$ , assume the following ellipticity condition is satisfied

[TABLE]

We recall the Ando-Hemmen inequality [2] for any symmetric positive definite matrices $Q_{1},Q_{2}$

[TABLE]

for any unitary invariant matrix norm $\|.\|$ . In the above display, $\lambda_{\tiny min}(\mbox{\LARGE.})$ stands for the minimal eigenvalue. We also have the square root inequality

[TABLE]

See for instance theorem 6.2 on page 135 in [26], as well as proposition 3.2 in [2]. A proof of (2.21) can be found in [7]. In this situation, using (2.20) and (2.21) we check that

[TABLE]

This provides a way to estimate the growth of $\sigma_{t}(x)$ in terms of the one of $a_{t}(x)$ . For instance the estimate (1.16) combined with (2.22) implies that

[TABLE]

$\bullet$ Assume that $(\overline{{\cal M}})_{n}$ is satisfied for some $n\geq 1$ . Also let $f_{t}(x)$ be some multivariate function such that

[TABLE]

In this situation, we have the estimates

[TABLE]

2.3 Some results on anticipating stochastic calculus

In this section we review some results on Malliavin derivatives and Skorohod integration calculus which will be needed below. We restrict the presentation to unit time intervals. Let $(\Omega,{\cal W})$ be the canonical space equipped with the Wiener measure $\mathbb{P}$ associated with the $r$ -dimensional Brownian motion $W_{t}$ discussed in the introduction.

The Malliavin derivative $D_{t}$ is a linear operator from some dense domain $\mathbb{D}_{2,1}\subset\mathbb{L}_{2}(\Omega)$ into the space $\mathbb{L}_{2}(\Omega\times[0,1];\mathbb{R}^{r})$ of $r$ -dimensional processes with square integrable states on the unit time interval. For multivariate $d$ -column vector random variables $F$ with entries $F^{i}$ , we use the same rules as for the gradient and we set

[TABLE]

For $(p\times q)$ -matrices $F$ with entries $F^{j}_{k}$ we let $D_{t}F$ be the tensor with entries

[TABLE]

It is clearly out of the scope of this article to review the analytical construction of Malliavin differential calculus. For a more thorough discussion we refer the reader to the seminal book by Nualart [37], see also the more synthetic presentation in the articles [38, 41].

Formally, one can think the Malliavin derivatives $D_{t}^{i}F$ of some $F\in\mathbb{D}_{2,1}$ as way to extract from the random variable $F$ the integrand of Brownian increment $dW^{i}_{t}$ . For instance, when $s\leq t$ we have

[TABLE]

As conventional differentials, for any smooth function $G$ from $\mathbb{R}^{d}$ into $\mathbb{R}^{p\times q}$ , Malliavin derivatives satisfy the chain rule properties

[TABLE]

For instance, for any $s\leq u\leq v$ we have

[TABLE]

In the same vein, we have

[TABLE]

Let $\mathbb{L}_{2,1}(\mathbb{R}^{r})\subset\mathbb{L}_{2}(\Omega\times[0,1];\mathbb{R}^{r})$ be the Hilbert space of $r$ -dimensional process $U_{t}$ with Malliavin differentiable entries $U^{i}_{t}\in\mathbb{D}_{2,1}$ equipped with the norm

[TABLE]

The Skorohod integral w.r.t. the Brownian motion $W^{i}_{t}$ on the unit interval is defined a linear and continuous mapping from

[TABLE]

characterized by the two following properties

[TABLE]

The above formula can be seen as an extended version of the Itô isometry to Skorohod integrals, for instance [39], as well as chapters 1.3 to 1.5 in the book by Nualart [37].

As for the Itô integral, the Skorohod integral w.r.t. the $r$ -dimensional Brownian motion $W_{t}$ of a matrix valued process with entries $V^{i}_{k}\in\mathbb{L}_{2,1}(\mathbb{R})$ is defined by the column vector with entries

[TABLE]

3 Variational equations

3.1 The tangent process

In terms of the tensor product (2.4), the gradient $\nabla X_{s,t}(x)$ of the diffusion flow $X_{s,t}(x)$ is given by the gradient $(d\times d)$ -matrix

[TABLE]

where $W^{k}_{t}$ is the $k$ -th component of the Brownian motion. After some calculations we check that

[TABLE]

with the matrix function $A_{t}(x)$ defined in (2.7) and the symmetric matrix valued martingale

[TABLE]

These expansions, when combined with condition $({\cal T})_{2}$ , yield the following estimates of the difference between $X_{s,t}(x)$ and $X_{s,t}(y)$ .

Proposition 3.1.

Assume $({\cal T})_{2}$ is satisfied. Then

[TABLE]

In addition, we have the almost sure estimate

[TABLE]

Proof of Prop. 3.3.

Whenever $({\cal T})_{2}$ is met, we have the following uniform estimate from (3.1)

[TABLE]

where the $\sqrt{d}$ term arises from imposing the initial condition $\nabla X_{s,s}(x)=I$ on the resulting differential equation for $\partial_{t}\mathbb{E}\left(\|\nabla X_{s,t}(x)\|_{F}^{2}\right)^{1/2}$ . In addition, when $\nabla\sigma=0$ the martingale $M_{s,t}(x)=0$ is null and as a consequence of (3.1) we have the following almost sure estimate

[TABLE]

The Taylor expansion

[TABLE]

combined with (3.4) and (3.5) completes the proof. ∎

These contraction inequalities quantify the stability of the stochastic flow $X_{s,t}(x)$ w.r.t. the initial state $x$ . For instance, the estimate (3.2) ensures that the Markov transition semigroup is exponentially stable; that is, we have that

[TABLE]

For the Langevin diffusions discussed in (2.18) the stochastic flow is time homogeneous; that is we have that $X_{s,t}=X_{t-s}:=X_{0,(t-s)}$ and $P_{s,t}=P_{t-s}:=P_{0,(t-s)}$ . In addition when $\sigma(x)=\sigma~{}I$ , the diffusion flow $X_{t}(x)$ has a single invariant measure on $\mathbb{R}^{d}$ given by the Boltzmann-Gibbs measure

[TABLE]

From (2.18), it follows that

[TABLE]

for all $n\geq 1$ .

Taking the trace in (3.1) we also find that

[TABLE]

with the martingale

[TABLE]

Observe that

[TABLE]

This implies that

[TABLE]

Whenever $({\cal T})_{2}$ is met, we have the estimate

[TABLE]

with the uniform log-norm parameter $\rho(\nabla\sigma)$ defined in (2.5). This yields the estimate

[TABLE]

More generally, we readily check the following result.

Proposition 3.2.

When condition $({\cal T})_{n}$ is met we have the following time-uniform bounds,

[TABLE]

3.2 The Hessian process

In terms of the tensor product (2.1), we have the matrix diffusion equation

[TABLE]

with the null matrix initial condition $\nabla^{2}X_{s,s}(x)=0$ and the matrix-valued martingale

[TABLE]

Consider the tensor functions

[TABLE]

After some computations, we check that

[TABLE]

with the matrix function $A_{t}(x)$ defined in (2.7) and the tensor-valued martingale

[TABLE]

When $\nabla\sigma=0$ the above equation reduces to

[TABLE]

Whenever $({\cal T})_{2}$ is met, taking the trace in the above display we check that

[TABLE]

This yields the estimate

[TABLE]

Using (2.15) this implies that

[TABLE]

This ends the proof of the almost sure estimate (2.16).

For more general models, we have that

[TABLE]

with a continuous martingale $M_{s,t}(x)$ with angle bracket

[TABLE]

Proposition 3.3.

Assume $({\cal T})_{n}$ is met. In this situation, for any $\epsilon>0$ s.t. $\lambda_{A}(n)>\epsilon$ we have

[TABLE]

with the parameters ${\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}(b,\sigma)$ and $\lambda_{A}(n)$ defined in (2.6) and (2.8).

In the above display, $\rho_{\star}(\nabla\sigma)$ is defined in (2.5). The proof of the above estimate is technical and thus housed in the appendix on page Proof of proposition 3.3

3.3 Bismut-Elworthy-Li formulae

We further assume that ellipticity condition (2.19) is met. In this situation, we can extend gradient semigroup formulae to measurable functions using the Bismut-Elworthy-Li formula

[TABLE]

with the stochastic process

[TABLE]

The above formula is valid for any function $\omega_{s,t}:u\in[s,t]\mapsto\omega_{s,t}(u)\in\mathbb{R}$ of the following form

[TABLE]

for some non decreasing differentiable function $\varphi$ on $[0,1]$ with bounded continuous derivatives and such that

[TABLE]

Whenever $({\cal T})_{2}$ is met, combining (3.4) with (3.11), for any $f$ s.t. $\|f\|\leq 1$ we check that

[TABLE]

Let $\varphi_{\epsilon}$ with $\epsilon\in]0,1[$ be some differentiable function on $[0,1]$ null on $[0,1-\epsilon]$ and such that $|\partial\varphi_{\epsilon}(u)|\leq c/\epsilon$ and $(\varphi_{\epsilon}(1-\epsilon),\varphi(1))=(0,1)$ . For instance we can choose

[TABLE]

In this situation, we check that

[TABLE]

from which we find the rather crude uniform estimate

[TABLE]

In the same vein, for any $s\leq u\leq t$ we have the formulae

[TABLE]

with the process

[TABLE]

In the above display $\overline{\nabla}a^{-1/2}_{u}$ stands for the tensor function

[TABLE]

A detailed proof of the formulae (3.14) and (3.15) in the context of nonlinear diffusion flows can be found in the appendix in [5].

Observe that

[TABLE]

Whenever $({\cal T})_{2}$ is met, using the estimate (3.3) for any $\epsilon\in]0,1[$

[TABLE]

In the same vein, using (3.15) for any $u\in]s,t[$ and any bounded measurable function $f$ s.t. $\|f\|\leq 1$ we also check the rather crude uniform estimate

[TABLE]

Choosing $u=s+(1-\epsilon)(t-s)$ in the above display we check that for any $\epsilon\in]0,1[$ we obtain the uniform estimate

[TABLE]

The extended versions of the above formulae in the context of diffusions on differentiable manifolds can be found in the series of articles [6, 13, 23, 36, 46].

4 Backward semigroup analysis

4.1 The two-sided stochastic integration

For any given time horizon $s\leq t$ we have the rather well known backward stochastic flow equation

[TABLE]

The right hand side integral is understood as a conventional backward Itô-integral. In a more synthetic form, the above backward formula reduces to (1.7).

An elementary proof of the above formula based on Taylor expansions is presented in [17], different approaches can also be found in [31] and [33]. Extensions of the backward Itô formula (4.1) to jump type diffusion models as well as nonlinear diffusion flows can also be found in [16] and in the appendix of [3].

Consider the discrete time interval $[s,t]_{h}:=\{u_{0},\ldots,u_{n-1}\}$ associated with some refining time mesh $u_{i+1}=u_{i}+h$ from $u_{0}=s$ to $u_{n}=t$ , for some time step $h>0$ . In this notation, combining (1.6) with (1.8) for any $u\in[s,t]_{h}$ we have the Taylor type approximation

[TABLE]

This yields the interpolating forward-backward telescoping sum formula

[TABLE]

We obtain formally (1.10) by summing the above terms and passing to the limit $h\downarrow 0$ .

To be more precise, we follow the two-sided stochastic integration calculus introduced by Pardoux and Protter in [43]. As mentioned by the authors this methodology can be seen as a variation of Itô original construction of the stochastic integral. In this framework, the Skorohod stochastic integral (1.12) arising in (1.9) is defined by the $\mathbb{L}_{2}$ -convergence

[TABLE]

The proof of the above assertion is based on a slight extension of proposition 3.3 in [43] to Skorohod integrals of the form (1.12). For the convenience of the reader, a detailed proof of the above assertion for one dimensional models is provided in section 6.1.

Using (4.3), the complete proof of (1.9) now follows the same line of arguments as the ones used in the proof of Itô-type change rule formula stated in theorem 6.1 in [43], thus it is skipped.

4.2 A multivariate stochastic interpolation formulae

In terms of the tensor product (2.1), for any $p\geq 1$ and any twice differentiable function $f$ from $\mathbb{R}^{d}$ into $\mathbb{R}^{p}$ with at most polynomial growth the function $F_{s,t}:=\mathbb{P}_{s,t}(f)$ satisfies the backward formula (1.4) with the random fields

[TABLE]

Using the quantitative estimates presented in section 5.2, we checked that the regularity conditions $(H_{1})$ , $(H_{2})$ and $(H_{3})$ stated in section 1.1.1 are satisfied. Rewritten in terms of the stochastic semigroups $\mathbb{P}_{s,t}$ and $\overline{\mathbb{P}}_{s,t}$ we obtain the forward-backward multivariate interpolation formula

[TABLE]

with the stochastic integro-differential operator

[TABLE]

and the two-sided stochastic integral term given by

[TABLE]

Using elementary differential calculus, for twice differentiable (column vector-valued) function $f$ from $\mathbb{R}^{d}$ into $\mathbb{R}^{p}$ we readily check the gradient and the Hessian formulae

[TABLE]

This shows that $\mathbb{T}_{s,t}(f,\Delta a,\Delta b)$ and $\SS_{s,t}(f,\Delta\sigma)$ have the same form as the integrals $T_{s,t}(\Delta a,\Delta b)$ and $S_{s,t}(\Delta a,\Delta b)$ defined in (1.10) and (1.11) up to some terms involving the gradient and the Hessian of the function $f$ . For instance, we have the two-sided stochastic integral formula

[TABLE]

Also observe that (4.4) coincides with (1.10) for the identity function; that is, we have that

[TABLE]

The above discussion shows that the analysis of the differences of the stochastic semigroups $(\mathbb{P}_{s,t}-\overline{\mathbb{P}}_{s,t})$ in terms of the tangent and the Hessian processes is essentially the same as the one of the difference of the stochastic flows $(X_{s,t}-\overline{X}_{s,t})$ . For instance using the discussion provided section 5.3, when the gradient and the Hessian of the function $f$ are uniformly bounded the estimates stated in theorem 1.3 can be easily extended at the level of the stochastic semigroups.

The $\mathbb{L}_{2}$ -norm of the two-sided stochastic integrals in (1.10) and (4.4) are uniformly estimated as soon as the pair of drift and diffusion functions $(b_{t},\sigma_{t})$ and $(\overline{b},\overline{\sigma}_{t})$ satisfy condition $({\cal T})_{2}$ . For a more thorough discussion we refer to section 5.1, see for instance the $\mathbb{L}_{n}$ -norm estimates presented in theorem 5.2 applied to the difference function $\varsigma_{t}=\Delta\sigma_{t}$ .

4.3 Semigroup perturbation formulae

Besides the fact that the Skorohod integral in the r.h.s. of (4.4) is not a martingale (w.r.t. the Brownian motion filtration) it is centered (see for instance (2.26) and the argument provided in the beginning of section 5.1). Thus, taking the expectation in the univariate version of (4.4) we obtain the following interpolation semigroup decomposition.

Corollary 4.1.

For any twice differentiable function $f$ from $\mathbb{R}^{d}$ into $\mathbb{R}$ with bounded derivatives we have the forward-backward semigroup interpolation formula

[TABLE]

In addition, under some appropriate regularity conditions for any differentiable function $f$ such that $\|f\|\leq 1$ and $\|\nabla f\|\leq 1$ we have the uniform estimate

[TABLE]

Rewritten in terms of the infinitesimal generators $(L_{t},\overline{L}_{t})$ of the stochastic flows $(X_{s,t},\overline{X}_{s,t})$ we recover the rather well known semigroup perturbation formula

[TABLE]

The above formula can be readily checked using the interpolating formula given for any $s\leq u<t$ by the evolution equation

[TABLE]

Now we come to the proof of (4.9). Whenever $({\cal T})_{2}$ is met, combining (3.13) with (3.16) for any differentiable function $f$ s.t. $\|f\|\leq 1$ and $\|\nabla f\|\leq 1$ and for any $\epsilon\in]0,1[$ we check that

[TABLE]

This ends the proof of (4.9).

After some elementary manipulations the forward-backward interpolation formula (4.8) yields the following corollary.

Corollary 4.2.

Let $X_{t}$ and $\overline{X}_{t}$ be some ergodic diffusions associated with some time homogeneous drift and diffusion functions $(b,\sigma)$ and $(\overline{b},\overline{\sigma})$ . The invariant probability measures $\pi$ and $\overline{\pi}$ of $X_{t}$ and $\overline{X}_{t}$ are connected for any twice differentiable function $f$ from $\mathbb{R}^{d}$ into $\mathbb{R}$ with bounded derivatives by the following interpolation formula

[TABLE]

In the above display $\overline{Y}$ stands for a random variable with distribution $\overline{\pi}$ and $P_{t}$ stands for the Markov transition semigroup of the process $X_{t}$ .

The formula (4.10) can be used to estimate the invariant measure of a stochastic flow associated with some perturbations of the drift and the diffusion function.

For instance, for homogeneous Langevin diffusions $X_{t}$ associated with some convex potential function $U$ we have

[TABLE]

In the above display, $dx$ stands for the Lebesgue measure on $\mathbb{R}^{d}$ . In this situation, using (4.10), for any ergodic diffusion flow $\overline{X}_{t}$ with some drift $\overline{b}$ and an unit diffusion matrix we have

[TABLE]

Notice that the above formula is implicit as the r.h.s. term depends on $\overline{\pi}$ . By symmetry arguments, we also have the following more explicit perturbation formula

[TABLE]

In the above display ${Y}$ stands for a random variable with distribution ${\pi}$ and $\overline{P}_{t}$ stands for the Markov transition semigroup of the process $\overline{X}_{t}$ .

4.4 Some extensions

Several extensions of the forward-backward stochastic interpolation formula (1.10) to more general stochastic perturbation processes can be developed. For instance, suppose we are given some stochastic processes $\overline{Y}_{s,t}(x)\in\mathbb{R}^{d}$ and $\overline{Z}_{s,t}(x)\in\mathbb{R}^{d\times r}$ adapted to the filtration of the Brownian motion $W_{t}$ , and let $\overline{X}_{s,t}(x)$ be the stochastic flow defined by the stochastic differential equation

[TABLE]

In this situation, the interpolation formula (1.9) remains valid when $\overline{a}_{u}(\overline{X}_{s,u}(x))$ is replaced by the stochastic matrices $\overline{Z}_{s,t}(x)\overline{Z}_{s,t}(x)^{\prime}$ . This yields without further work the forward-backward stochastic interpolation formula (1.10) with the local perturbations

[TABLE]

The corresponding interpolation formula should be used with some caution as the $\mathbb{L}_{2}$ -norm of the two-sided stochastic integral (1.12) depends on the Malliavin differential of the integrand process of the Brownian motion; see for instance the variance formula provided in lemma 5.1.

Assume that $\sigma=I$ and the regularity condition $({\cal T})_{2}$ is met. Also suppose $\overline{X}_{s,t}(x)$ is given by a stochastic differential equation of the form (4.11) with $r=d$ and $\overline{Z}_{s,t}(x)=I$ . Arguing as above, in terms of the tensor product (2.1) we have

[TABLE]

Combining (2.15) with the generalized Minkowski inequality, we check the following proposition.

Proposition 4.3.

Assume that $({\cal T})_{2}$ is met for some $\lambda_{A}>0$ . In this situation, for any $1\leq n\leq\infty$ we have the estimates

[TABLE]

In the same vein, we have

[TABLE]

For instance, for the Langevin diffusion discussed in (2.18) and (3.7) the weak expansion (4.14) implies that

[TABLE]

This yields the $\mathbb{W}_{1}$ -Wasserstein estimate

[TABLE]

Combining (3.13) with (4.15), for any $\epsilon\in]0,1[$ we also have the total variation norm estimate

[TABLE]

5 Skorohod fluctuation processes

5.1 A variance formula

Let $\varsigma_{t}(x)$ be some differentiable $(d\times r)$ -matrix valued function on $\mathbb{R}^{d}$ such that

[TABLE]

Recalling that $(W_{u+h}-W_{u})$ is independent of the flows $\overline{X}_{s,u}$ and $\nabla X_{u+h,t}$ , the discrete time approximation (4.3) shows that Skorohod stochastic integral is centered; that is, we have that $\mathbb{E}(S_{s,t}(\varsigma)(x))=0$ .

Following (4.3), the variance can be computed using the following approximation formula

[TABLE]

The proof of the above assertion is provided in section 6.1, see for instance proposition 6.2.

Consider the matrix valued function

[TABLE]

In this notation, the limiting diagonal term $u=v$ in the r.h.s. of (5.2) is clearly equal to

[TABLE]

In addition, whenever condition $({\cal T})_{2}$ is met and $\varsigma$ is bounded, (3.4) readily yields the estimate

[TABLE]

More generally, using (3.8) whenever $(\overline{{\cal M}})_{2/\delta}$ and $({\cal T})_{2/(1-\delta)}$ are met for some $\delta\in]0,1[$ we have the estimate

[TABLE]

This implies that

[TABLE]

The non-diagonal term can be computed in a more direct way using Malliavin derivatives of the functions $\Sigma_{s,u,t}$ . For any $s\leq u\leq v\leq t$ we have

[TABLE]

As expected, observe that

[TABLE]

In the reverse angle, whenever $s\leq v\leq u\leq t$ we have the chain rule formula

[TABLE]

As above, Malliavin differentials $D_{v}\left(\varsigma_{u}\circ\overline{X}_{s,u}\right)$ and $D_{v}\overline{X}_{s,u}$ can be computed using the chain rule formulae (2.24).

A more detailed analysis of the chain rules formulae (2.24), (2.25) and (5.7) for one dimensional models is provided in section 6.1 (cf. lemma 6.1).

Observe that

[TABLE]

We consider the inner product

[TABLE]

In this notation, an explicit description of the $\mathbb{L}_{2}$ -norm of the two-sided stochastic integral in terms of Malliavin derivatives is given below.

Lemma 5.1.

The $\mathbb{L}_{2}$ -norm of the Skorohod integral $S_{s,t}(\varsigma)(x)$ introduced in (4.3) is given for any $x\in\mathbb{R}^{d}$ and $s\leq t$ by the formulae

[TABLE]

with the random matrix function $\Sigma_{s,u,t}$ defined in (5.3) and the Malliavin derivative $D_{v}\Sigma_{s,u,t}$ given in formulae (5.6) and (5.7). In addition, we have

[TABLE]

The above lemma can be interpreted as a matrix version of the isometry property (2.26). A proof of the above lemma based on the $\mathbb{L}_{2}$ -approximation of two-sided stochastic integrals is provided in section 6.1 (see for instance proposition 6.2).

5.2 Quantitative estimates

For any $p>1$ and any tensor norms we also quote the rather well known $\mathbb{L}_{p}$ -norm estimates

[TABLE]

for some finite constants $c_{i,p}$ whose values only depend on $p$ . A proof of these estimates can be found in [38, 48], see also [39] for multiple Skorohod integrals. By the generalized Minkowski inequality, for any $n\geq 2$ we also have the estimate

[TABLE]

Observe that for any $n\geq 2$ we have

[TABLE]

The main objective of this section is to prove the following theorem.

Theorem 5.2.

Assume that $(M)_{2n/\delta}$ and $(T)_{2n/(1-\delta)}$ are satisfied for some parameter $n\geq 2$ and some $\delta\in]0,1[$ . In this situation, we have the uniform estimate

[TABLE]

For uniformly bounded diffusion functions $(\varsigma,\sigma,\overline{\sigma})$ whenever $(T)_{2n}$ is met for some $n\geq 2$ we have

[TABLE]

In addition, for constant diffusion functions $(\varsigma,\sigma,\overline{\sigma})$ whenever $(T)_{2}$ is met, for any $n\geq 2$ we have the uniform estimate

[TABLE]

The proof of the above theorem, including a more detailed description of the parameters $\kappa_{\delta,n}$ and $\kappa_{n}$ is provided below.

Next, we estimate the $\mathbb{L}_{n}$ -norm of the Malliavin differential $D_{v}\Sigma_{s,u,t}(x)$ in the two cases $(s\leq u\leq v\leq t)$ and $(s\leq v\leq u\leq t)$ .

Case $(s\leq u\leq v\leq t)$ :

Using (5.6) we have

[TABLE]

Using (2.24) and (2.25) this yields the estimate

[TABLE]

with the functions

[TABLE]

In the above display, $Z^{s,v}_{u}(x)$ stands for the interpolating flow defined in (1.13).

•

Firstly assume that $\|\varsigma\|\vee\|\sigma\|<\infty$ and $({\cal T})_{2n}$ is satisfied for some parameter $n\geq 1$ . In this situation, applying proposition 3.2 and proposition 3.3, for any $\epsilon\in]0,1[$ we have the uniform estimates

[TABLE]

with the parameter ${\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{n,\epsilon}(b,\sigma)$ given by

[TABLE]

•

More generally, when $\|\nabla\varsigma\|\vee\|\nabla\sigma\|<\infty$ the functions $\varsigma_{t}(x)$ and $\sigma_{t}(x)$ may grow at the most linearly with respect to $\|x\|$ . Assume that conditions $(M)_{2n/\delta}$ and condition $({\cal T})_{2n/(1-\delta)}$ are satisfied for some parameters $n\geq 1$ and $\delta\in]0,1[$ . In this situation, applying Hölder inequality we check that

[TABLE]

Applying proposition 3.2 we check that

[TABLE]

In the same vein, combining proposition 3.2 and proposition 3.3 with the uniform moment estimates (2.11) we check that

[TABLE]

We conclude that

[TABLE]

with the parameter

[TABLE]

Case $(s\leq v\leq u\leq t)$ :

We use (5.7) to check that

[TABLE]

On the other hand, using the chain rules (2.24) we have

[TABLE]

This yields the estimate

[TABLE]

•

Firstly assume that $\|\varsigma\|\vee\|\overline{\sigma}\|<\infty$ and condition $(T)_{2n}$ is satisfied for some $n\geq 1$ . In this situation, arguing as above for any $\epsilon\in]0,1[$ we have the uniform estimates

[TABLE]

for some universal constant $c$ and the parameter $\overline{{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}}_{n,\epsilon}(b,\sigma)$ given by

[TABLE]

•

More generally assume that $\|\nabla\varsigma\|\vee\|\nabla\overline{\sigma}\|<\infty$ . Also assume that conditions $(M)_{2n/\delta}$ and $(T)_{2n/(1-\delta)}$ are satisfied for some parameters $n\geq 1$ and $\delta\in]0,1[$ . In this situation, we have

[TABLE]

with the parameter

[TABLE]

The end of the proof of theorem 5.2 is a direct consequence of the estimates discussed above combined with (5.8) and the diagonal estimates presented in (5.4).

5.3 Some extensions

This section is concerned with the two-sided stochastic integral (4.6). Using the gradient formula in (4.7) the Skorohod stochastic integral in (4.6) takes the form

[TABLE]

with the integrands

[TABLE]

As in (2.25), using the chain rule properties of Malliavin derivatives we check that

[TABLE]

as well as

[TABLE]

This yields the differential formula

[TABLE]

The Malliavin derivatives $D^{i}_{v}\Sigma_{s,u,t}$ are computed using formulae (5.6) and (5.7); thus, it remains to compute the Malliavin derivatives $D_{v}Z^{s,t}_{u}$ of the interpolating path.

$\bullet$ When $u\leq v$ we have

[TABLE]

In this situation, as in (2.24) using the chain rule properties of Malliavin derivatives we check that

[TABLE]

By (2.23) we conclude that

[TABLE]

$\bullet$ When $v\leq u$ we have

[TABLE]

In this situation, arguing as above we check that

[TABLE]

By (2.23) we conclude that

[TABLE]

6 Some anticipative calculus

For clarity and to avoid unnecessary sophisticated multi-index notation, we only consider one dimensional model. The proof of the results presented in this section in the general case can be reproduced word-for-word for multidimensional models.

To simplify the presentation, we write $\partial^{n}f$ the derivative of order $n\geq 1$ of a smooth function $f$ . We also set $Y_{s,t}(x):=\overline{X}_{s,t}(x)$ . We also reduce the analysis to the unit interval. In this context, for any $t\in[0,1]$ we set

[TABLE]

6.1 Extended two-sided stochastic integrals

The aim of this section is to extend the two-sided stochastic integration introduced in [43] to Skorohod integrals of the form (4.3), for some time homogeneous function $\varsigma_{u}=\varsigma$ satisfying (5.1). For any $t\in[0,1]$ we set

[TABLE]

In this notation the limiting integral in (4.3) takes formally the following form

[TABLE]

The existence of this two-sided stochastic integral is discussed below in (6.4).

To simplify the presentation, we fix the state variable $x$ and we write $Y_{t}$ and $\Phi(X^{t},Y_{t})$ instead of $Y_{t}(x)$ and $\Phi(X^{t},Y_{t}(x))$ . Next technical lemma provided a more explicit description of the Malliavin derivatives of the processes $\Phi(X^{t},Y_{t})$ .

Lemma 6.1.

For any $s<t$ we have

[TABLE]

In addition, we have

[TABLE]

Proof.

Using the chain rules properties, for any $s<t$ we have

[TABLE]

The end of the proof of the first assertion comes from the fact that

[TABLE]

In the same vein, we have

[TABLE]

We also have that

[TABLE]

The last assertion comes from the fact that

[TABLE]

The r.h.s. term in the above display can be rewritten as follows

[TABLE]

In the same vein, we have

[TABLE]

This ends the proof of the second assertion. The proof of the lemma is now completed.

From the above lemma, we also check that all the $n$ -absolute moments of the Malliavin derivatives $D_{s}\,\Phi(X^{t},Y_{t})$ are finite with at most quadratic growth w.r.t. the initial values.

Next proposition extends proposition 3.3 in [43] to stochastic processes of the form (6.2).

Proposition 6.2.

Let $[0,1]_{h}$ be any refining sequence of partitions of the unit interval. For any $h>0$ we define

[TABLE]

Then $S^{h}(\Phi)$ is a Cauchy sequence in $\mathbb{L}_{2}(\Omega)$ . In addition, for any decreasing sequence of time steps $h_{1}>h_{2}$ we have the formula

[TABLE]

Before entering into the details of the proof of the proposition, we give a couple of comments. The hypothesis that $[0,1]_{h}$ is a refining sequence indexed by $h$ is not essential but it simplifies the proof of the proposition, see for instance lemma 3.1.1 in [37]. Arguing as in the proof of theorem 3.3 and theorem 7.1 in [43] the above proposition ensures that the two-sided integral defined by the $\mathbb{L}_{2}(\Omega)$ -limit coincides with the two-sided stochastic integral of the process $\Phi(X^{t},Y_{t})$ over the unit interval; that is, we have that

[TABLE]

In this context, proposition 6.2 can be interpreted as a version of the isometry property (2.26) for the generalized two-sided integral defined above.

Proof of proposition 6.2:

We fix $h_{1}>h_{2}$ and we assume that $[0,1]_{h_{2}}$ is a refinement of $\in[0,1]_{h_{1}}$ . For any $(s,t)\in([0,1]_{h_{1}}\times[0,1]_{h_{2}})$ we also set

[TABLE]

With a slight abuse of notation we set

[TABLE]

$\bullet$ For any overlapping pair $s<t<t+h_{2}<s+h_{1}$ using the decomposition

[TABLE]

we have

[TABLE]

It follows from the continuity properties of the processes that

[TABLE]

$\bullet$ When $s+h_{1}<t$ we have

[TABLE]

On the other hand, we have the decomposition

[TABLE]

with the increment functions

[TABLE]

With a slight abuse of notation, we shall denote by $\mbox{\rm O}(h^{p})$ some possible random variable with any $n$ -absolute moment of order $h^{p}$ , for some $p>0$ with $0<h<1$ . In this notation, we have

[TABLE]

Given a smooth function $\theta$ we set

[TABLE]

In this notation, we have the first and second order decompositions

[TABLE]

This implies that

[TABLE]

from which we conclude that

[TABLE]

This yields the first order decomposition

[TABLE]

with the functions

[TABLE]

Notice that none of the functions but the increment functions $(\Delta X_{t})$ and $(\Delta X^{\prime}_{t})$ depend on ${\cal W}_{t,t+h_{2}}$ , nor on ${\cal W}_{s,s+h_{1}}$ .

In the reverse angle, we have

[TABLE]

with

[TABLE]

Arguing as above, we have

[TABLE]

We conclude that

[TABLE]

In the same vein, we have

[TABLE]

Multiplying these terms, we check that

[TABLE]

with the functions

[TABLE]

None of the functions but the increment $\Delta Y_{s}$ depend on ${\cal W}_{s,s+h_{1}}$ , nor on ${\cal W}_{t,t+h_{2}}$ .

Recall that the functions $\Phi(X^{t+h_{2}},Y_{t})$ and $\psi^{0}_{s,t}(Y_{s})$ don’t depend on $\Delta W_{t}$ . In addition, the functions $\Phi(X^{s+h_{1}},Y_{s})$ and $\Psi^{0}_{s,t}(Y_{s})$ don’t depend on $\Delta W_{s}$ . This yields the formula

[TABLE]

To take the final step, observe that

[TABLE]

In the same vein, we have

[TABLE]

and

[TABLE]

This shows that

[TABLE]

It follows that

[TABLE]

We end the proof of (6.3) using lemma 6.1 and symmetry arguments. This ends the proof of the proposition.

6.2 Generalized backward Itô-Ventzell formula

This section is mainly concerned with the proof of theorem 1.1. Before entering into the details of the proof we discuss how it applies to the process $(X^{t},Y_{t})$ introduced in (6.1).

Consider the random fields

[TABLE]

In this notation, the backward random field formula (4.1) with $t\in[0,1]$ takes the form

[TABLE]

We fix some given $Y_{0}=y\in\mathbb{R}$ and we write $Y_{t}$ instead of $Y_{t}(y)$ and set

[TABLE]

In this notation, we have

[TABLE]

Observe that $B_{u},\Sigma_{u}$ as well as the Malliavin derivatives $D_{v}\Sigma_{u}=\partial\overline{\sigma}(Y_{u})~{}D_{v}Y_{u}$ have moments of any order. Consider the processes

[TABLE]

In this notation, up to a change of sign and replacing $x$ by $Y_{0}$ in (1.10) the stochastic interpolation formula stated in theorem 1.2 on the unit interval takes the following form

[TABLE]

More generally, suppose we are given a forward real valued continuous semi-martingale $Y_{t}$ of the form (6.7) for some ${\cal W}_{0,t}$ -adapted functions $B_{t}$ and $\Sigma_{t}$ , and a backward random field models of the form (6.6) for some ${\cal W}_{t,1}$ -adapted functions $F_{t}(x),G_{t}(x),H_{t}(x)$ .

We consider the following conditions:

$(H_{1})^{\prime}$ : The functions $F_{t}(x)$ , $G_{t}(x)$ and $H_{t}(x)$ as well as the differentials $\partial H_{t}(x)$ and $\partial^{2}F_{t}(x)$ are continuous w.r.t. $(t,x)$ for any given $\omega\in\Omega$ . In addition, for any $n\geq 1$ we have

[TABLE]

$(H_{2})^{\prime}$ : The Malliavin derivatives $D_{s}\partial F_{t}(x)$ and $D_{s}H_{t}(x)$ are continuous w.r.t. $x$ and $(s,t)$ for any given $\omega\in\Omega$ . In addition, for any $n\geq 1$ we have

[TABLE]

$(H_{3})$ *: The random processes $B_{u},\Sigma_{u}$ as well as $D_{v}\Sigma_{u}$ are continuous w.r.t. the time parameter and they have moments of any order.

The next theorem is a slight extension of theorem 1.1 applied to the semi-martingale and the random fields models discussed in (6.7) and (6.5).

Theorem 6.3.

Consider a backward random field models of the form (6.6) for some functions $F_{t}(x),G_{t}(x),H_{t}(x)$ satisfying $(H_{1})^{\prime}$ and $(H_{2})^{\prime}$ . Also let $Y_{t}$ be a continuous semi-martingale of the form (6.7) functions $B_{t}$ and $\Sigma_{t}$ satisfying $(H_{3})$ . In this situation, for any $t\in[0,1]$ we have the generalized backward Itô-Ventzell formula

[TABLE]

The r.h.s. term in the above display is understood as a Skorohod integral.

Proof: We use the same approximation technique as in [12, 41] and [42] (see also the proof of theorem 3.2.11 in [37]). Consider a mollifier type approximation of the identify given for any $\epsilon>0$ by the function

[TABLE]

For any $x$ , applying the Itô-type change rule formula stated in proposition 8.2 in [38] to the product function

[TABLE]

we check that

[TABLE]

with

[TABLE]

The stochastic integral in the r.h.s. of (6.11) can be interpreted as a two-sided stochastic integral. Recalling that

[TABLE]

we check that

[TABLE]

Condition $(H_{3})$ ensures that the processes $Y_{t}$ and $D_{t}Y_{s}$ have moments of any order. In addition, under the regularity conditions $(H_{1})^{\prime}$ and $(H_{2})^{\prime}$ we check that

[TABLE]

Applying the Fubini theorem for Skorohod and measure theory integrals (see for instance [34, 37, 44] and the work by Leon [35]) we check that

[TABLE]

with

[TABLE]

Integrating by parts where derivatives of $\varphi_{\epsilon}$ appear we check that

[TABLE]

From the a.s. continuity of $F_{t}(x)$ in $x$ for each $t\geq 0$ , we have

[TABLE]

The functions $\partial F_{t}(x)$ , $\partial^{2}F_{t}(x)$ and $G_{t}(x)$ are almost surely continuous w.r.t. $x$ and uniformly locally bounded. In addition, the random variables $A_{t}$ and $B_{t}$ are integrable at any order. Moreover, under $(H_{1})^{\prime}$ there exists some parameter $n\geq 0$ depending on the support of $\varphi$ such that for any $\epsilon>0$ we have the estimate

[TABLE]

Thus, by the dominated convergence theorem on $(\Omega\times[0,1])$ equipped with the measure $(\mathbb{P}(d\omega)\otimes dt)$ we have

[TABLE]

It remains to check that

[TABLE]

Observe that

[TABLE]

Using the chain rule property we have

[TABLE]

Integrating by parts, we check that

[TABLE]

Observe that

[TABLE]

On the other hand, we have

[TABLE]

Arguing as above, we have the estimate

[TABLE]

In the above display, $J_{i}(\epsilon)$ stands for the sequences

[TABLE]

The last two terms depend on the Malliavin derivatives of $\partial F_{s}$ and $H_{s}$ are they are given by

[TABLE]

Arguing as above, by the dominated convergence theorem we conclude that the Skorohod integral

[TABLE]

This ends the proof of (6.12), and the proof of the theorem is now easily completed.

We end this section with some comments.

Remark 6.4.

Recalling that the diffusion flow $Y_{t}$ introduced in (6.1) has finite absolute moments of any order, the integrability conditions stated in (6.8) and (6.9) are satisfied as soon as the functions $F_{t},G_{t},H_{t}$ , the differentials $\partial F_{t},\partial^{2}F_{t},\partial H_{t}$ , and the Malliavin derivatives $D_{s}H_{t},D_{s}\partial F_{t}$ have at most polynomial growth w.r.t. the state variable.

It is now readily check that $(H_{1})^{\prime}$ and $(H_{2})^{\prime}$ are met for the random fields introduced in (6.5).

The proof can be also be extended without difficulties to multivariate models. Following the proof of proposition 3.1 in [41], an alternative proof of theorem 6.3 based on Itô formula for Hilbert space valued processes can be developed. This elegant functional approach requires to introduce a custom Hilbert-space valued processes framework but this approach avoids to do explicitly the interchange of integration using the Fubini theorem for Skorohod and measure theory integrals. As the statement of proposition 3.1 in [41], the assumptions of theorem 6.3 can also be weaken when expressed in terms of this generalized stochastic calculus for Hilbert-space valued processes.

7 Illustrations

7.1 Perturbation analysis

Assume that $\overline{\sigma}=\sigma$ and the drift function $\overline{b}_{t}$ is given by a first order expansion

[TABLE]

for some perturbation parameter $\delta\in[0,1]$ and some functions $b^{(i)}_{\delta,t}(x)$ with $i=1,2$ .

In this context, the stochastic flow $\overline{X}_{s,t}(x):=X^{\delta}_{s,t}(x)$ can be seen as a $\delta$ -perturbation of ${X}_{s,t}(x):=X^{0}_{s,t}(x)$ .

We further assume that the unperturbed diffusion satisfies condition $({\cal T})_{2}$ .

To avoid unnecessary technical discussions on the existence of absolute moments of the flows we also assume that $b^{(i)}_{\delta,t}(x)$ are uniformly bounded w.r.t. the parameters $(\delta,t,x)$ . In addition, $b^{(1)}_{t}(x)$ is differentiable w.r.t. the coordinate $x$ and it has uniformly bounded gradients. In this situation, we set

[TABLE]

With some additional work to estimate the absolute moments of the flows, the perturbation analysis presented below allows to handle more general models. The methodology described in this section can also be extended to expand the flow $X^{\delta}_{s,t}(x)$ at any order as soon as $\delta\mapsto b_{\delta,t}(x)$ is sufficiently smooth.

The first order approximation is given by the following theorem.

Theorem 7.1.

For any $s\leq t$ , $x\in\mathbb{R}^{d}$ and $\delta\geq 0$ we have the first order expansion

[TABLE]

with the first order stochastic flow

[TABLE]

The remainder second order term $\partial^{2}_{\delta}{X}_{s,t}(x)$ in the above display is such that for any $n\geq 2$ s.t. $\lambda_{A}(n)>0$ we have the uniform estimate

[TABLE]

Proof.

Using (4.12) we readily check that

[TABLE]

By proposition 3.2 for any $n\geq 2$ we have

[TABLE]

This yields the first order Taylor expansion (7.1) with

[TABLE]

and the second order remainder terms

[TABLE]

Arguing as above, for any $n\geq 2$ s.t. $\lambda_{A}^{+}(n)>0$ we have the uniform estimate

[TABLE]

To estimate $\partial^{(2,1)}_{\delta}{X}_{s,t}(x)$ we need to consider the second order decompositions

[TABLE]

Combining proposition 3.3 with the estimate (7.2) for any $n\geq 2$ s.t. $\lambda_{A}(n)>0$ we check that

[TABLE]

for some universal constant $c<\infty$ and the parameter ${\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}(b,\sigma)$ introduced in (2.6). This ends the proof of (7.1). The proof of the theorem is completed.

7.2 Interacting diffusions

Consider a system of $N$ interacting and $\mathbb{R}^{d}$ -valued diffusion flows $X^{i}_{s,t}(x)$ , with $1\leq i\leq N$ given by a stochastic differential equation of the form

[TABLE]

for some Lipschitz functions $B_{t}(x,y)$ and $\sigma_{t}(y)$ with appropriate dimensions. In the above display, $W^{i}_{t}$ stands for a collection of independent copies of $d$ -dimensional Brownian motion $W_{t}$ . Assume that $B_{t}(x,y)$ linear w.r.t. the first coordinate.

In this situation, up to a change of probability space, the empirical mean of the process

[TABLE]

satisfies the stochastic differential equation

[TABLE]

Formally, the above diffusion converges as $N\rightarrow\infty$ to the flow ${X}_{s,t}(x)$ of the dynamical system defined by

[TABLE]

More rigorously and without further work, the forward-backward interpolation formula (1.10) yields directly the bias-variance error decomposition

[TABLE]

This readily implies the a.s. convergence

[TABLE]

After some elementary manipulations we check the bias formula

[TABLE]

We also have the almost sure fluctuation theorem

[TABLE]

7.3 Time discretization schemes

This section is mainly concerned with the proof of proposition 1.4. We fix some parameter $h>0$ and some $s\geq 0$ and for any $t\in[s+kh,s+(k+1)h[$ we set

[TABLE]

for some fluctuation parameter $\sigma\geq 0$ . For any $s+kh\leq u<s+(k+1)h$ we have

[TABLE]

Using (4.12), in terms of the tensor product (2.1) we readily check that

[TABLE]

Combining (3.5) with the Minkowski integral inequality we check that

[TABLE]

where the second line follows from the exponential estimate of the tangent process from proposition 3.3. The integrand will be bounded as follows: for any $s+kh\leq u<s+(k+1)h$ and any $n\geq 1$ we have

[TABLE]

which then yields the stated result of the proposition. We now prove the stated bound on the difference of the drift processes. For any $s+kh\leq u<s+(k+1)h$ we have

[TABLE]

The $\mathbb{L}_{n}$ -norm of the second integral term is bounded by $\|\nabla b\|\sigma\sqrt{h}$ .

The assumption $\langle x,b(x)\rangle\leq-\beta~{}\|x\|^{2}$ , for some $\beta>0$ , implies the stochastic flows $X_{s,t}(x)$ has uniform absolute moments of any order $n\geq 1$ w.r.t. the time horizon, that is, we have that

[TABLE]

The stochastic flows $X_{s,t}^{h}(x)$ also obey a similar moment bound: observe that for any $t\in[s+kh,s+(k+1)h[$ we have

[TABLE]

Thus, for any $\epsilon>0$ we have

[TABLE]

We can check that the stochastic flows $X_{s,t}^{h}(x)$ also have uniform moments w.r.t. the time horizon; that is, for any $n\geq 1$ we have that

[TABLE]

Using this bounds, we check that

[TABLE]

The end of the proof now follows elementary manipulations, thus it is skipped. The proof of proposition 1.4 is now completed.

Appendix

In this appendix we prove the estimates (1.16) and (2.10) and proposition 3.3.

Proof of (2.10)

Whenever $({\cal M})_{n}$ is satisfied, we have

[TABLE]

with the parameters

[TABLE]

Observe that

[TABLE]

After some elementary computations, for any $n\geq 1$ we check that

[TABLE]

This implies that

[TABLE]

from which we check that for any $\epsilon>0$ we have

[TABLE]

This implies that

[TABLE]

from which we check that

[TABLE]

as soon as $\epsilon<\beta_{2}-(n-1/2)\alpha_{2}$ and $n\geq 1$ . Replacing $\epsilon$ by $\epsilon(\beta_{2}-(n-1/2)\alpha_{2})$ and then $(2n)$ by $n$ we check that

[TABLE]

This ends the proof of (2.10).

Proof of proposition 3.3

The proof of the estimate (3.10) is mainly based on the following technical lemma of its own interest.

Lemma 7.2.

Let $Z_{t}$ be a non negative diffusion process satisfying in integral sense an inequality of the following form

[TABLE]

for some parameters $\lambda>0$ and $v_{t}\geq 0$ , and some non negative processes $(\alpha_{t},\beta_{t},u_{t})$ . In this situation, for any $\epsilon>0$ we have

[TABLE]

with the parameters

[TABLE]

Proof.

Applying Itô’s formula, for any $n\geq 2$ , we have

[TABLE]

On the other hand, for any $\epsilon>0$ we have the almost sure inequality

[TABLE]

This implies that

[TABLE]

Applying Hölder inequality we check that

[TABLE]

This yields the estimate

[TABLE]

This ends the proof of the lemma.

We set

[TABLE]

and we also consider the collection of parameters

[TABLE]

with the tensor functions $(\tau_{t},\upsilon_{t})$ introduced in (3.9). Observe that

[TABLE]

Whenever $({\cal T})_{2}$ is met we have

[TABLE]

Also observe that

[TABLE]

and

[TABLE]

In the same vein, we have

[TABLE]

We are now in position to prove proposition 3.3.

Proof of proposition 3.3:

Applying the above lemma to the processes

[TABLE]

and the parameters

[TABLE]

we obtain the estimate (7.4) with the parameters

[TABLE]

Observe that

[TABLE]

for some universal constant $c<\infty$ and the parameter ${\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}(b,\sigma)$ defined in (2.6). Using (3.8) we check that

[TABLE]

Assume that

[TABLE]

In this case there exists some $0<\epsilon_{n}\leq 1$ such that for any $0<\epsilon\leq\epsilon_{n}$ we have

[TABLE]

and therefore

[TABLE]

This ends the proof of the proposition.

Proof of (1.16)

Using (2.14), the generalized Minkowski inequality applied to (1.10) whenever $({\cal T})_{n/\delta}$ is met for some $\delta\in]0,1[$ and $n\geq 2$ gives

[TABLE]

The Skorohod integral $S_{s,t}(\Delta\sigma)(x)$ is estimated using theorem 5.2. Using (7.5) and (5.9) we check that

[TABLE]

as soon as the regularity conditions $({\cal T})_{n/\delta_{1}}$ , $(M)_{2n/\delta_{2}}$ and $(T)_{2n/(1-\delta_{2})}$ are satisfied for some parameter $n\geq 2$ and some $\delta_{1},\delta_{2}\in]0,1[$ . Choosing $\delta_{1}=(1-\delta_{2})/2$ and setting $\delta=\delta_{2}$ we check that

[TABLE]

as soon as $(M)_{2n/\delta}$ and $(T)_{2n/(1-\delta)}$ are satisfied for some parameter $n\geq 2$ and some $\delta\in]0,1[$ . For instance, $({\cal M})_{2n/\delta}$ and $({\cal T})_{2n/(1-\delta)}$ are satisfied as soon as

[TABLE]

This ends the proof of (1.16).

Acknowledgments

P. Del Moral is supported in part from the Chair Stress Test, RISK Management and Financial Steering, led by the French Ecole polytechnique and its Foundation and sponsored by BNP Paribas, and by the ANR Quamprocs on quantitative analysis of metastable processes.

We also thank the anonymous reviewers for their excellent suggestions for improving the paper. Their detailed comments greatly improved the presentation of the article.

Bibliography49

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] V. Alekseev. An estimate for the perturbations of the solution of ordinary differential equations. Vestn. Mosk.Univ., Ser. I, Math. Meh. vol. 2, (1961).
2[2] T. Ando and J. L. van Hemmen. An inequality for trace ideals. Commun. Math. Phys., vol. 76, pp. 143–148 (1980).
3[3] M. Arnaudon, P. Del Moral. A variational approach to nonlinear and interacting diffusions. Ar Xiv:1812.04269 (2018). Stochastic Analysis and Applications DOI: 10.1080/07362994.2019.1609985 (2019).
4[4] M. Arnaudon, P. Del Moral. A duality formula and a particle Gibbs sampler for continuous time Feynman-Kac measures on path spaces. Ar Xiv 1805.05044 (2018). Electronic Journal of Probability 25 (2020).
5[5] M. Arnaudon, P. Del Moral. A second order analysis of Mc Kean-Vlasov semigroups. Ar Xiv:1906.05140 (2019 ), Annals of Applied Probability, vol. 30, no. 6, pp. 2613--2664. (2020).
6[6] M. Arnaudon, H. Plank, A. Thalmaier. A Bismut type formula for the Hessian of heat semigroups. C. R. Math. Acad. Sci. Paris, vol. 336, no. 8, pp. 661--666 (2003).
7[7] R. Bellman. Some inequalities for the square Root of a Positive Definite Matrix. Linear Algebra and its applications, vol. 1, no. 3, pp. 321--324 (1968).
8[8] R. Bellman, Stability Theory of Differential Equations, Mc Graw Hill, New York, (1953).

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Backward Itô-Ventzell and stochastic interpolation formulae

Abstract

1 Introduction

1.1 Statement of some main results

1.1.1 A backward Itô-Ventzell formula

Theorem 1.1**.**

1.1.2 A stochastic flow interpolation formula

Theorem 1.2**.**

1.1.3 Uniform estimates w.r.t. the time horizon

Theorem 1.3**.**

Proposition 1.4**.**

1.2 Comments and comparisons with existing literature

2 Preliminary results

2.1 Some basic notation

2.2 Regularity conditions and some preliminary results

Lemma 2.1**.**

2.3 Some results on anticipating stochastic calculus

3 Variational equations

3.1 The tangent process

Proposition 3.1**.**

Proof of Prop. 3.3.

Proposition 3.2**.**

3.2 The Hessian process

Proposition 3.3**.**

3.3 Bismut-Elworthy-Li formulae

4 Backward semigroup analysis

4.1 The two-sided stochastic integration

4.2 A multivariate stochastic interpolation formulae

4.3 Semigroup perturbation formulae

Corollary 4.1**.**

Corollary 4.2**.**

4.4 Some extensions

Proposition 4.3**.**

5 Skorohod fluctuation processes

5.1 A variance formula

Lemma 5.1**.**

5.2 Quantitative estimates

Theorem 5.2**.**

Case (s≤u≤v≤t)(s\leq u\leq v\leq t)(s≤u≤v≤t):

Case (s≤v≤u≤t)(s\leq v\leq u\leq t)(s≤v≤u≤t):

5.3 Some extensions

6 Some anticipative calculus

6.1 Extended two-sided stochastic integrals

Lemma 6.1**.**

Proof.

Proposition 6.2**.**

6.2 Generalized backward Itô-Ventzell formula

Theorem 6.3**.**

Remark 6.4**.**

7 Illustrations

7.1 Perturbation analysis

Theorem 7.1**.**

Proof.

7.2 Interacting diffusions

7.3 Time discretization schemes

Appendix

Proof of (2.10)

Proof of proposition 3.3

Lemma 7.2**.**

Proof.

Proof of (1.16)

Acknowledgments

Theorem 1.1.

Theorem 1.2.

Theorem 1.3.

Proposition 1.4.

Lemma 2.1.

Proposition 3.1.

Proposition 3.2.

Proposition 3.3.

Corollary 4.1.

Corollary 4.2.

Proposition 4.3.

Lemma 5.1.

Theorem 5.2.

Case $(s\leq u\leq v\leq t)$ :

Case $(s\leq v\leq u\leq t)$ :

Lemma 6.1.

Proposition 6.2.

Theorem 6.3.

Remark 6.4.

Theorem 7.1.

Lemma 7.2.