Rough nonlocal diffusions

Michele Coghi; Torstein Nilssen

arXiv:1905.07270·math.PR·July 27, 2021

TL;DR

This paper develops a new rough path framework to analyze nonlinear Fokker-Planck equations driven by rough signals, specifically addressing McKean-Vlasov diffusions with common noise, and establishes their well-posedness.

Contribution

It introduces a self-contained nonlinear rough integration theory and defines solutions for the associated Fokker-Planck equations, proving their well-posedness.

Findings

01 Established a well-posedness theory for rough McKean-Vlasov equations.

02 Developed a novel nonlinear rough integration framework.

03 Provided a solution concept for rough Fokker-Planck equations.

Abstract

We consider a nonlinear Fokker-Planck equation driven by a deterministic rough path which describes the conditional probability of a McKean-Vlasov diffusion with "common" noise. To study the equation we build a self-contained framework of non-linear rough integration theory which we use to study McKean-Vlasov equations perturbed by rough paths. We construct an appropriate notion of solution of the corresponding Fokker-Planck equation and prove well-posedness.

Equations

$$dX_{t}^{i}=\frac{1}{N}\sum_{j=1}^{N}b(X_{t}^{j},X_{t}^{i})\,dt+\frac{1}{N}\sum_{j=1}^{N}\sigma(X_{t}^{j},X_{t}^{i})\,dW_{t}^{i}+\frac{1}{N}\sum_{j=1}^{N}\beta(X_{t}^{j},X_{t}^{i})\circ dB_{t}.$$

$$\left\{\begin{array}{ll}dx_{t}&=\int_{\mathbb{R}^{d}}b(\omega,x_{t})d\mu_{t}(\omega)dt+\int_{\mathbb{R}^{d}}\sigma(\omega,x_{t})d\mu_{t}(\omega)dW_{t}+\int_{\mathbb{R}^{d}}\beta(\omega,x_{t})d\mu_{t}(\omega)\circ dB_{t}\\ \mu_{t}&=\mathcal{L}(x_{t}|\mathcal{F}_{t}^{B}).\end{array}\right.$$

$$d\mu_{t}=\frac{1}{2}\operatorname{Tr}\nabla^{2}\big(\sigma(\mu,\cdot)_{t}\sigma(\mu,\cdot)_{t}^{T}\mu_{t}\big)\,dt-\operatorname{div}\big(b(\mu,\cdot)_{t}\mu_{t}\big)\,dt-\operatorname{div}\big(\beta(\mu,\cdot)_{t}\mu_{t}\big)\circ dB_{t},$$

$$\partial_{t}\mu=\frac{1}{2}\operatorname{Tr}\nabla^{2}\big(\sigma(\mu,\cdot)\sigma(\mu,\cdot)^{T}\mu\big)-\operatorname{div}\big(b(\mu,\cdot)\mu\big)-\operatorname{div}\big(\beta(\mu,\cdot)\mu\big)\dot{Z}.$$

$$dx_{t}=b(\mathcal{L}(x_{t}),x_{t})\,dt+\sigma(\mathcal{L}(x_{t}),x_{t})\,dW_{t}+\beta(\mathcal{L}(x_{t}),x_{t})\,dZ_{t}.$$

$$dx_{t}=b_{t}(x_{t})\,dt+\sigma_{t}(x_{t})\,dW_{t}+\beta_{t}(x_{t})\,dZ_{t},$$

$$dx_{t}^{n+1}=b(\mathcal{L}(x_{t}^{n}),x_{t}^{n+1})\,dt+\sigma(\mathcal{L}(x_{t}^{n}),x_{t}^{n+1})\,dW_{t}+\beta(\mathcal{L}(x_{t}^{n}),x_{t}^{n+1})\,dZ_{t}.$$

$$dx_{t}=\beta_{t}\,dZ_{t}.$$

$$dx_{t}=\beta(\mathcal{L}(x_{t}),x_{t})\,dZ_{t},$$

$$\left(\begin{array}{c}\mathbf{W}_{st}\\ \mathbf{Z}_{st}\end{array}\right):=\left(\left(\begin{array}{c}W_{t}-W_{s}\\ Z_{t}-Z_{s}\end{array}\right),\qquad\left(\begin{array}{cc}\int_{s}^{t}(W_{r}-W_{s})dW_{r}&\int_{s}^{t}(W_{r}-W_{s})dZ_{r}\\ \int_{s}^{t}(Z_{r}-Z_{s})dW_{r}&\mathbb{Z}_{st}\end{array}\right)\right),$$

$$dx_{t}=\left(\begin{array}{c}\sigma_{t}\\ \beta_{t}\end{array}\right)(x_{t})\,d\left(\begin{array}{c}\mathbf{W}_{t}\\ \mathbf{Z}_{t}\end{array}\right).$$

$$\int_{s}^{t}\sigma_{r}(x_{r})\,dW_{r}-W_{st}^{\sigma}(x_{s})\ \lor\ \int_{s}^{t}\beta_{r}(x_{r})\,dZ_{r}-Z_{st}^{\beta}(x_{s}),$$

$$[g]_{\zeta,h;E}:=\sup_{(s,t)\in\Delta_{T}:\,|t-s|\leq h}\frac{\|g_{st}\|_{E}}{|t-s|^{\zeta}}<\infty.$$

$$[f]_{\zeta;E}\leq[f]_{\zeta,h;E}\,(1\lor 2h^{\zeta-1})$$

$$C_{0}^{\alpha}([0,T];E):=\Big\{f\in C^{\alpha}([0,T];E):\lim_{h\to 0}[f]_{\alpha,h}=0\Big\}$$

$$[\![g]\!]_{p,[s,t];E}:=\sup_{\pi}\Big(\sum_{\{t_{i}\}=\pi}\|g_{t_{i}t_{i+1}}\|_{E}^{p}\Big)^{1/p}<\infty$$

$$[\![g]\!]_{p,[s,t];E}=\inf\big\{w(s,t)^{1/p}\ \big|\ w\text{ is a control such that }\|g_{uv}\|_{E}\leq w(u,v)^{1/p}\text{ for }s\leq u<v\leq t\big\}.$$

$$\sum_{\pi}\|g_{t_{i}t_{i+1}}\|_{E}^{p}\leq\sum_{\pi}[g]_{\alpha;E}^{p}|t_{i+1}-t_{i}|^{\alpha p}=[g]_{\alpha;E}^{p}|t-s|$$

$$w_{g}(s,t)\leq[g]_{\alpha;E}^{1/\alpha}|t-s|.$$

$$\tau_{0}=s,\qquad\tau_{n+1}=\inf\{t\mid w(\tau_{n},t)\geq\beta,\ \tau_{n}<t\leq T\}\land T,$$

$$N_{\beta}(w,[s,t]):=\sup\{n\geq 0\mid\tau_{n}<t\}.$$

$$\mathbf{Z}:=(Z,\mathbb{Z})\in C^{\alpha}([0,T];E)\times C_{2}^{2\alpha}([0,T];E\otimes E)$$

$$\delta\mathbb{Z}_{s\theta t}=Z_{s\theta}\otimes Z_{\theta t},$$

$$[\mathbf{Z}-\mathbf{X}]_{\alpha,h}:=[Z-X]_{\alpha,h}+[\mathbb{Z}-\mathbb{X}]_{2\alpha,h}.$$

$$Y:[0,T]\to\mathcal{L}(\mathbb{R}^{m};E),\qquad Y^{\prime}:[0,T]\to\mathcal{L}(\mathbb{R}^{m\times m};E)$$

$$Y_{st}^{\sharp}:=\delta Y_{st}-Y_{s}^{\prime}Z_{st},\quad\Longrightarrow\quad Y^{\sharp}\in C_{2}^{2\alpha}([0,T];\mathcal{L}(\mathbb{R}^{m};E)).$$

$$\|(Y,Y^{\prime})\|_{Z,\alpha,h;E}:=|Y_{0}|+[Y^{\prime}]_{\alpha,h;E}+[Y^{\sharp}]_{2\alpha,h;E}.$$

$$[\delta g]_{\zeta,h;E}:=\sup_{s<\theta<t:\,|t-s|\leq h}\frac{\|\delta g_{s\theta t}\|_{E}}{|t-s|^{\zeta}}<\infty$$

$$\delta I(g)_{st}=g_{st}+I(g)_{st}^{\natural}$$

$$g_{st}:=Y_{s}Z_{st}+Y_{s}^{\prime}\mathbb{Z}_{st}:=Y_{s}^{k}Z_{st}^{k}+Y_{s}^{k,l}\mathbb{Z}_{st}^{l,k}.$$

Full text

Rough nonlocal diffusions

Michele Coghi , Torstein Nilssen

WIAS Berlin, Mohrenstraße 39, 10117 Berlin. Support from the Berlin Mathematics Research Center MATH+ is gratefully acknowledged.

Institute of Mathematics, Technical University of Berlin, Germany, Financial support by the DFG via Research Unit FOR 2402 is gratefully acknowledged.

MSC Classification Numbers: 60H05, 60H15, 60J60, 35K55.

Key words: Rough paths, Stochastic PDEs, McKean-Vlasov, non-local equations.

1 Introduction
2 Notations and preliminary results
2.1 Hölder and p-variation spaces
2.2 Rough paths
2.3 Taylor’s formula
2.4 Wasserstein metric
2.5 Spatial function spaces
3 Non linear integration
3.1 A priori estimates
3.2 A priori contractive estimates
3.3 Well-posedness of nonlinear RDEs
4 Rough non-linearities
4.1 Construction of the rough driver
4.1.1 Itô theory
4.1.2 Gubinelli integration
4.1.3 Mixed Itô and rough path integration
4.2 Integrability of the random rough driver
4.3 The average Itô formula
5 Linear Rough PDE
5.1 Unbounded rough drivers
5.2 A priori estimates for smooth vector fields
5.3 Existence of a smooth solution
5.4 Uniqueness
6 The McKean-Vlasov equation
7 Non local rough PDEs
A Appendix
A.1 Kolmogorov continuity theorem
A.2 Weakly geometric rough paths
A.3 A separable subspace of the Hölder space

1 Introduction

The term diffusion is used both for the macroscopic (Eulerian) description of the density of a substance occupying some space and for the infinitesimal (Lagrangian) description of the particles of the substance. Many physical phenomena are, however, inherently nonlinear in the sense that the dynamics of the system depend not only on space but also on the density of the substance itself. In this paper we study this type of nonlinear diffusion from both the Eulerian and the Lagrangian perspective when the diffusion is perturbed by a rough path. We are motivated by dynamics that arise from interacting particle systems with common noise:

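$$dX_{t}^{i}=\frac{1}{N}\sum_{j=1}^{N}b(X_{t}^{j},X_{t}^{i})\,dt+\frac{1}{N}\sum_{j=1}^{N}\sigma(X_{t}^{j},X_{t}^{i})\,dW_{t}^{i}+\frac{1}{N}\sum_{j=1}^{N}\beta(X_{t}^{j},X_{t}^{i})\circ dB_{t}.$$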

Here each particle $X^{i}$ is influenced by two independent sources of noise: the Brownian motion $B$, which is visible to all particles (common noise), and the Brownian motion $W^{i}$, which represents a noise term specific to particle $X^{i}$ (footnote 1: since we consider only geometric rough paths in this paper, we use Stratonovich integration for the $B$ term). Since $B$ influences every particle, taking the limit $N\rightarrow\infty$ will only average out the individual noise terms, giving, at least formally, the mean-field dynamics

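$$\left\{\begin{array}{ll}dx_{t}&=\int_{\mathbb{R}^{d}}b(\omega,x_{t})d\mu_{t}(\omega)dt+\int_{\mathbb{R}^{d}}\sigma(\omega,x_{t})d\mu_{t}(\omega)dW_{t}+\int_{\mathbb{R}^{d}}\beta(\omega,x_{t})d\mu_{t}(\omega)\circ dB_{t}\\ \mu_{t}&=\mathcal{L}(x_{t}|\mathcal{F}_{t}^{B}).\end{array}\right.\tag{1}$$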
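To illustrate why the common noise survives the limit $N\rightarrow\infty$ while the individual noises average out, here is a minimal Euler-Maruyama sketch of the particle system. The coefficients are illustrative assumptions and are not taken from the paper: $b(y,x)=y-x$, constant $\sigma$ and constant $\beta$ (for constant $\beta$ the Stratonovich and Itô forms coincide). The function name and parameters are hypothetical.

```python
import math
import random

def simulate_particles(n_particles=200, n_steps=200, T=1.0,
                       sigma=0.5, beta=0.5, seed=0):
    """Euler-Maruyama for dX^i = mean_j(X^j - X^i) dt + sigma dW^i + beta dB.

    Assumed toy coefficients: b(y, x) = y - x, sigma and beta constant.
    Returns the final particle positions and the value of beta * B_T.
    """
    rng = random.Random(seed)
    dt = T / n_steps
    sqrt_dt = math.sqrt(dt)
    x = [0.0] * n_particles
    common = 0.0  # running value of beta * B_t
    for _ in range(n_steps):
        mean = sum(x) / n_particles
        dB = rng.gauss(0.0, sqrt_dt)  # one increment shared by all particles
        common += beta * dB
        x = [xi + (mean - xi) * dt          # mean-field drift
             + sigma * rng.gauss(0.0, sqrt_dt)  # individual noise dW^i
             + beta * dB                    # common noise
             for xi in x]
    return x, common

x, common = simulate_particles()
empirical_mean = sum(x) / len(x)
print(empirical_mean, common)
```

In this toy model the empirical mean follows $\beta B_{t}$ up to an error of order $\sigma/\sqrt{N}$, which is the averaging effect described above.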

We note that the conditional law $\mathcal{L}(x_{t}|\mathcal{F}_{t}^{B})$ heuristically satisfies the non-local Fokker-Planck equation

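$$d\mu_{t}=\frac{1}{2}\operatorname{Tr}\nabla^{2}\big(\sigma(\mu,\cdot)_{t}\sigma(\mu,\cdot)_{t}^{T}\mu_{t}\big)\,dt-\operatorname{div}\big(b(\mu,\cdot)_{t}\mu_{t}\big)\,dt-\operatorname{div}\big(\beta(\mu,\cdot)_{t}\mu_{t}\big)\circ dB_{t},\tag{2}$$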

where we have used the notation $\sigma(\mu,x)_{t}=\int_{\mathbb{R}^{d}}\sigma(\omega,x)d\mu_{t}(\omega)$ etc. and $\operatorname{Tr}\nabla^{2}(a)=\sum_{i,j=1}^{d}\partial_{i}\partial_{j}a^{i,j}$ for a matrix valued function $a$ . In fact, we can also address the case when $\sigma$ is a certain type of Lipschitz nonlinearity on $\mathcal{P}(\mathbb{R}^{d})\times\mathbb{R}^{d}$ , where $\mathcal{P}(\mathbb{R}^{d})$ denotes the set of probability measures on $\mathbb{R}^{d}$ , see Assumption 6.2. We will only address the case when $\beta$ and $b$ are linear in their second argument.

In practice, (2) is difficult to solve since it needs to be formulated on a very large state space, namely $[C([0,T];\mathcal{P}(\mathbb{R}^{d}))]^{\Omega}$ where $\Omega$ is the underlying probability space. Even when $\Omega$ is finite, this space is too large for analysis, since it is difficult to find the compact subsets that are used for proving well-posedness of (1) and (2). For a long time, well-posedness of equation (2) was known only for densities, see [20]. A proper well-posedness result in the space of measures was obtained only very recently in [8].

In this paper we take a different approach: we study equation (1) for a fixed sample path of the Brownian motion. Our method relies on the theory of rough paths and, as such, allows the study of (1) where $B$ is replaced by any path that can be lifted to a rough path. In particular, no Markov or martingale structure is needed for the common noise.

From now on we replace $B$ by a (deterministic) rough path $\mathbf{Z}=(Z,\mathbb{Z})$ , and equation (2) becomes

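$$\partial_{t}\mu=\frac{1}{2}\operatorname{Tr}\nabla^{2}\big(\sigma(\mu,\cdot)\sigma(\mu,\cdot)^{T}\mu\big)-\operatorname{div}\big(b(\mu,\cdot)\mu\big)-\operatorname{div}\big(\beta(\mu,\cdot)\mu\big)\dot{Z}.\tag{3}$$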

The main contribution of this paper is the following.

Theorem (see Theorems 7.2 and 7.4).

Given a probability measure $\mu_{0}$ on $\mathbb{R}^{d}$ with finite $\rho$ -th moment, for any $\rho\geq 2$ , there exists a unique measure-valued path $\mu:[0,T]\to\mathcal{P}(\mathbb{R}^{d})$ , which solves (3) with initial condition $\mu_{0}$ .

Moreover, we will prove in Theorem 7.2 that the unique solution is given by $\mu_{t}:=\mathcal{L}(x_{t})$, namely the law of the solution $x$ to the McKean-Vlasov equation

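$$dx_{t}=b(\mathcal{L}(x_{t}),x_{t})\,dt+\sigma(\mathcal{L}(x_{t}),x_{t})\,dW_{t}+\beta(\mathcal{L}(x_{t}),x_{t})\,dZ_{t}.\tag{4}$$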

We will show well-posedness of (4) in Section 6.

The strategy for proving uniqueness for equation (3) relies on showing that every solution must be the law of a solution to the McKean-Vlasov equation. As will be clear in the proof of Theorem 7.4, this also requires well-posedness of the equation

$$dx_{t}=b_{t}(x_{t})\,dt+\sigma_{t}(x_{t})\,dW_{t}+\beta_{t}(x_{t})\,dZ_{t},\tag{5}$$

for given time inhomogeneous functions $b$ , $\sigma$ and $\beta$ , where the time dependence is induced by the law. Moreover, a common approach to proving well-posedness of (4) is to construct the solution as a fixed point in the space of measures on an appropriate function space. Towards this end one would e.g. define inductively

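$$dx_{t}^{n+1}=b(\mathcal{L}(x_{t}^{n}),x_{t}^{n+1})\,dt+\sigma(\mathcal{L}(x_{t}^{n}),x_{t}^{n+1})\,dW_{t}+\beta(\mathcal{L}(x_{t}^{n}),x_{t}^{n+1})\,dZ_{t}.$$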

Once again, it is necessary to give a meaning to equation (5). If we consider the case $b=\sigma=0$ and $\beta_{t}(x)=\beta_{t}$ the equation reads

$$dx_{t}=\beta_{t}\,dZ_{t}.$$

It is well known that the above integration does not make sense unless we impose additional structure on $\beta$, namely that there exists a Taylor-type expansion around the irregular path $Z$, which is exactly the notion of controlled rough paths as introduced by Gubinelli in [17]. If one aims to solve a mean-field equation of the form

$$dx_{t}=\beta(\mathcal{L}(x_{t}),x_{t})\,dZ_{t},$$

where $\mathcal{L}(x_{t})$ denotes the law of $x$ , and $\beta$ is an appropriate function on the space of measures, it is reasonable to expect that $t\mapsto\beta(\mathcal{L}(x_{t}),x)$ has such a decomposition and that one could solve the equation as a fixed-point in an appropriate space of measures.

Following this logic, if we want to consider the equation with added Brownian motion (4) as a fixed point, we must be able to solve equation (5). The usual way to study this hybrid rough path and Itô equation, see [12], [13] and [14], is to consider the joint rough path

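$$\left(\begin{array}{c}\mathbf{W}_{st}\\ \mathbf{Z}_{st}\end{array}\right):=\left(\left(\begin{array}{c}W_{t}-W_{s}\\ Z_{t}-Z_{s}\end{array}\right),\qquad\left(\begin{array}{cc}\int_{s}^{t}(W_{r}-W_{s})dW_{r}&\int_{s}^{t}(W_{r}-W_{s})dZ_{r}\\ \int_{s}^{t}(Z_{r}-Z_{s})dW_{r}&\mathbb{Z}_{st}\end{array}\right)\right),\tag{6}$$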

and recast the equation in the form of a rough path equation

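$$dx_{t}=\left(\begin{array}{c}\sigma_{t}\\ \beta_{t}\end{array}\right)(x_{t})\,d\left(\begin{array}{c}\mathbf{W}_{t}\\ \mathbf{Z}_{t}\end{array}\right).$$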

Again, one would need to make an expansion of $(\sigma_{t},\beta_{t})^{T}$ in terms of the path $(W,Z)^{T}$. However, with the goal of solving mean-field equations in mind, the simplest examples show that there is no reason to expect that $\sigma_{t}$ is controlled by a fixed Brownian path in any sense: the law of the solution is an average over all Brownian sample paths.

Instead, if we define $W_{st}^{\sigma}(x)=\int_{s}^{t}\sigma_{r}(x)dW_{r}$ as a Wiener-Itô integral and $Z_{st}^{\beta}(x)=\int_{s}^{t}\beta_{r}(x)d\mathbf{Z}_{r}$ as a rough path integral, then on small time scales one would expect

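$$\int_{s}^{t}\sigma_{r}(x_{r})\,dW_{r}-W_{st}^{\sigma}(x_{s})\ \lor\ \int_{s}^{t}\beta_{r}(x_{r})\,dZ_{r}-Z_{st}^{\beta}(x_{s}),$$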

to be small, so that one could use $W^{\sigma}$ and $Z^{\beta}$ to define a notion of non-linear integration (footnote 2: we call the integration non-linear since a mapping $x\mapsto\int f_{r}(x_{r})dr$ is obviously never linear). At the heart of all stochastic integration is the difficulty that the above is not enough to guarantee a canonically defined integration map in the pathwise sense. The fundamental insight of rough path theory is that one can construct integrals once additional information about the driving path is supplied by some off-line argument, e.g. stochastic integration.

Existing literature

The stochastic equations (1) and (2) have been studied in [20] and [21], but with a focus on the case where the initial condition has a density. The measure-valued case was studied very recently in [8]. Under more restrictive conditions, either on the class of solutions or on the coefficients (like strong parabolicity), the well-posedness of solutions to SPDEs of the type (2) had previously been considered by Dawson and Vaillancourt in [10].

McKean-Vlasov equations from a rough path perspective have already been introduced in [7] and, more recently, in [1], focusing on the Lagrangian description. In [1] the equation is driven by a general random rough path, which gives the additional difficulty of needing to keep track of the rough path as an $L^{p}(\Omega)$-valued path. The latter space is present in order to regard a probability measure as the law of a random variable and to use Lions’ approach to calculus for the Wasserstein metric. The approach by Gubinelli on controlled rough paths is then used to solve the equation as a fixed point in the mixed $\mathbb{R}^{d}$ and $L^{p}(\Omega)$-space.

We mention also [5], where the authors study mean-field games in the presence of a common noise as in (1). The authors use tightness arguments along with approximations to prove existence of (probabilistically) weak solutions. Then, the authors prove a Yamada-Watanabe type principle for these equations to prove existence and uniqueness of (probabilistically) strong solutions.

In Section 3 we build a version of rough path theory that allows for time-dependent coefficients. The results in this section should be compared to [3], where the authors solve equations of this form. There, the main focus is on flows built from a non-linear version of the sewing lemma. Very recently, right before the completion of the present paper, the authors of [23] introduced the very same object, here called a nonlinear rough path. The authors use a similar set-up as in [17] to solve rough equations with time-dependent coefficients.

The papers [3] and [23] do not contain the same precise estimates as the present paper, which are crucially needed to set up a contraction mapping for the McKean-Vlasov equation (4).

Main contributions

The main contribution of this paper is the formulation and well-posedness of the nonlinear Fokker-Planck equation in terms of the appropriate rough path topology. We believe this is the first paper to study a rough non-local diffusion from both the Lagrangian and the Eulerian perspective. Furthermore, we believe it is the first work to prove well-posedness of an equation with a nonlinearity in the noise term of this form.

It is plausible that the well-posedness of the McKean-Vlasov equation in the present paper can be seen as a particular case of the equation studied in [1] by doing a rough path lift of $W$ and $\mathbf{Z}$ as in (6), but now as a rough path with values in an $L^{p}(\Omega)$-space. However, our proof of the well-posedness of the nonlinear Fokker-Planck equation necessitates well-posedness of a rough path equation with time-dependent coefficients. As already mentioned, it is not reasonable to expect that the coefficients could be controlled by a single Brownian path, so one could not use [1] for the time-dependent case. Moreover, for the same reason, time-dependent coefficients are also needed to understand the McKean-Vlasov equation as a fixed point of linear diffusions in an appropriate space of measures.

In addition, we prove a result on existence of a solution to a linear, possibly degenerate, rough PDE which could be of independent interest.

Structure of the paper

The paper is structured as follows. In Section 2 we introduce the necessary concepts from rough path theory, including controlled rough paths, that will be needed for the paper. In Section 3 we introduce the corresponding integration theory to handle non-linear integration and differential equations. In Section 4 we show how to concretely build rough drivers from Itô integration theory and the theory of controlled paths. These examples will also act exactly as the rough drivers needed to formulate the McKean-Vlasov equation as a fixed point. Moreover, this section contains an average, in $\Omega$ , Itô formula that allows us to prove that the law of a diffusion solves the Fokker-Planck equation (linear or nonlinear). In Section 5 we prove well-posedness for a linear RPDE with time dependent coefficients. In Section 6 we construct the appropriate space for solving the McKean-Vlasov equation. In Section 7 we prove uniqueness of our main equation, which hinges on the results of the previous sections.

2 Notations and preliminary results

2.1 Hölder and p-variation spaces

For $T>0$ we let $\Delta_{T}$ denote the simplex $\Delta_{T}=\{(s,t)\in[0,T]^{2}:s<t\}$ . For $\zeta>0$ and a Banach space $E$ we denote by $C_{2}^{\zeta}([0,T];E)$ the space of all continuous mappings $g:\Delta_{T}\rightarrow E$ such that

$$[g]_{\zeta,h;E}:=\sup_{(s,t)\in\Delta_{T}:\,|t-s|\leq h}\frac{\|g_{st}\|_{E}}{|t-s|^{\zeta}}<\infty.$$

It can be checked that the above space is independent of $h$ , and we will write for simplicity $[g]_{\alpha;E}:=[g]_{\alpha,T;E}$ . When it is clear from the context, we will also omit the Banach space $E$ , writing $[g]_{\alpha,h}$ and $[g]_{\alpha}$ . We let $C^{\zeta}([0,T];E)$ denote the space of all paths $f:[0,T]\rightarrow E$ such that the increment $\delta f_{st}:=f_{t}-f_{s}$ belongs to $C^{\zeta}_{2}([0,T];E)$ . For simplicity we will write $[f]_{\alpha,h;E}:=[\delta f]_{\alpha,h;E}$ . It is well known that local and global Hölder norms are comparable for paths, in the sense that

$$[f]_{\zeta;E}\leq[f]_{\zeta,h;E}\,(1\lor 2h^{\zeta-1})$$

for all $f\in C^{\zeta}([0,T];E)$ (see Exercise 4.25 in [15]). It is well known that the Hölder spaces are not separable. However, the subspace

$$C_{0}^{\alpha}([0,T];E):=\Big\{f\in C^{\alpha}([0,T];E):\lim_{h\to 0}[f]_{\alpha,h}=0\Big\}$$

is separable, as proved in Proposition A.4.

We let $C_{2}^{p-\operatorname{var}}([0,T];E)$ be the space of all continuous mappings $g:\Delta_{T}\rightarrow E$ such that

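$$[\![g]\!]_{p,[s,t];E}:=\sup_{\pi}\Big(\sum_{\{t_{i}\}=\pi}\|g_{t_{i}t_{i+1}}\|_{E}^{p}\Big)^{1/p}<\infty$$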

where the above supremum is taken over all partitions $\pi$ of $[s,t]$ . If we define $w_{g}(s,t):=[\![g]\!]_{p,[s,t];E}^{p}$ it can be shown that $(s,t)\mapsto w_{g}(s,t)$ is a control, namely continuous and superadditive i.e. $w_{g}(s,u)+w_{g}(u,t)\leq w_{g}(s,t)$ . Moreover, we see that if there exists a control $w$ such that $\|g_{st}\|_{E}\leq w(s,t)^{1/p}$ , then $w_{g}(s,t)\leq w(s,t)$ , so that we could equivalently define

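$$[\![g]\!]_{p,[s,t];E}=\inf\big\{w(s,t)^{1/p}\ \big|\ w\text{ is a control such that }\|g_{uv}\|_{E}\leq w(u,v)^{1/p}\text{ for }s\leq u<v\leq t\big\}.$$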

We will write $[\![g]\!]_{p;E}:=[\![g]\!]_{p,[0,T];E}$ and when the space $E$ is clear from the context we will simply write $[\![g]\!]_{p,[s,t]}$ and $[\![g]\!]_{p}:=[\![g]\!]_{p,[0,T]}$ .

To see the relationship between Hölder continuity and $p$ -variation, notice that for any partition $\pi$ we have

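$$\sum_{\pi}\|g_{t_{i}t_{i+1}}\|_{E}^{p}\leq\sum_{\pi}[g]_{\alpha;E}^{p}|t_{i+1}-t_{i}|^{\alpha p}=[g]_{\alpha;E}^{p}|t-s|$$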

when $\alpha=1/p$ , which gives the bound

$$w_{g}(s,t)\leq[g]_{\alpha;E}^{1/\alpha}|t-s|.$$
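The relationship between Hölder continuity and $p$-variation with $\alpha=1/p$ can be checked numerically on a sampled path. The sketch below is an illustration only: it restricts the supremum to partitions made of sample points (an assumption of the discretisation), computes it by dynamic programming, and compares it with the discrete Hölder seminorm. The function names are hypothetical.

```python
def p_variation_pow(path, p):
    """sup over partitions (of the sample indices) of sum |x_{t_{i+1}} - x_{t_i}|^p.

    best[j] is the largest sum over partitions of [0, j] that end at index j.
    """
    n = len(path)
    best = [0.0] * n
    for j in range(1, n):
        best[j] = max(best[i] + abs(path[j] - path[i]) ** p for i in range(j))
    return best[-1]

def holder_seminorm(path, times, alpha):
    """Discrete Hoelder seminorm: max over sample pairs of |x_t - x_s| / |t - s|^alpha."""
    n = len(path)
    return max(abs(path[j] - path[i]) / (times[j] - times[i]) ** alpha
               for i in range(n) for j in range(i + 1, n))

p = 2.0
times = [k / 32 for k in range(33)]
path = [t ** 0.5 for t in times]  # a 1/2-Hoelder path on [0, 1]

w_g = p_variation_pow(path, p)                    # discrete analogue of w_g(0, 1)
bound = holder_seminorm(path, times, 1.0 / p) ** p * (times[-1] - times[0])
print(w_g, bound)
```

For this example both quantities are equal to 1 up to rounding, so the bound is attained.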

Given a control $w$ , we construct the greedy partition, following [15, Chapter 11]; for $\beta>0$ , define the partition $\{\tau_{n}\}_{n}$ as

$$\tau_{0}=s,\qquad\tau_{n+1}=\inf\{t\mid w(\tau_{n},t)\geq\beta,\ \tau_{n}<t\leq T\}\land T,$$

so that $w(\tau_{n},\tau_{n+1})=\beta$ , for all $n<N$ , and $w(\tau_{N},\tau_{N+1})\leq\beta$ . Define now the integer

$$N_{\beta}(w,[s,t]):=\sup\{n\geq 0\mid\tau_{n}<t\}.$$
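The greedy partition and the resulting integer can be sketched in code as follows. This is a discretised illustration under stated assumptions: the infimum is searched over a finite grid, the interval endpoint `t` plays the role of the time horizon, and the control $w(u,v)=4(v-u)$ is an arbitrary example. The function names are hypothetical.

```python
def greedy_partition(w, beta, s, t, grid):
    """Greedy times tau_0 = s, tau_{n+1} = first grid time u > tau_n with
    w(tau_n, u) >= beta, capped at t (grid-based stand-in for the infimum)."""
    taus = [s]
    while taus[-1] < t:
        last = taus[-1]
        nxt = next((u for u in grid if last < u <= t and w(last, u) >= beta), t)
        taus.append(nxt)
    return taus

def local_accumulation(taus, t):
    """N_beta = sup{n >= 0 : tau_n < t}."""
    return max(n for n, tau in enumerate(taus) if tau < t)

grid = [k / 64 for k in range(65)]
control = lambda u, v: 4.0 * (v - u)  # assumed example of a control
taus = greedy_partition(control, 1.0, 0.0, 1.0, grid)
n_beta = local_accumulation(taus, 1.0)
print(taus, n_beta)
```

In this example the greedy times are $0, 0.25, 0.5, 0.75, 1$ and the count is $3$, consistent with the bound $\beta N_{\beta}\leq w(s,t)$ that follows from superadditivity.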

2.2 Rough paths

Assume $E$ is a Banach space and equip $E\otimes E$ with the projective tensor norm. We call a pair

$$\mathbf{Z}:=(Z,\mathbb{Z})\in C^{\alpha}([0,T];E)\times C_{2}^{2\alpha}([0,T];E\otimes E)$$

for $\alpha\in(\frac{1}{3},\frac{1}{2})$ a rough path provided Chen’s relation,

$$\delta\mathbb{Z}_{s\theta t}=Z_{s\theta}\otimes Z_{\theta t},$$

holds where we have defined the second order increment operator $\delta g_{s\theta t}:=g_{st}-g_{\theta t}-g_{s\theta}$ . We denote by $\mathscr{C}^{\alpha}([0,T];E)$ the (non-linear) set of all rough paths which we equip with the subset metric,

$$[\mathbf{Z}-\mathbf{X}]_{\alpha,h}:=[Z-X]_{\alpha,h}+[\mathbb{Z}-\mathbb{X}]_{2\alpha,h}.$$

For a path of bounded variation, $Z:[0,T]\rightarrow E$ there is a canonical rough path, $\mathbf{Z}=(Z,\int Z\otimes dZ)$ where the latter is the iterated integral $\big{(}\int Z\otimes dZ\big{)}_{st}=\int_{s}^{t}Z_{sr}\otimes dZ_{r}$ which is well defined when $Z$ is of bounded variation. We denote by $\mathscr{C}_{g}^{\alpha}([0,T];E)$ the set of geometric rough paths, which is the closure of the set of bounded variation paths in the rough path metric.

We notice that if $\mathbf{Z}$ is geometric, then $\mathbf{Z}$ is also weakly geometric which means $\textrm{sym}(\mathbb{Z}_{st})=\frac{1}{2}Z_{st}\otimes Z_{st}$ , and we denote by $\mathscr{C}_{wg}^{\alpha}([0,T];E)$ the set of all such rough paths. When $E$ is finite dimensional it is known that (see e.g. [16, Proposition 8.12]) if $\mathbf{Z}$ is weakly geometric, there exists a sequence of smooth paths $Z^{n}$ such that $\mathbf{Z}^{n}\rightarrow\mathbf{Z}$ in $\mathscr{C}^{\bar{\alpha}}([0,T];E)$ for all $\bar{\alpha}<\alpha$ .
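Chen's relation and weak geometricity can be verified by hand for the canonical lift of a bounded variation path. The following sketch does this for a piecewise linear path in $\mathbb{R}^{2}$, where the iterated integral can be computed exactly segment by segment. The vertices and helper names are assumptions chosen for illustration.

```python
def increment(path, i, j):
    """First level Z_{ij} = Z_j - Z_i between vertex indices i <= j."""
    return [b - a for a, b in zip(path[i], path[j])]

def outer(u, v):
    """Tensor product u (x) v as a nested list."""
    return [[ui * vj for vj in v] for ui in u]

def iterated_integral(path, i, j):
    """Second level int_i^j Z_{i r} (x) dZ_r of the piecewise linear path.

    On segment k the integral equals (Z_k - Z_i) (x) D_k + 0.5 * D_k (x) D_k,
    where D_k is the increment of the segment.
    """
    d = len(path[0])
    total = [[0.0] * d for _ in range(d)]
    for k in range(i, j):
        step = increment(path, k, k + 1)
        base = outer(increment(path, i, k), step)
        half = outer(step, step)
        for a in range(d):
            for b in range(d):
                total[a][b] += base[a][b] + 0.5 * half[a][b]
    return total

path = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 2.0), (-1.0, 0.5)]
s, theta, t = 0, 2, 4
zz_st = iterated_integral(path, s, t)
zz_s_theta = iterated_integral(path, s, theta)
zz_theta_t = iterated_integral(path, theta, t)

# Chen: ZZ_st - ZZ_{theta t} - ZZ_{s theta} = Z_{s theta} (x) Z_{theta t}
rhs = outer(increment(path, s, theta), increment(path, theta, t))
chen_error = max(abs(zz_st[a][b] - zz_theta_t[a][b] - zz_s_theta[a][b] - rhs[a][b])
                 for a in range(2) for b in range(2))

# Weak geometricity: sym(ZZ_st) = 0.5 * Z_st (x) Z_st
z_st = increment(path, s, t)
sym_error = max(abs(0.5 * (zz_st[a][b] + zz_st[b][a]) - 0.5 * z_st[a] * z_st[b])
                for a in range(2) for b in range(2))
print(chen_error, sym_error)
```

Both errors vanish up to floating-point rounding, as expected for a smooth (here piecewise linear) path.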

Controlled space

Given a path $Z$ taking values in $\mathbb{R}^{m}$ we denote by $\mathscr{D}_{Z}^{2\alpha}([0,T];E)$ the (linear) space of all controlled paths, given by pairs $(Y,Y^{\prime})$ of mappings

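$$Y:[0,T]\to\mathcal{L}(\mathbb{R}^{m};E),\qquad Y^{\prime}:[0,T]\to\mathcal{L}(\mathbb{R}^{m\times m};E)$$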

such that

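$$Y_{st}^{\sharp}:=\delta Y_{st}-Y_{s}^{\prime}Z_{st},\quad\Longrightarrow\quad Y^{\sharp}\in C_{2}^{2\alpha}([0,T];\mathcal{L}(\mathbb{R}^{m};E)).$$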

We call $Y^{\prime}$ the Gubinelli derivative of $Y$ . The above definition is sometimes better understood in coordinates $Y^{\sharp,i}_{st}:=\delta Y_{st}^{i}-Y_{s}^{i,k}Z_{st}^{k}$ where we abuse notation and write $Y^{i,k}$ for the matrix representing the Gubinelli derivative. Above and for the remainder of the paper we shall use the convention of summation over repeated indices. We equip the space of all controlled paths with the norm

$$\|(Y,Y^{\prime})\|_{Z,\alpha,h;E}:=|Y_{0}|+[Y^{\prime}]_{\alpha,h;E}+[Y^{\sharp}]_{2\alpha,h;E}.$$

Sewing lemma and rough path integration

We recall here the main result used to obtain estimates in the theory of rough paths, namely the sewing lemma.

Lemma 2.1.

Suppose $g:\Delta_{T}\rightarrow E$ is such that

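$$[\delta g]_{\zeta,h;E}:=\sup_{s<\theta<t:\,|t-s|\leq h}\frac{\|\delta g_{s\theta t}\|_{E}}{|t-s|^{\zeta}}<\infty$$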

for some $\zeta>1$ and $h>0$ . Then there exists a unique pair $I(g):[0,T]\rightarrow E$ and $I(g)^{\natural}:\Delta_{T}\rightarrow E$ such that

$$\delta I(g)_{st}=g_{st}+I(g)_{st}^{\natural}$$

with $[I(g)^{\natural}]_{\zeta;E}\leq C[\delta g]_{\zeta,h;E}$ for $C$ depending only on $\zeta$ .

In fact, we have $I(g)_{st}:=\lim_{|\pi|\rightarrow 0}\sum_{\pi}g_{t_{i}t_{i+1}}$ and we think of $I(g)$ as being an integral with local expansion $g$ .
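As a simple illustration of the limit of sums in the sewing lemma, the sketch below takes the smooth germ $g_{st}=\cos(s)(\sin t-\sin s)$, for which $\delta g_{s\theta t}=-(\cos\theta-\cos s)(\sin t-\sin\theta)$ is of order $|t-s|^{2}$, so the lemma applies with $\zeta=2$. This example is an assumption chosen because the limit is known in closed form, $\int_{0}^{1}\cos^{2}(r)\,dr=\frac{1}{2}+\frac{\sin 2}{4}$; it is not taken from the paper.

```python
import math

def sewing_sum(g, s, t, n):
    """Sum of the germ g over the uniform partition of [s, t] with n intervals."""
    pts = [s + (t - s) * k / n for k in range(n + 1)]
    return sum(g(a, b) for a, b in zip(pts, pts[1:]))

def germ(s, t):
    """Local expansion g_{st} = Y_s (Z_t - Z_s) with Y = cos and Z = sin."""
    return math.cos(s) * (math.sin(t) - math.sin(s))

exact = 0.5 + math.sin(2.0) / 4.0  # int_0^1 cos(r) d(sin r)
errors = [abs(sewing_sum(germ, 0.0, 1.0, n) - exact) for n in (10, 100, 1000)]
print(errors)
```

The errors shrink roughly in proportion to the mesh size, reflecting that the remainder $I(g)^{\natural}_{st}$ is of order $|t-s|^{\zeta}$ with $\zeta>1$ on each subinterval.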

With this in hand we can define the rough path integral. Given a rough path $\mathbf{Z}$ and a controlled path $(Y,Y^{\prime})\in\mathscr{D}_{Z}^{2\alpha}([0,T];E)$ , define the local expansion

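$$g_{st}:=Y_{s}Z_{st}+Y_{s}^{\prime}\mathbb{Z}_{st}:=Y_{s}^{k}Z_{st}^{k}+Y_{s}^{k,l}\mathbb{Z}_{st}^{l,k}.$$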

Using Chen’s relation it is straightforward to check that $[\delta g]_{3\alpha;E}<\infty$ and we shall write $\int Yd\mathbf{Z}:=I(g)$ .
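For the reader's convenience, here is a sketch of that check. It assumes the local expansion $g_{st}=Y_{s}^{k}Z_{st}^{k}+Y_{s}^{k,l}\mathbb{Z}_{st}^{l,k}$ and the coordinate convention $\delta Y^{k}_{st}=Y^{k,l}_{s}Z^{l}_{st}+Y^{\sharp,k}_{st}$ for the Gubinelli derivative. Using $Z_{st}-Z_{s\theta}=Z_{\theta t}$ and Chen's relation in the form $\mathbb{Z}_{st}-\mathbb{Z}_{s\theta}=\mathbb{Z}_{\theta t}+Z_{s\theta}\otimes Z_{\theta t}$, we get

$$
\begin{aligned}
\delta g_{s\theta t}&=Y_{s}^{k}\big(Z^{k}_{st}-Z^{k}_{s\theta}\big)-Y^{k}_{\theta}Z^{k}_{\theta t}+Y_{s}^{k,l}\big(\mathbb{Z}^{l,k}_{st}-\mathbb{Z}^{l,k}_{s\theta}\big)-Y^{k,l}_{\theta}\mathbb{Z}^{l,k}_{\theta t}\\
&=-\delta Y^{k}_{s\theta}Z^{k}_{\theta t}+Y_{s}^{k,l}Z^{l}_{s\theta}Z^{k}_{\theta t}-\delta Y^{k,l}_{s\theta}\mathbb{Z}^{l,k}_{\theta t}\\
&=-Y^{\sharp,k}_{s\theta}Z^{k}_{\theta t}-\delta Y^{k,l}_{s\theta}\mathbb{Z}^{l,k}_{\theta t}.
\end{aligned}
$$

Each term is a product of a $2\alpha$-Hölder and an $\alpha$-Hölder increment, so that, up to a constant depending on the choice of norms,

$$\|\delta g_{s\theta t}\|_{E}\lesssim\big([Y^{\sharp}]_{2\alpha}[Z]_{\alpha}+[Y^{\prime}]_{\alpha}[\mathbb{Z}]_{2\alpha}\big)|t-s|^{3\alpha},$$

and since $3\alpha>1$ the sewing lemma applies with $\zeta=3\alpha$.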

This construction also gives rise to a new rough path, namely

[TABLE]

where the latter integral is defined by the local expansion

[TABLE]

One can then check that $\mathbf{X}:=(X,\mathbb{X})\in\mathscr{C}^{\alpha}([0,T];E)$ and that this operation is continuous from $\mathscr{D}_{Z}^{2\alpha}([0,T];E)$ to $\mathscr{C}^{\alpha}([0,T];E)$ . Moreover, at least when $E$ is a separable Hilbert space, weak geometricity is preserved under rough path integration as spelled out in Lemma A.2.

We shall also use a slight (straightforward) generalization of the sewing lemma to obtain a priori estimates. Assume that $g$ is such that there exist controls $w$ and $w_{*}$ and a positive function $k$ such that

[TABLE]

for some $\zeta>1$ . Then there exists a universal constant $C$ such that

[TABLE]

2.3 Taylor’s formula

For a path $y:[0,T]\rightarrow\mathbb{R}^{d}$ and a function $g:\mathbb{R}^{d}\rightarrow V$ (where $V$ is a finite-dimensional vector space) we use the notation

[TABLE]

With this notation at hand the first and second order Taylor’s formula reads

[TABLE]

respectively. We obviously get $|[g]^{k,y}_{st}|\lesssim\|g\|_{\infty}$ .

2.4 Wasserstein metric

We shall work with the Wasserstein metric on measures on Hölder spaces, but since separability of the underlying space is required for the Wasserstein metric to give a complete space, we shall use the subspaces $C_{0}^{\alpha}([0,T];\mathbb{R}^{d})$. When the dimension is clear from the context we shall simply write $C_{0}^{\alpha}$. Given two probability measures $\mu,\nu\in\mathcal{P}(C^{\alpha}_{0})$, we say that $\pi\in\mathcal{P}(C^{\alpha}_{0}\times C^{\alpha}_{0})$ is a coupling of $\mu$ and $\nu$ provided its first (respectively second) marginal is equal to $\mu$ (respectively $\nu$). We define the Wasserstein metric

[TABLE]

where the above infimum ranges over all couplings $\pi$ of the measures $\mu$ and $\nu$ . Since $C_{0}^{\alpha}$ is separable we have that $\mathcal{P}_{\rho}(C_{0}^{\alpha})$ is a complete space w.r.t. $W_{\rho}$ .

We note that the $\rho$-th moment of a probability measure $\mu$ can be written $W_{\rho}(\mu,\delta_{0})^{\rho}$ where $\delta_{0}$ is the Dirac delta centered at the path constantly equal to $0$.
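The identity between the $\rho$-th moment and $W_{\rho}(\mu,\delta_{0})^{\rho}$ can be illustrated in a much simpler setting than the path space used here. The sketch below assumes empirical measures on the real line with equally many atoms, for which the monotone (sorted) coupling is optimal when $\rho\geq 1$. It is a toy illustration and the function name is hypothetical.

```python
def wasserstein_1d(xs, ys, rho):
    """W_rho between two empirical measures on R with the same number of atoms.

    Uses the sorted (monotone) coupling, which is optimal on the real line
    for rho >= 1.
    """
    if len(xs) != len(ys):
        raise ValueError("both samples must have the same number of atoms")
    xs, ys = sorted(xs), sorted(ys)
    cost = sum(abs(x - y) ** rho for x, y in zip(xs, ys)) / len(xs)
    return cost ** (1.0 / rho)

atoms = [-1.0, 0.5, 2.0]
rho = 2.0
moment = sum(abs(x) ** rho for x in atoms) / len(atoms)  # rho-th moment of mu
dirac = [0.0] * len(atoms)                               # delta_0 as an empirical measure
print(moment, wasserstein_1d(atoms, dirac, rho) ** rho)
```

Since every coupling with a Dirac mass is the product coupling, the two printed numbers agree up to rounding.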

2.5 Spatial function spaces

We fix $d\in\mathbb{N}$ . For any multi-index $\beta=(\beta_{1},\dots,\beta_{d})$ , we set

[TABLE]

and $|\beta|=\beta_{1}+\cdots+\beta_{d}$ . For $p>1$ and an integer $k\geq 0$ , we let $W^{k,p}=W^{k,p}(\mathbb{R}^{d})$ be the Sobolev space of real-valued functions on $\mathbb{R}^{d}$ with finite norm

[TABLE]

Let $H^{k}:=W^{k,2}(\mathbb{R}^{d};\mathbb{R}^{d})$ be the Sobolev space of square integrable functions over $\mathbb{R}^{d}$, endowed with the norm $\|\cdot\|_{H^{k}}:=\|\cdot\|_{W^{k,2}}$. For a Hilbert space $H$, we endow the space of linear functionals $\mathcal{L}(\mathbb{R}^{d};H)$ with the Hilbert-Schmidt norm

[TABLE]

Moreover, we call $M^{2}_{T}(H)$ the space of $H$ -valued, time-continuous, square integrable martingales endowed with the norm

[TABLE]

Let $k>\frac{d}{2}$ . We denote by $C_{b}^{3}\otimes H^{k}$ the space of continuous functions $f:\mathbb{R}^{d}\times\mathbb{R}^{d}\rightarrow\mathbb{R}^{d}$ such that

(i) For all $x\in\mathbb{R}^{d}$, the function $y\mapsto f(x,y)$ belongs to $H^{k}$.

(ii) For all $y\in\mathbb{R}^{d}$, the function $x\mapsto f(x,y)$ belongs to $C_{b}^{3}$.

(iii) We have

[TABLE]

We endow the space $C_{b}^{3}\otimes H^{k}$ with the induced norm $\|f\|_{C_{b}^{3}\otimes H^{k}}$. Above we have used the Fréchet derivative in the first variable and the weak derivative in the second variable.

Contrary to $H^{l}\otimes H^{k}$ , this space is well suited for the convolution $f(x,y)=\sigma(x-y)$ and we see that $f\in C_{b}^{3}\otimes H^{k}$ if $\sigma\in H^{3+k}$ .

3 Non linear integration

In this section we extend the theory of rough paths to accommodate time-dependent coefficients. We aim to solve the equation

[TABLE]

for a given function $f$ which is a distribution in time but regular in space. We shall use a framework akin to the definition by Davie in [9]. To illustrate the set-up, assume that $x$ is a smooth solution of (17). Integrating the equation and using Taylor’s formula we obtain

[TABLE]

Here we have defined the driver $\mathbf{F}:=(F,\mathbb{F})$ of the equation as follows

[TABLE]

and the remainder as

[TABLE]

With the above notation, we rewrite equation (17) as

[TABLE]

As is usual in rough path theory, we shall now read the definition (18) in the opposite direction - we assume we are given a pair of functions $(F,\mathbb{F})$ satisfying some compatibility conditions (in Definition 3.1 below), and take this as a definition of the non-linearity $f$ . We will then take $x^{\natural}$ to be implicitly defined and say that $x$ is a solution provided $x^{\natural}$ is of high time regularity.

Equation (17) can be read in integral form as $x_{t}=x_{0}+\int_{0}^{t}\mathbf{F}_{dr}(x_{r})$, which can be regarded as a rough version of the semimartingale integration theory by Kunita in [19].

We shall use a similar definition as in [3], with a noticeable difference that we allow our driver to depend on two spatial points. Moreover, we will not only be dealing with weakly geometric drivers.

Definition 3.1.

For $p\in[2,3)$ , a pair of functions $\mathbf{F}=(F,\mathbb{F})\in C^{p-\operatorname{var}}([0,T];C_{b}^{3}(\mathbb{R}^{d};\mathbb{R}^{d}))\times C_{2}^{\frac{p}{2}-\operatorname{var}}([0,T];C_{b}^{2}(\mathbb{R}^{d}\times\mathbb{R}^{d};\mathbb{R}^{d}))$ is called a $p$ -rough driver provided Chen’s relation,

[TABLE]

holds. The set of all such pairs is equipped with the metrics

[TABLE]

Most of the time we will work on the diagonal of the spatial points and write simply $\mathbb{F}_{st}(x):=\mathbb{F}_{st}(x,x)$ , and we shall also write $\nabla F_{ut}(x)F_{su}(x)=F_{su}(x)\otimes\nabla F_{ut}(x)$ .

For $\alpha\in(\frac{1}{3},\frac{1}{2}]$ a pair of functions $\mathbf{F}=(F,\mathbb{F})\in C^{\alpha}([0,T];C_{b}^{3}(\mathbb{R}^{d};\mathbb{R}^{d}))\times C_{2}^{2\alpha}([0,T];C_{b}^{2}(\mathbb{R}^{d}\times\mathbb{R}^{d};\mathbb{R}^{d}))$ is called an $\alpha$ -rough driver provided (21) holds. The set of all such pairs is equipped with the metric

[TABLE]

Remark 3.2.

The reason for using both $p$-variation and $\alpha$-Hölder continuous drivers is that the construction via the Kolmogorov continuity theorem (Lemma 4.3 below) more readily yields bounds in the Hölder sense. However, to estimate the difference between two solutions we need exponential bounds, and it is well known that even when $W$ is a Brownian motion, the random variable $[W]_{\alpha}$ is not exponentially integrable. This problem is circumvented by using $p$-variation, more specifically the local accumulation $N_{1}(\|W\|_{p-var;[\cdot,\cdot]},[0,T])$, see Section 4.2 for the details.

From (8) it is clear that if $\mathbf{F}$ is an $\alpha$ -rough driver, then it is also a $p$ -rough driver with $p=\frac{1}{\alpha}$ . When the notion is clear from the context, we shall simply say that $\mathbf{F}$ is a rough driver.
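For the reader's convenience, here is the elementary estimate behind this implication (presumably the content of (8), which is not reproduced here), writing $[F]_{\alpha}$ and $[\mathbb{F}]_{2\alpha}$ for the Hölder seminorms of the two levels. For any partition $\pi$ of $[s,t]$ and $p=\frac{1}{\alpha}$,

$$
\sum_{[u,v]\in\pi}\|F_{uv}\|_{C_{b}^{3}}^{p}\leq[F]_{\alpha}^{p}\sum_{[u,v]\in\pi}|v-u|^{\alpha p}=[F]_{\alpha}^{p}\,|t-s|,
\qquad
\sum_{[u,v]\in\pi}\|\mathbb{F}_{uv}\|_{C_{b}^{2}}^{\frac{p}{2}}\leq[\mathbb{F}]_{2\alpha}^{\frac{p}{2}}\sum_{[u,v]\in\pi}|v-u|^{2\alpha\cdot\frac{p}{2}}=[\mathbb{F}]_{2\alpha}^{\frac{p}{2}}\,|t-s|.
$$

Taking the supremum over $\pi$ bounds the $p$-variation of $F$ and the $\frac{p}{2}$-variation of $\mathbb{F}$ on $[s,t]$ by the corresponding Hölder seminorms times $|t-s|^{\alpha}$ and $|t-s|^{2\alpha}$, respectively.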

Example 3.3.

Consider a rough path $\mathbf{X}\in\mathscr{C}^{\alpha}([0,T];C_{b}^{3}(\mathbb{R}^{d};\mathbb{R}^{d}))$, where we identify $C_{b}^{3}(\mathbb{R}^{d};\mathbb{R}^{d})\otimes C_{b}^{3}(\mathbb{R}^{d};\mathbb{R}^{d})$ with a subspace of $C_{b}^{3}(\mathbb{R}^{d}\times\mathbb{R}^{d};\mathbb{R}^{d\times d})$ (since we are on the unbounded domain $\mathbb{R}^{d}$, we do not know whether these spaces can be identified, but the inclusion is enough for our purposes), so that Chen’s relation reads

[TABLE]

Let now $F_{st}(x)=X_{st}(x)$ and $\mathbb{F}_{st}(x,y)=\nabla_{2}^{\otimes}(\mathbb{X}_{st}(x,y))$ where $\nabla_{2}^{\otimes}:C_{b}^{3}(\mathbb{R}^{d}\times\mathbb{R}^{d};\mathbb{R}^{d\times d})\rightarrow C_{b}^{3}(\mathbb{R}^{d}\times\mathbb{R}^{d};\mathbb{R}^{d})$ is the multiplication of vector fields, i.e. the linear extension of the mapping defined by

[TABLE]

It is straightforward to check that this gives a rough driver, and we notice that the mapping $\mathbf{X}\mapsto\mathbf{F}$ is continuous.

With this at hand we can define the notion of a solution.

Definition 3.4.

Let $\mathbf{F}$ be a rough driver as in Definition 3.1 and $\xi\in\mathbb{R}^{d}$ . A path $x:[0,T]\rightarrow\mathbb{R}^{d}$ is called a solution to (20) provided $x^{\natural}$ defined by

[TABLE]

is such that $x^{\natural}\in C_{2}^{\frac{p}{3}-\operatorname{var}}([0,T];\mathbb{R}^{d})$ .

Remark 3.5.

One drawback with this method compared to linear integration is the lack of "universality" in the Itô-Lyons map; recall that the stochastic equation

[TABLE]

and its corresponding mapping $B\mapsto x$ can be factorized into a discontinuous map, $B\mapsto(B,\int BdB)$, and a continuous one, $(B,\int BdB)\mapsto x$. One of the nice features of this decomposition is the fact that $B\mapsto(B,\int BdB)$ is universal in the sense that it does not depend on the vector field $V$ driving the equation, which allows one to fix a subset $\Omega_{0}\subset\Omega$ for which one can do deterministic analysis on the differential equation.

In our case, however, the subset of $\Omega$ will depend on the driving vector fields since we are building a non-linear integration theory depending on the coefficients.

3.1 A priori estimates

Let $\mathbf{F}$ be a $p$ -rough driver and assume $x$ is a solution of equation (20) in the sense of Definition 3.4. In this section we use (12) and (13) to deduce a priori estimates. We let $w_{\mathbf{F}}$ be the smallest control such that

[TABLE]

Define the controlled quantity,

[TABLE]

Lemma 3.6.

Let $g\in C_{b}^{2}$. Then, for all $s,t\in[0,T]$, we have the following chain rule,

[TABLE]

Proof.

We have from Taylor’s formula

[TABLE]

where

[TABLE]

By the definition of brackets (14), we get

[TABLE]

The result follows. ∎

With this in hand we turn to an a priori estimate for the nonlinear RDE.

Proposition 3.7.

Let $0<h\leq T$. There exist constants $C$ and $h$ depending only on $p$ such that for all $s,t$ with $w_{\mathbf{F}}(s,t)\leq h$ we have

[TABLE]

Proof.

We start with the easily verifiable identity for a function $G$ and path $y$

[TABLE]

Using Chen’s relation we get

[TABLE]

We get from Lemma 3.6, provided $h<1$

[TABLE]

and clearly

[TABLE]

From the sewing lemma there exists a constant $C$ such that

[TABLE]

From equations (22) and (23) we have

[TABLE]

and consequently

[TABLE]

If now $s,t$ are such that $Cw_{\mathbf{F}}(s,t)^{1/p}\leq\frac{1}{2}$ we get

[TABLE]

which gives

[TABLE]

∎

The above bound translates now to global estimates on the solution itself in the following way.

Lemma 3.8.

Assume now that $\mathbf{F}$ is an $\alpha$ -rough driver with $\alpha=\frac{1}{p}$ . Then we have, for $h>0$ small enough depending on $\mathbf{F}$ ,

[TABLE]

Moreover, we have the global estimate

[TABLE]

for a constant $C>0$ depending only on $\alpha$ .

Proof.

Since $\mathbf{F}$ is Hölder continuous we have $w_{\mathbf{F}}(s,t)\leq[\mathbf{F}]_{\alpha,h}^{p}|t-s|$ for all $|t-s|\leq h$ . Choose now $h$ such that $h^{\alpha}[\mathbf{F}]_{\alpha,h}C\leq\frac{1}{2}$ where $C$ is as in Proposition 3.7. For $|t-s|\leq h$ we have

[TABLE]

from which (25) follows.

From (7) we get, choosing now $h\simeq[\mathbf{F}]_{\alpha}^{-1/\alpha}$ , $h^{\alpha-1}\simeq[\mathbf{F}]_{\alpha}^{(1-\alpha)/\alpha}$

[TABLE]

for some universal constant $C$ depending only on $\alpha$ . ∎

3.2 A priori contractive estimates

Let $p<3$, and assume $\mathbf{F}$, $\mathbf{G}$ are two $p$-rough drivers. We take two solutions $x$ and $y$ of equation (20) in the sense of Definition 3.4, with initial conditions $x_{0}$ and $y_{0}$ and driven by $\mathbf{F}$ and $\mathbf{G}$ respectively.

To illustrate the ideas of this section, we give the following remark.

Remark 3.9.

Assume that $F:=\int_{0}^{t}f_{r}(x)dr$ , $G:=\int_{0}^{t}g_{r}(x)dr$ , $x$ and $y$ are smooth in time, so that we can write

[TABLE]

where we have used Gronwall’s inequality in the last step. The purpose of this subsection is to replicate these estimates in the rough case. The steps are similar to those of the previous subsection, except that we compare two solutions.
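The smooth-case argument of this remark can be tested numerically. The sketch below is an illustration only: the fields $f_{r}(x)=\cos(r)\sin(x)$ and $g_{r}(x)=f_{r}(x)+\varepsilon\cos(x)$ are hypothetical choices, for which $\sup|f-g|\leq\varepsilon$ and $\sup|\nabla f|\leq 1$, so the classical Gronwall argument gives $|x_{T}-y_{T}|\leq(|x_{0}-y_{0}|+\varepsilon T)e^{T}$. The bound in the remark itself may be stated with different constants.

```python
import math

EPS = 0.05  # size of the perturbation between the two fields (hypothetical)

def f(r, x):
    return math.cos(r) * math.sin(x)

def g(r, x):
    return f(r, x) + EPS * math.cos(x)

def rk4(field, x0, T, n=4000):
    # Classical RK4 for x' = field(r, x) on [0, T].
    h = T / n
    x, r = x0, 0.0
    for _ in range(n):
        k1 = field(r, x)
        k2 = field(r + h / 2, x + h / 2 * k1)
        k3 = field(r + h / 2, x + h / 2 * k2)
        k4 = field(r + h, x + h * k3)
        x += h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        r += h
    return x

def gronwall_bound(x0, y0, T):
    # (|x0 - y0| + int_0^T sup|f - g| dr) * exp(int_0^T sup|grad f| dr),
    # with the crude bounds sup|f - g| <= EPS and sup|grad f| <= 1.
    return (abs(x0 - y0) + EPS * T) * math.exp(T)

if __name__ == "__main__":
    gap = abs(rk4(f, 1.0, 2.0) - rk4(g, 1.2, 2.0))
    print("gap:", gap, "bound:", gronwall_bound(1.0, 1.2, 2.0))
```

The bound is far from sharp here; the point is only the structure: initial gap plus driver gap, amplified by an exponential of the Lipschitz norm of one driver.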

We start by writing

[TABLE]

Let $z:=x-y$ and $z^{\sharp}:=x^{\sharp}-y^{\sharp}$ so that the above gives the estimate

[TABLE]

We begin with the analogue of Lemma 3.6 that allows us to estimate nonlinearities of the remainders.

Lemma 3.10.

Let $f,g\in C_{b}^{3}$ . Then using the notation as in Lemma 3.6 we have the estimate

[TABLE]

Proof.

We write

[TABLE]

The first two terms above can be written

[TABLE]

which gives the bound

[TABLE]

Now write

[TABLE]

We see that

[TABLE]

which gives (28). ∎

Proposition 3.11.

Assume that $\mathbf{F}$ and $\mathbf{G}$ are $\alpha$-rough drivers with $\alpha=\frac{1}{p}$. Then there exists a universal constant $C$ such that

[TABLE]

Moreover,

[TABLE]

for all $s,t$ such that $C([\![\mathbf{F}]\!]_{p,[s,t]}+[\![\mathbf{G}]\!]_{p,[s,t]})\leq 1$ . In particular, we have uniqueness for equation (20) and the solution is continuous w.r.t. the initial condition.

Proof.

Using Chen’s relation we get

[TABLE]

Replacing $f=F_{ut}$ and $g=G_{ut}$ in (28) we get

[TABLE]

Replacing $f=\mathbb{F}_{ut}$ and $g=\mathbb{G}_{ut}$ in (28) we get

[TABLE]

Use also the estimate

[TABLE]

Let now $s,t$ be such that $w_{\mathbf{G}}(s,t)^{1/p},w_{\mathbf{F}}(s,t)^{1/p}\leq\frac{C}{2}$ which gives

[TABLE]

From (12) and (13) we get that there exists a universal constant $C$ such that

[TABLE]

Choose now $s,t$ such that $w_{\mathbf{G}}(s,t)^{1/p}\leq\frac{C}{2}\wedge 1$ , so that

[TABLE]

For the solution we have

[TABLE]

Let now $(s,t)$ be such that $w_{\mathbf{F}}(s,t)^{1/p},w_{\mathbf{G}}(s,t)^{1/p}\leq\frac{C}{2}$ we get

[TABLE]

where $w_{*}(s,t)=w_{\mathbf{F}-\mathbf{G}}(s,t)^{1/3}w_{\mathbf{F}}(s,t)^{2/3}+w_{\mathbf{F}-\mathbf{G}}(s,t)^{2/3}w_{\mathbf{F}}(s,t)^{1/3}$ . From the rough Gronwall lemma, [11, Lemma 2.11], we get

[TABLE]

and we notice that this holds for all subintervals $[s,t]$ , i.e. no smallness assumption. Now, choose the finest partition $\tau_{k}$ of $[s,t]$ such that $w_{\mathbf{F}}(\tau_{k},\tau_{k+1})=1$ . We have

[TABLE]

and on $[\tau_{1},\tau_{2}]$ we get

[TABLE]

provided $C>1$ . An easy induction shows that

[TABLE]

By definition of the greedy partition (9) we get

[TABLE]

Letting $s=0$ and using the bound $w_{*}(0,T)\leq w_{\mathbf{F}-\mathbf{G}}(0,T)\leq[\mathbf{F}-\mathbf{G}]_{\alpha}$ this shows (29). To see (30) we plug the above into (27) to get

[TABLE]

using $w_{\mathbf{F}}(s,t)^{1/p},w_{\mathbf{G}}(s,t)^{1/p}\leq 1/2$ in the last step. Using (33) gives

[TABLE]

This gives (30). The bound (31) is proved in a similar way. ∎

Corollary 3.12.

Assume that $\mathbf{F}$ and $\mathbf{G}$ are $\alpha$ -rough drivers with $\alpha=\frac{1}{p}$ . Then there exists a universal constant $C$ such that,

[TABLE]

Proof.

Use bounds of the form $w_{\mathbf{F}}(s,t)\leq[\mathbf{F}]_{\alpha,h}^{p}|t-s|$ for all $|t-s|\leq h$ in inequality (34). This gives the Hölder estimate

[TABLE]

which holds when $h$ is such that, for $|t-s|\leq h$, we have $w_{\mathbf{F}}(s,t)^{1/p},w_{\mathbf{G}}(s,t)^{1/p}\lesssim 1$, in particular when $h^{\alpha}([\mathbf{F}]_{\alpha,h}+[\mathbf{G}]_{\alpha,h})\lesssim 1$.

Let $C$ be the constant given by Proposition 3.11 and set $h=C^{-\frac{1}{\alpha}}([\mathbf{F}]_{\alpha}+[\mathbf{G}]_{\alpha})^{-\frac{1}{\alpha}}$ . It follows by (7) and Proposition 3.11 that (the value of $C$ changes in the following lines, but it only depends on $\alpha$ )

[TABLE]

This concludes the proof. ∎

3.3 Well-posedness of nonlinear RDEs

Since uniqueness of equation (20) follows from Proposition 3.11, it is only left to prove existence of a solution. We do so by using a Picard iteration.
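As a point of comparison, the classical Picard iteration for a smooth driver can be sketched in a few lines. This is not the iteration used in the proof below, which acts on the expansions $(x^{n},x^{n,\natural})$; it is only the smooth analogue $x^{n+1}_{t}=\xi+\int_{0}^{t}f_{r}(x^{n}_{r})dr$, with a hypothetical field $f_{r}(x)=\cos(r)\sin(x)$ and a trapezoidal quadrature on a fixed grid.

```python
import math

def f(r, x):
    # Hypothetical smooth field with Lipschitz constant at most one.
    return math.cos(r) * math.sin(x)

def picard_step(path, grid, xi):
    # One Picard update: new_t = xi + int_0^t f_r(path_r) dr (trapezoid rule).
    new = [xi]
    acc = 0.0
    for i in range(1, len(grid)):
        h = grid[i] - grid[i - 1]
        acc += 0.5 * h * (f(grid[i - 1], path[i - 1]) + f(grid[i], path[i]))
        new.append(xi + acc)
    return new

def picard(xi, T=1.0, n=200, iterations=15):
    grid = [T * i / n for i in range(n + 1)]
    path = [xi] * (n + 1)          # x^0 is the constant path
    gaps = []                      # sup-distance between consecutive iterates
    for _ in range(iterations):
        nxt = picard_step(path, grid, xi)
        gaps.append(max(abs(a - b) for a, b in zip(nxt, path)))
        path = nxt
    return grid, path, gaps

if __name__ == "__main__":
    _, _, gaps = picard(1.0)
    print("first gap:", gaps[0], "last gap:", gaps[-1])
```

In the smooth case the gaps decay factorially, so the iterates converge without passing to a subsequence; the proof below instead relies on uniform bounds and compactness.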

Theorem 3.13.

Let $\mathbf{F}$ be a $p$ -variation rough driver. There exists a unique solution $x$ of equation (20), in the sense of Definition 3.4, with initial condition $\xi\in\mathbb{R}^{d}$ .

Proof.

Uniqueness is given by Proposition 3.11. We study now existence. Define $x^{0}_{t}=\xi$ , $x_{t}^{1}=F_{0t}(\xi)$ and

[TABLE]

which gives

[TABLE]

Consequently, there exists a pair $(x^{2},x^{2,\natural})$ such that

[TABLE]

and we have $|x_{st}^{2,\natural}|\leq Cw_{\mathbf{F}}(s,t)^{3/p}$ for some universal constant $C$ .

We prove inductively that there exist universal constants $C$ and $h$ such that for $w_{\mathbf{F}}(s,t)\leq h$ we have $|x_{st}^{n,\natural}|\leq Cw_{\mathbf{F}}(s,t)^{3/p}$ and $|\delta x_{st}^{n}|\leq 2w_{\mathbf{F}}(s,t)^{1/p}+Cw_{\mathbf{F}}(s,t)^{3/p}$.

Given $x^{n-1}$ and $x^{n}$ we let

[TABLE]

We then get

[TABLE]

which gives

[TABLE]

provided $h$ is such that $5Cw_{\mathbf{F}}(s,t)^{2/p}\leq 1$ . This gives that there exists $x^{n+1},x^{n+1,\natural}$ such that

[TABLE]

so $C\geq 8C_{p}$ will do. Provided $h$ is such that $w_{\mathbf{F}}(s,t)^{1/p}\leq 1$ we also get

[TABLE]

which proves the induction hypothesis.

From Arzelà-Ascoli we get that there exists a subsequence $x^{n_{k}}$ converging in $C([0,T];\mathbb{R}^{d})$ to some element $x$ . Clearly we get

[TABLE]

Since all the remaining terms of (35) (or rather, of the version with $n$ replaced by $n_{k}$) converge, $x^{n_{k},\natural}_{st}$ must also converge to a limit, denoted $x^{\natural}_{st}$. Then $x$ and $x^{\natural}$ satisfy (22), and from the uniform bounds on $x^{n_{k},\natural}$ we see that $x$ is indeed a solution. ∎

4 Rough non-linearities

In this section we show how to construct the rough drivers that are used for solving the McKean-Vlasov equation (4). We start by constructing rough drivers corresponding to Itô theory, i.e. given a vector field $\sigma$ and a Brownian motion $W$ , we want to define

[TABLE]

where the latter integration is in the sense of Itô. As the following example demonstrates, it is not possible to simply integrate a function $\sigma\in C([0,T];C_{b}^{3}(\mathbb{R}^{d};\mathbb{R}^{d}))$ to produce a rough driver.

Example 4.1.

Let $d=1$ and $\sigma_{r}(x)=\sin(rx)$ , then the mapping $x\mapsto W_{st}^{\sigma}(x)$ is $P$ -a.s. unbounded as $x\rightarrow\infty$ . Indeed, let $s=0$ and $t=1$ and $x=2\pi n$ for $n\in\mathbb{N}$ , then $\{W_{01}^{\sigma}(2\pi n)\}_{n\in\mathbb{N}}$ is an i.i.d. Gaussian sequence, which $P$ -a.s. diverges.
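The independence claimed in this example follows from the Itô isometry, since $E[W_{01}^{\sigma}(2\pi n)W_{01}^{\sigma}(2\pi m)]=\int_{0}^{1}\sin(2\pi nr)\sin(2\pi mr)dr=\frac{1}{2}\delta_{nm}$. The simulation below is a rough sketch of the same phenomenon, assuming a left-point discretization of the Itô integral on a uniform grid; the function names, grid size and seed are ours.

```python
import math
import random

def field_values(n_max, steps=2000, seed=0):
    # Discretized Ito integrals int_0^1 sin(2*pi*n*r) dW_r for n = 1..n_max,
    # all built from the same simulated Brownian increments.
    rng = random.Random(seed)
    dt = 1.0 / steps
    dW = [rng.gauss(0.0, math.sqrt(dt)) for _ in range(steps)]
    values = []
    for n in range(1, n_max + 1):
        total = 0.0
        for i in range(steps):
            total += math.sin(2.0 * math.pi * n * i * dt) * dW[i]
        values.append(total)
    return values

def running_max(values):
    # Running maximum of |W(2*pi*n)| along n, to visualise unboundedness.
    out, current = [], 0.0
    for v in values:
        current = max(current, abs(v))
        out.append(current)
    return out

if __name__ == "__main__":
    vals = field_values(200)
    second_moment = sum(v * v for v in vals) / len(vals)
    print("empirical second moment (theory: 0.5):", second_moment)
    print("running max at n = 10, 100, 200:",
          [running_max(vals)[k - 1] for k in (10, 100, 200)])
```

The running maximum of an i.i.d. Gaussian sequence grows without bound (slowly, like $\sqrt{\log n}$), which is the obstruction to $x\mapsto W_{01}^{\sigma}(x)$ being bounded.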

The above example shows that we need some decay on our vector fields as $|x|\rightarrow\infty$ . We choose to assume that $\sigma$ belongs to a Sobolev space $H^{k}(\mathbb{R}^{d};\mathbb{R}^{d})$ where $k$ is large enough to use Sobolev embedding to show that $\mathbf{W}^{\sigma}$ is a rough driver. The reason for this choice is the relatively simple and well established theory of Itô integration that is available for Hilbert spaces. We conjecture that this regularity can be significantly lowered (e.g. with decay as in [3, Corollary 9]) and leave this for future investigation.

Let $d,m\in\mathbb{N}$ be fixed and let $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ , for $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ . In this section we assume the following

Assumptions 4.2.

Let $k\in\mathbb{N}\cup\{0\}$ , and $\alpha\in(\frac{1}{3},\bar{\alpha})$ ,

(i)

Let $(\beta,\beta^{\prime})\in\mathscr{D}_{Z}^{2\alpha}([0,T];H^{k})$, as in Section 2.

(ii)

Let $\sigma:[0,T]\rightarrow\mathcal{L}(\mathbb{R}^{d};H^{k})$ be a continuous function, such that

[TABLE]

(iii)

Let $p=\alpha^{-1}$ , then

[TABLE]

To simplify the following discussion, we introduce the convenient notation

[TABLE]

4.1 Construction of the rough driver

4.1.1 Itô theory

Let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},P)$ be a filtered probability space and let $W$ be a $d$ -dimensional Wiener process on it. We assume that $\sigma$ satisfies Assumption 4.2 (ii), for $k>3+\frac{d}{2}$ . We define, for $0\leq s\leq t\leq T$ ,

[TABLE]

where the integral is defined in the sense of Itô on Hilbert spaces, see [22, Section 2]. Thanks to the Burkholder-Davis-Gundy (BDG) inequality for Hilbert spaces, [22, Theorem 2.4.7], we have for all $\rho\geq 1$ and $0\leq s\leq t\leq T$,

[TABLE]

We consider now the time-continuous stochastic process,

[TABLE]

with Hilbert-Schmidt norm (15) bounded as $\|\left(W_{t}^{\sigma}\otimes\nabla\right)\sigma_{t}\|_{\mathcal{L}(\mathbb{R}^{d};H^{k}\otimes H^{k-1})}\leq\|W_{t}^{\sigma}\|_{H^{k}}\|\sigma_{t}\|_{\mathcal{L}(\mathbb{R}^{d};H^{k})}$ , for all $t\in[0,T]$ . Using again Itô theory on Hilbert spaces, we have that $\int_{0}^{t}\left(W_{r}^{\sigma}\otimes\nabla\right)\sigma_{r}dW_{r}\in M^{2}_{T}(H^{k}\otimes H^{k-1})$ and we set, for $0\leq s\leq t\leq T$ ,

[TABLE]

Applying the BDG inequality again, together with inequality (38), we have for all $\rho\geq 1$ and $0\leq s\leq t\leq T$,

[TABLE]

Lemma 4.3.

Let $W$ be a $d$ -dimensional Wiener process on the filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},P)$ and let $\sigma$ satisfy Assumption 4.2 (ii), with $k>3+\frac{d}{2}$ . Let $W^{\sigma}$ and $\mathbb{W}^{\sigma}$ be defined as in (37) and (39), respectively. Then, for every $\alpha\in(\frac{1}{3},\frac{1}{2})$ , for $P$ -a.e. $\omega$

[TABLE]

is a rough driver in the sense of Definition 3.1, and for all $\rho>\frac{2}{1-2\alpha}$ , we have

[TABLE]

Moreover, on small time-intervals $|t-s|\leq h\leq T$ we have, for $\bar{\alpha}\in(\alpha,\frac{1}{2})$ ,

[TABLE]

Proof.

We first study the space regularity of $\mathbf{W}$ . From the choice of $k$ , Sobolev’s embedding Theorem [4, Corollary 9.13] and inequalities (38) and (40), we have that

[TABLE]

By the Kolmogorov continuity theorem A.1, we obtain (41).

We check now that Chen’s relation (21) holds $P$-a.s. Indeed, we have the following,

[TABLE]

To justify the last equality we call $\tilde{H}:=\mathcal{L}(\mathbb{R}^{d};H^{k}\otimes H^{k-1})$ and we note that $(W^{\sigma}_{su}\otimes\nabla):\Omega\to L(H^{k},\tilde{H})$ is an $\mathcal{F}_{u}$ -measurable random variable taking values in the space of linear operators between two Hilbert spaces. Thanks to the fact that the operator $(W^{\sigma}_{su}\otimes\nabla)$ is measurable with respect to the left-most point of the integral, one can easily adapt [22, Lemma 2.4.1] to show that it commutes with the stochastic integral.

∎

We shall also need contractive estimates w.r.t. the vector field.

Lemma 4.4.

Let $\sigma$ and $\theta$ satisfy Assumption 4.2 (ii), with $k>3+\frac{d}{2}$ . Let $\mathbf{W}^{\sigma}$ and $\mathbf{W}^{\theta}$ be rough drivers as constructed in Lemma 4.3 w.r.t. the vector fields $\sigma$ and $\theta$ . Then, for all $\bar{\alpha}\in[\alpha,\frac{1}{2})$ and all $\rho>\frac{2}{1-2\bar{\alpha}}$ , there exists $K_{\rho}\in L^{\rho}(\Omega)$ , such that for all $h\leq T$ ,

[TABLE]

Proof.

The proof follows from an application of the Kolmogorov continuity theorem, as in Lemma 4.3. ∎

4.1.2 Gubinelli integration

Let $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ , $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ and let $\beta$ satisfy Assumption 4.2 (i), for $k\in\mathbb{N}\cup\{0\}$ and $\alpha\in(\frac{1}{3},\bar{\alpha})$ . Using Gubinelli’s integration theory (see [15, Chapter 4]) we define, for each $0\leq s\leq t\leq T$ ,

[TABLE]

which satisfies (see [15, Theorem 4.10])

[TABLE]

and we have,

[TABLE]

For $t\in[0,T]$ , we define $Z^{\beta}_{t}:=Z^{\beta}_{0t}$ and we consider $\left(Z_{t}^{\beta}\otimes\nabla\right)\beta_{t}\in\mathcal{L}(\mathbb{R}^{m};H^{k}\otimes H^{k-1})$ , with Gubinelli derivative

[TABLE]

Consequently we can define the integral $\int_{s}^{t}(Z^{\beta}_{r}\otimes\nabla)\beta_{r}d\mathbf{Z}_{r}\in H^{k}\otimes H^{k-1}$ via the local expansion

[TABLE]
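The role of the second-order term in such a local expansion can be illustrated on a toy example. The sketch below assumes a smooth scalar path $Z_{t}=\sin(3t)$ with its canonical lift $\mathbb{Z}_{st}=\frac{1}{2}(Z_{st})^{2}$ and the controlled integrand $Y=g(Z)$, $Y^{\prime}=g^{\prime}(Z)$ with $g(z)=z^{2}$; these choices are ours and are far simpler than the $H^{k}\otimes H^{k-1}$-valued setting of the text. It compares plain left-point Riemann sums with the compensated sums $\sum Y_{s}Z_{st}+Y^{\prime}_{s}\mathbb{Z}_{st}$.

```python
import math

def Z(t):
    # Hypothetical smooth scalar driving path.
    return math.sin(3.0 * t)

def g(z):
    return z * z

def dg(z):
    return 2.0 * z

def riemann_sums(n, T=1.0):
    # Returns (plain, compensated) approximations of int_0^T g(Z_r) dZ_r.
    plain = 0.0
    compensated = 0.0
    for i in range(n):
        s, t = T * i / n, T * (i + 1) / n
        dz = Z(t) - Z(s)
        zz = 0.5 * dz * dz            # second level of the canonical lift
        plain += g(Z(s)) * dz
        compensated += g(Z(s)) * dz + dg(Z(s)) * zz
    return plain, compensated

def exact(T=1.0):
    # int_0^T Z^2 dZ = (Z_T^3 - Z_0^3) / 3 by the classical chain rule.
    return (Z(T) ** 3 - Z(0.0) ** 3) / 3.0

if __name__ == "__main__":
    plain, comp = riemann_sums(50)
    print("plain error:", abs(plain - exact()))
    print("compensated error:", abs(comp - exact()))
```

For a smooth path both sums converge, but the compensated one gains an order of accuracy; for a genuinely rough $\mathbf{Z}$ only the compensated sums converge, which is why the expansion carries the Gubinelli derivative.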

Defining

[TABLE]

we get

[TABLE]

We have the following lemmas of which we omit the proofs as they follow quite easily from the discussion above, standard computations on rough integrals, and Sobolev embedding Theorem [4, Corollary 9.13].

Lemma 4.5.

Let $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ and $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ . Assume that $\beta$ satisfies Assumption 4.2 (i), with $k\geq 4+\frac{d}{2}$ and $\alpha\in(\frac{1}{3},\bar{\alpha})$ . Let $Z^{\beta}$ and $\mathbb{Z}^{\beta}$ be defined as in (43) and (45), respectively. Then,

[TABLE]

is a rough driver in the sense of Definition 3.1 and we have for time intervals of size $h\leq T$ ,

[TABLE]

Lemma 4.6.

Let $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ and $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ . Assume that $\beta$ and $\gamma$ satisfy Assumption 4.2 (i), with $k\geq 4+\frac{d}{2}$ and $\alpha\in(\frac{1}{3},\bar{\alpha})$ . Let $\mathbf{Z}^{\beta}$ and $\mathbf{Z}^{\gamma}$ be rough drivers constructed as in Lemma 4.5. Then, on time intervals of size $h\leq T$ ,

[TABLE]

Let us show that the above definition coincides with the usual definition of solutions of rough path equations.

Lemma 4.7.

Suppose $x:[0,T]\rightarrow\mathbb{R}^{d}$ is a solution of $dx_{t}=\mathbf{Z}^{\beta}_{dt}(x_{t})$ in the sense of Definition 3.4. Then $x$ also solves the classical rough path equation driven by $Z$ with coefficient $\beta$ , i.e. $(x,\beta(x))\in\mathscr{D}_{Z}^{2\alpha}$ satisfies the following equation in the sense of Davie [9],

[TABLE]

where $\beta(x)$ is also controlled by $Z$ with Gubinelli derivative $\beta^{\prime}(x)+\nabla\beta(x)\beta(x)$.

Proof.

Assume $x$ is a solution to the non-linear equation and let us show that it also satisfies

[TABLE]

for some remainder $\bar{x}^{\natural}$ . By definition of $Z^{\beta}$ we have

[TABLE]

Moreover

[TABLE]

by definition of $Z^{\beta}$ and $\int_{s}^{t}\nabla\beta(x_{s})Z^{\beta}_{r}(x_{s})d\mathbf{Z}_{r}$ . This shows that $|x_{st}^{\natural}-\bar{x}_{st}^{\natural}|\lesssim|t-s|^{3\alpha}$ which proves that the solutions coincide. Notice that the above bounds depend on $\|(\beta,\beta^{\prime})\|_{\alpha,Z;H^{k}}$ only. ∎

4.1.3 Mixed Itô and rough path integration

Let $W$ be a $d$ -dimensional Wiener process on the filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},P)$ . Let $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ , $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ . Assume that $\sigma$ and $\beta$ satisfy Assumption 4.2 (ii) and 4.2 (i) respectively, for $k\in\mathbb{N}\cup\{0\}$ and $\alpha\in(\frac{1}{3},\bar{\alpha})$ . Let $W^{\sigma}$ be defined as in (37) and $Z^{\beta}$ be defined as in (43). We define

[TABLE]

We remark that the first term on the right hand side of the above equation is random, whereas the second is deterministic. Define heuristically

[TABLE]

The first two terms on the right hand side are defined as in (39) and (45) respectively; we need to make the last two rigorous. For the third term, using the Itô theory in Hilbert spaces as we did in Section 4.1.1, we see that the integral

[TABLE]

is well-defined. Indeed, we have $(Z_{r}^{\beta}\otimes\nabla)\sigma_{r}\in\mathcal{L}(\mathbb{R}^{d};H^{k}\otimes H^{k-1})$ for all $0\leq r\leq T$ . Hence, we can define

[TABLE]

Similarly, we have $(\sigma_{r}\otimes\nabla)Z_{r}^{\beta}\in\mathcal{L}(\mathbb{R}^{d};H^{k}\otimes H^{k-1})$ and $\int_{s}^{t}(\sigma_{r}\otimes\nabla)Z_{r}^{\beta}dW_{r}\in M^{2}_{T}(H^{k}\otimes H^{k-1})$ . We define

[TABLE]

Lemma 4.8.

Let $W$ be a $d$ -dimensional Wiener process on the filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},P)$ . Let $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ and $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ . Assume that $\sigma$ and $\beta$ satisfy Assumption 4.2 (ii) and 4.2 (i) respectively, with $k\geq 4+\frac{d}{2}$ and $\alpha\in(\frac{1}{3},\bar{\alpha})$ . Let $F$ and $\mathbb{F}$ be defined as in (47) and (48), respectively. Then, for $P$ -a.e. $\omega$ ,

[TABLE]

is a rough driver in the sense of Definition 3.1. Moreover, on time intervals of size $h\leq T$ we have that, for all $\rho>\frac{2}{1-2\bar{\alpha}}$ , there exists $K_{\rho}\in L^{\rho}(\Omega)$ , such that, $P$ -a.s.,

[TABLE]

where $L$ is defined in (36).

Proof.

It is immediate to verify that the couple $(F,\mathbb{F})$ satisfies Chen’s relation (21). We give now estimates on the first order term (47). As a consequence of the definition of $F$ and Lemma 4.5, we have, on an interval of size $h\leq T$ ,

[TABLE]

We use now Lemma 4.3 to control the first term in the right hand side.

Now we study the regularity of $\mathbb{F}$ . Using the BDG inequality [22, Theorem 2.4.7] and inequality (44), we have for all $\rho\geq 1$ and $0\leq s\leq t\leq T$ , $t-s\leq h$ ,

[TABLE]

By the Kolmogorov continuity theorem A.1, we obtain that for every $\rho>\frac{2}{1-2\alpha}$ there exists $K_{\rho}\in L^{\rho}(\Omega)$ , such that

[TABLE]

Similar considerations lead to

[TABLE]

Putting together the last inequalities, Lemma 4.3 and Lemma 4.5 yields

[TABLE]

Inequality (49) follows immediately from the Sobolev embedding theorem [4, Corollary 9.13] . ∎

Lemma 4.9.

Let $W$ be a $d$ -dimensional Wiener process on the filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},P)$ . Let $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ and $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ . Assume that $\sigma,\theta$ satisfy Assumption 4.2 (ii) and that $\beta,\gamma$ satisfy Assumption 4.2 (i), with $k\geq 4+\frac{d}{2}$ and $\alpha\in(\frac{1}{3},\bar{\alpha})$ . Let $\mathbf{F}$ and $\mathbf{G}$ be nonlinear rough drivers constructed from $F_{st}:=W_{st}^{\sigma}+Z_{st}^{\beta}$ and $G_{st}:=W_{st}^{\theta}+Z_{st}^{\gamma}$ as in Lemma 4.8.

Then, for all $\rho>\frac{2}{1-2\bar{\alpha}}$ , there exists $K_{\rho}\in L^{\rho}(\Omega)$ , such that for any time interval of size $h\leq T$ ,

[TABLE]

where we set $M:=L(\sigma,\beta,\mathbf{Z})+L(\theta,\gamma,\mathbf{Z})$ and $L$ is defined as in (36).

Proof.

We already have contractive estimates from Lemmas 4.4 and 4.6 for the Itô and Gubinelli terms. We look now at the mixed integrals. For every $p\geq 1$ , we have, for $|t-s|\leq h\leq T$ ,

[TABLE]

The same estimate holds for the other mixed term. We conclude by applying the Kolmogorov continuity theorem. ∎

4.2 Integrability of the random rough driver

In this section we study exponential moments of the random rough driver. We will use the approach introduced in [6] and described in [15, Chapter 11].

Lemma 4.10.

Let $(\Omega:=C([0,T];\mathbb{R}^{m}),\mathcal{B}(\Omega),P)$ be the canonical Wiener space with Cameron-Martin space $\mathcal{H}\subset\Omega$ . We define on this space the canonical Wiener process as $W_{t}(\omega)=\omega(t)$ . Let $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ and $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ . Assume that $\sigma$ and $\beta$ satisfy Assumption 4.2, with $k\geq 4+\frac{d}{2}$ and $\alpha\in(\frac{1}{3},\bar{\alpha})$ , and let $\mathbf{F}$ be defined as in Lemma 4.8. Let $p:=\alpha^{-1}\in(2,3)$ and $q\geq 1$ , such that $\frac{1}{p}+\frac{1}{q}>1$ . Then, there exists $C:=C(p,q)>0$ and a null set $N\subset\Omega$ , such that, $\forall\omega\in N^{c}$ , $\forall[s,t]\subset[0,T]$ and $\forall h\in C^{q-var}$ ,

[TABLE]

where, $g_{s,t}:\Omega\to\mathbb{R}_{+}$ is defined as

[TABLE]

Proof.

The proof of this result follows very closely the proof of [15, Theorem 11.5]. We repeat here the important pieces, where the dependence of the stochastic integrals on the space parameter $x$ has to be taken into account. We look at the first order term of $\mathbf{F}$ . By definition, we have

[TABLE]

For every $s,t\in[0,T]$ , the term $W^{\sigma}_{st}$ is constructed as an $L_{\omega}^{2}H^{k}$ limit, hence there exists a sequence of partitions $(\Pi_{m})_{m\in\mathbb{N}}$ and a null set $N_{st}$ such that

[TABLE]

for every $\omega\in N_{st}^{c}$ . We call $N_{1}$ the union of $N_{st}$ over all dyadic times and note that, being a countable union of null sets, it is still a null set. Similarly, we can construct a null set $N_{2}$ such that the function $W^{\sigma}(\omega)$ is of bounded $p$ -variation for every $\omega\in N_{2}^{c}$ . For $\omega\in N_{1}^{c}\cap N_{2}^{c}$ we have

[TABLE]

The first limit on the right hand side exists because of the choice of the null set that we made in (52). The last limit is well defined as a Young integral, since $\sigma$ and $h$ are of complementary variation, see [15, Section 4.1]. Hence, the left hand side of (53) also converges and is, by definition, $W^{\sigma}_{st}(\omega+h)$ .

Hence, we obtain, $\forall\omega\in N_{1}^{c}\cap N_{2}^{c}$ , $h\in C^{q-\operatorname{var}}$ , and for all dyadic times $[s,t]\subset[0,T]$ ,

[TABLE]

To generalize to any subinterval $[s,t]\subset[0,T]$ , we can use a continuity argument, see [15, Theorem 11.5].

We compute now the $p$ -variation in equation (54) and we obtain

[TABLE]

Proceeding similarly for the second order term $\mathbb{F}$ , we have that there exists a null set $N\subset\Omega$ such that $\forall\omega\in N^{c}$ , $\forall h\in C^{q-var}$ and for all times $[s,t]\subset[0,T]$ ,

[TABLE]

to obtain the third term on the right hand side, we used stochastic Fubini Theorem as follows

[TABLE]

We compute the $p$ -variation for the second order term. Using inequalities of the type $\sqrt{ab}\leq\sqrt{a}+\sqrt{b}$ , for $a,b\in\mathbb{R}_{+}$ , we obtain, for all $\omega\in N^{c}$ ,

[TABLE]

where $g_{s,t}$ is defined in (51). This concludes the proof. ∎

For every $s,t\in[0,T]$ , we define the control $w_{\mathbf{F}}(s,t)=[\![\mathbf{F}]\!]_{p,[s,t]}^{p}$ and we construct the greedy partition, following the construction in Section 2.1. Let $N_{\beta}$ be defined as in (9), for any $\beta>0$ . We call $N$ the integer-valued random variable given by

[TABLE]

for $\omega\in\Omega$ . For $y\geq 0$ , let

$$\Phi(y):=\frac{1}{\sqrt{2\pi}}\int_{-\infty}^{y}e^{-\frac{x^{2}}{2}}\,dx$$

be the cumulative distribution function of a standard Gaussian random variable and $\bar{\Phi}=1-\Phi$ . We include a straightforward Lemma needed to estimate $N$ .
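The greedy partition and the count $N$ used above can be sketched on a time grid. Definition (9) lies outside this section, so the code assumes one common convention: $\tau_{0}=s$, $\tau_{k+1}=\inf\{t>\tau_{k}:w(\tau_{k},t)\geq\beta\}\wedge T$ and $N_{\beta}(w;[s,T])=\sup\{n\geq 0:\tau_{n}<T\}$. The function names and the example control $w(s,t)=t-s$ are ours; the paper's convention may differ by an index shift.

```python
def greedy_partition(w, grid, beta):
    # Greedy stopping times on a discrete grid: start a new block as soon as
    # the control accumulated since the last stopping time reaches beta.
    taus = [grid[0]]
    for t in grid[1:]:
        if w(taus[-1], t) >= beta:
            taus.append(t)
    if taus[-1] != grid[-1]:
        taus.append(grid[-1])      # close the partition at the terminal time
    return taus

def local_accumulation(w, grid, beta):
    # N_beta = sup{n >= 0 : tau_n < T}, under the convention stated above.
    taus = greedy_partition(w, grid, beta)
    T = grid[-1]
    return sum(1 for tau in taus if tau < T) - 1

def linear_control(s, t):
    # Simplest control, w(s, t) = t - s; for a rough driver one would use
    # the p-variation control w_F(s, t) instead.
    return t - s

if __name__ == "__main__":
    grid = [i / 100 for i in range(101)]
    print(greedy_partition(linear_control, grid, 0.25))
    print(local_accumulation(linear_control, grid, 0.25))
```

On each block of the greedy partition the control is at most $\beta$ up to grid resolution, which is what allows local estimates valid under a smallness condition to be iterated, with $N$ counting the number of iterations.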

Lemma 4.11.

Let $c>0$ and $\bar{a}\in\mathbb{R}$ . If $Y$ is a positive random variable such that $P(Y>t)\leq\bar{\Phi}(\bar{a}+t/c)$ , for every $t>a$ , then

[TABLE]

Proof.

We use elementary considerations and Fubini’s theorem to obtain

[TABLE]

∎

Theorem 4.12.

Under the same assumptions as Lemma 4.10, the random variable $N$ defined in (55) has a Gaussian tail. Moreover, there exists $C=C(T,p)>0$ , such that $C$ is bounded when $T$ is small and for all $s>1$ ,

[TABLE]

where $L$ is defined in (36).

Proof.

The main ingredient, which remains to be proved, is that, for $P$ -a.e. $\omega$ ,

[TABLE]

where $g$ is defined as in (51) and $C:=C(p,q)$ . The proof of this inequality follows from Lemma 4.10 in the same way as the proof of [15, Lemma 11.12]. It follows from [15, Proposition 11.2], that we can take $q=1$ , to obtain

[TABLE]

with $C:=C(T,p)$ . By assumption, $\sigma$ , $\beta$ and $\mathbf{Z}$ are of finite $p$ -variation. This implies that $g$ is almost surely finite and we can apply the generalized Fernique Theorem [15, Theorem 11.7] as follows. We set $f=N$ and $g=C[\![\sigma]\!]_{p}^{p}g^{p}$ defined as in (51). We must now find $a>0$ such that the following set has positive measure,

[TABLE]

We know from Lemma 4.8 that $E[g^{p}]^{\frac{1}{p}}\leq CL(\sigma,\beta,\mathbf{Z})$ . From Chebyshev’s inequality, we have (where $C$ may change from one term to the next)

[TABLE]

Using the previous estimates, we obtain that,

[TABLE]

where $C=C(T,p)$ is again allowed to increase in the last inequality. Moreover,

[TABLE]

If we now fix $a=(C+1)[\![\sigma]\!]_{p}^{p}L(\sigma,\beta,\mathbf{Z})^{p}$ , we have that $P(A_{a})\geq 1-\frac{C}{C+1}>0$ . From Fernique Theorem [15, Theorem 11.7], we have, for $r>a$ ,

[TABLE]

where $\bar{a}=\hat{a}-a(C[\![\sigma]\!]_{p}^{p})^{-1}$ and $\hat{a}=\Phi^{-1}(P(A_{a}))$ . By our choice of $a$ and the monotonicity of $\Phi^{-1}$ , we have that $\hat{a}\geq\Phi^{-1}(1-\frac{C}{C+1})$ , which is a universal constant depending only on $(p,T)$ , but can be negative. It follows from (56) that $\hat{a}\to\infty$ as $T\to 0$ . We apply Lemma 4.11 with $s>1$ (chosen so that $s\leq s^{2}$ ), $a$ and $\bar{a}$ as before and $c=C[\![\sigma]\!]_{p}^{p}$ , to obtain

[TABLE]

The constant $C$ is allowed to change again in the last line, but one can easily see that it remains bounded, when $T$ is small enough. ∎

4.3 The average Itô formula

In this section we prove a version of the Itô formula which we need to make the connection between (3) and (4). We note that, at present, we do not know how to obtain a $P$ -a.s. Itô formula; we only have the chain rule after averaging over $\Omega$ .

Proposition 4.13.

Let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},P)$ be a complete filtered probability space and let $W$ be a $d$ -dimensional Wiener process on it. Let $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ , $\mathbf{Z}\in\mathscr{C}_{wg}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ . Assume that $\sigma$ and $\beta$ satisfy Assumption 4.2, for $k>\frac{d}{2}+3$ and $\alpha\in(\frac{1}{3},\bar{\alpha})$ . Let $\mathbf{F}$ be defined as in Lemma 4.8.

Let $x(\xi)$ be the solution to equation (20) driven by $\mathbf{F}$ with initial condition $\xi\in\mathbb{R}^{d}$ , in the sense of Definition 3.4, given by Theorem 3.13.

Let $\Xi:\Omega\to\mathbb{R}^{d}$ be an $\mathcal{F}_{0}$ -measurable random variable. Then the process $x_{t}(\Xi)$ is $(\mathcal{F}_{t})_{t\geq 0}$ adapted. Moreover, $x$ is a random variable with values in $C^{\alpha}([0,T];\mathbb{R}^{d})$ .

Proof.

Let $t\in[0,T]$ and call $\mathbf{F}_{|_{[0,t]}}$ the restriction of $\mathbf{F}$ on the interval $[0,t]$ . We know from Proposition 3.11 that

[TABLE]

is a continuous mapping. Moreover the random variable $(\Xi,\mathbf{F}_{|_{[0,t]}})$ is $\mathcal{F}_{t}$ -measurable. Hence,

[TABLE]

is $\mathcal{F}_{t}$ -measurable.

In a similar way we see that $x$ is a random variable in $C^{\alpha}([0,T];\mathbb{R}^{d})$ , since $\omega\mapsto\mathbf{F}(\omega)$ is measurable and $x$ is continuous w.r.t. the rough driver. ∎

Proposition 4.14.

Under the same assumptions as Proposition 4.13, let $x_{t}=x_{t}(\Xi)$ . If $\phi\in C_{b}^{3}\otimes H^{k}$ , endowed with the norm defined in (16), then

[TABLE]

where $E[\nabla_{1}\phi(x_{r})\beta_{r}(x_{r})]\in\mathcal{L}(\mathbb{R}^{m};H^{k})$ is controlled by $Z$ with Gubinelli derivative $E[\nabla_{1}\phi(x_{r})(\beta_{r}^{\prime}(x_{r})+\nabla_{1}\beta_{r}(x_{r})\beta_{r}(x_{r}))+\nabla_{1}^{2}\phi(x_{r})\beta_{r}(x_{r})\otimes\beta_{r}(x_{r})]$ .

Before we proceed with the proof of Proposition 4.14, we prove two technical lemmas.

Lemma 4.15.

Under the same assumptions as Proposition 4.13, let $x^{\sharp}$ be defined in (23). For any $\rho\in\mathbb{N}$ and $|t-s|\leq h\leq T$ , we have

[TABLE]

where $L:=L(\sigma,\beta,\mathbf{Z})$ is defined in (36).

Proof.

Define the random variable $Y:=C\|\mathbf{F}\|_{\alpha,h;C^{3}}$ as in Proposition 3.7 which gives that for $|t-s|^{\alpha}\leq Y^{-1}$ we have $|x_{st}^{\sharp}|\leq Y|t-s|^{2\alpha}.$ Writing $\Omega=\{|t-s|^{\alpha}Y>1\}\cup\{|t-s|^{\alpha}Y\leq 1\}$ gives

[TABLE]

Now trivially by the definition of $x^{\sharp}$ , we have

[TABLE]

and the result follows from Lemma 4.8. ∎

Lemma 4.16.

Under the same assumptions as Proposition 4.13, we have

[TABLE]

with bounds, on a time interval of size $h\leq T$ ,

[TABLE]

where $L:=L(\sigma,\beta,\mathbf{Z})$ is defined in (36).

Proof.

We do a first order Taylor expansion to obtain

[TABLE]

We have defined

[TABLE]

We first make some deterministic bounds (i.e. uniformly in $\omega$ )

[TABLE]

Using that $x$ is adapted we get $E[\nabla_{1}\phi(x_{s})W_{st}^{\sigma}(x_{s})]=0$ so that

[TABLE]

Write now

[TABLE]

and the result follows from Lemma 4.15 with $\rho=1$ . ∎

Proof of Proposition 4.14.

We do a third order Taylor expansion to obtain, $P$ -a.s.,

[TABLE]

where we have defined

[TABLE]

As in Lemma 4.7 we note that

[TABLE]

is bounded, uniformly in $\omega$ , by a multiple of $|t-s|^{3\alpha}$ with a constant depending only on $\beta$ . Moreover, since $\mathbf{Z}$ is geometric and $\nabla^{2}\phi$ is a symmetric bilinear mapping we get

[TABLE]

where $\,\hat{\otimes}\,$ denotes the symmetric tensor product. This is clearly bounded by $|t-s|^{3\alpha}$ .

Using Lemma 4.15 with $\rho=1$ and $\rho=2$ and taking the expectation of $\phi(x)^{\natural}_{st}$ we obtain the result. ∎

To create the contraction mapping in the appropriate space of measures we shall need to control the difference of two measures induced by two rough SDEs.

Proposition 4.17.

Let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},P)$ be a complete filtered probability space and $W$ be a $d$ -dimensional Wiener process on it. Let $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ , $\mathbf{Z}\in\mathscr{C}_{wg}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ . Assume that $(\sigma,\beta)$ and $(\theta,\gamma)$ satisfy Assumption 4.2, for $k>\frac{d}{2}+3$ , $\alpha\in(\frac{1}{3},\bar{\alpha})$ and $p=\frac{1}{\alpha}$ . Let $\mathbf{F}$ and $\mathbf{G}$ be nonlinear rough drivers constructed from $F_{st}:=W_{st}^{\sigma}+Z_{st}^{\beta}$ and $G_{st}:=W_{st}^{\theta}+Z_{st}^{\gamma}$ as in Lemma 4.8. Moreover, let $\Xi$ be an $\mathcal{F}_{0}$ -measurable random variable.

Let $x$ and $y$ be solutions to equation (20) driven by $\mathbf{F}$ and $\mathbf{G}$ respectively, with the same initial condition $\Xi$ .

If $\phi\in C_{b}^{3}\otimes H^{k}$ , endowed with the norm defined in (16), we have

[TABLE]

Moreover, there exist $\rho\geq 1$ and $C(T)$ such that $\lim_{T\to 0}C(T)=0$ , and

[TABLE]

where $M:=K([\![\sigma]\!]_{p,[s,t]}^{\rho}+1)(L(\sigma,\beta,\mathbf{Z})+L(\theta,\gamma,\mathbf{Z}))$ , $L$ is defined in (36) and $K=K(\alpha,\rho)>0$ is a universal constant.

Before proceeding with the proof, we need the next two technical lemmas.

Lemma 4.18.

Under the same assumptions as Proposition 4.17, for any $\rho\geq 1$ , there exist $\bar{\rho}\geq\rho$ , $C$ and $C(T)>0$ , such that $\lim_{T\to 0}C(T)=0$ and

[TABLE]

Proof.

By applying Corollary 3.12, (50) and (49), we see that there exist $\bar{\rho}\geq 1$ and $K_{\bar{\rho}}\in L^{\bar{\rho}}(\Omega)$ such that $P$ -a.s.,

[TABLE]

Taking the $L_{\omega}^{\rho}$ norm on both sides we conclude the proof, thanks to Theorem 4.12, which gives

[TABLE]

where $C>0$ is a universal constant. ∎

Lemma 4.19.

Under the same assumptions as Proposition 4.17, for any $\rho\geq 1$ , there exist $\bar{\rho}\geq\rho$ and $C(T)>0$ , such that $\lim_{T\to 0}C(T)=0$ and, for all $s,t\in[0,T]$ ,

[TABLE]

Proof.

Let $Y:=C([\mathbf{F}]_{\alpha}+[\mathbf{G}]_{\alpha})(1+[\mathbf{F}]_{\alpha}+[\mathbf{G}]_{\alpha})^{2}$ where $C$ is the constant given in Proposition 3.11. Then, $|t-s|^{\alpha}Y\leq 1$ implies,

[TABLE]

and we notice that $E[Y^{\rho}]\leq M^{\bar{\rho}}(T^{\rho}\vee T^{\frac{\rho}{2}})^{\bar{\alpha}-\alpha}$ for some $\bar{\rho}\geq\rho\geq 1$ which follows from Lemma 4.8 and the Gaussian integrability of $N(w_{\mathbf{F}},[0,T])$ , Theorem 4.12.

We split up $\Omega=\{|t-s|^{\alpha}Y\leq 1\}\cup\{|t-s|^{\alpha}Y>1\}$ which gives

[TABLE]

For the first term above we use the crude (in time) bound

[TABLE]

The result follows from Corollary 3.12, (29) and Theorem 4.12. ∎

Proof of Proposition 4.17.

We write

[TABLE]

We start from the first term on the right hand side of (57),

[TABLE]

We have, as an application of Hölder's inequality, for $0\leq s\leq t\leq T$ ,

[TABLE]

where, in the last inequality, we used Lemma 4.3. Similarly, using Lemmas 4.3 and 4.4, we can bound the remaining terms,

[TABLE]

Summing up the previous inequalities, we get

[TABLE]

The second term in (57) is bounded as follows using Lemmas 4.19 and 4.15,

[TABLE]

The third term in (57) is

[TABLE]

We estimate the first term in the right hand side using Lemma 4.5,

[TABLE]

Similarly, using Lemmas 4.5 and 4.6,

[TABLE]

We estimate the last term in (58) using equation (44) and Lemma 4.6,

[TABLE]

Thus, there exists $\rho\geq 1$ (which may increase from one line to the next) such that the remainder satisfies, for all $s,t\in[0,T]$ ,

[TABLE]

In the last inequality we used Lemma 3.8 combined with Lemma 4.8, and also $\|x\|_{L_{t}^{\infty}}\leq T^{\alpha}[x]_{\alpha}+|x_{0}|$ . We check now the Gubinelli derivative, for each $j$ we have

[TABLE]

Similarly as for the remainder, we obtain the following,

[TABLE]

We conclude by using Lemma 4.18 to estimate $\|[x-y]_{\alpha}\|_{L_{\omega}^{2}}$ . ∎

5 Linear Rough PDE

Let $d,m\in\mathbb{N}$ be fixed and let $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ , for $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ . Let $\sigma$ and $\beta$ satisfy Assumptions 4.2, for $k$ large enough. In this section we prove well-posedness of measure-valued solutions to linear rough partial differential equations, which are formally given as

[TABLE]

To rigorously define the meaning of a solution to equation (59), we take a slightly more general approach, as described below.

Assumptions 5.1.

Let $n\in\mathbb{N}$ and $\alpha\in(\frac{1}{3},\bar{\alpha})$ .

(i)

Let $a:[0,T]\to C^{n+3}(\mathbb{R}^{d};\mathbb{R}^{d\times d})$ be a measurable path such that $a_{t}^{i,j}(x)\xi^{i}\xi^{j}\geq 0$ for all $x,\xi\in\mathbb{R}^{d}$ and $t\in[0,T]$ .

(ii)

Let $\mathbf{X}\in\mathscr{C}_{g}^{\alpha}([0,T];C^{n+3}_{b}(\mathbb{R}^{d};\mathbb{R}^{d}))$ be a geometric rough path, as described in Section 2.

The examples we have in mind are $a=\frac{1}{2}\sigma\sigma^{T}$ and $\mathbf{X}=\int\beta_{r}d\mathbf{Z}_{r}$ , as described in Proposition 5.5. In order to describe the main ideas, we argue now on a formal level assuming smoothness in time of $X$ ; rigorous definitions in the rough path case will be given later in the section. We study uniqueness of solutions to the following linear equation

[TABLE]

The proof is based on a backward duality trick; suppose we can show existence of a sufficiently regular solution to the backward PDE

[TABLE]

for a given final condition $u_{T}$ , then at least formally we have

[TABLE]

which shows that $\nu_{T}(u_{T})=\nu_{0}(u_{0})$ . Now, if $u_{T}$ is chosen in a class of functions large enough to fully determine $\nu_{T}$ , we see that it will be fully determined by $\nu_{0}$ and $u_{0}$ , thus showing uniqueness.
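The duality argument has an elementary finite-dimensional analogue, which may help fix ideas. The sketch below is an illustration only and is not taken from the paper: it assumes a hypothetical time-dependent matrix `A(t)` in place of the differential operator, a vector `nu` in place of the measure and a vector `u` in place of the test function. The forward equation is $\dot{\nu}_{t}=A(t)^{T}\nu_{t}$ and the backward equation is $\dot{u}_{t}=-A(t)u_{t}$, so that the pairing $\nu_{t}\cdot u_{t}$ has zero derivative.

```python
import numpy as np

def rk4(f, y0, t0, t1, steps):
    """Classical RK4 for y' = f(t, y), integrating from t0 to t1 (t1 < t0 is allowed)."""
    h = (t1 - t0) / steps
    t, y = t0, np.array(y0, dtype=float)
    for _ in range(steps):
        k1 = f(t, y)
        k2 = f(t + h / 2, y + h / 2 * k1)
        k3 = f(t + h / 2, y + h / 2 * k2)
        k4 = f(t + h, y + h * k3)
        y = y + h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        t += h
    return y

def A(t):
    # Hypothetical generator: a 3x3 matrix standing in for the operator.
    return np.array([[-1.0, np.sin(t), 0.0],
                     [0.5, -2.0, t],
                     [0.0, 1.0, -0.5]])

T = 1.0
nu0 = np.array([0.2, 0.5, 0.3])   # initial "measure"
uT = np.array([1.0, -1.0, 2.0])   # final "test function"

# Forward equation: d/dt nu_t = A(t)^T nu_t, i.e. d/dt nu_t(phi) = nu_t(A(t) phi).
nuT = rk4(lambda t, nu: A(t).T @ nu, nu0, 0.0, T, 2000)
# Backward equation: d/dt u_t = -A(t) u_t, solved from t = T down to t = 0.
u0 = rk4(lambda t, u: -A(t) @ u, uT, T, 0.0, 2000)

pairing_T = float(nuT @ uT)
pairing_0 = float(nu0 @ u0)
print(pairing_T, pairing_0)  # the two pairings agree up to discretization error
```

Since `uT` is arbitrary, the identity determines `nuT` from `nu0`, which is the finite-dimensional counterpart of the uniqueness statement.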

For simplicity only, we write equation (61) in divergence form and as a forward equation as follows

[TABLE]

which can be seen to be equivalent to (61) by replacing $X_{t}$ by $(\int_{0}^{t}\nabla a_{r}dr,X_{t})$ in (63) and then reversing time, i.e. $u_{t}\mapsto u_{T-t}$ .

The strategy to prove existence of a smooth solution to (63) is as follows. We first show how to give an intrinsic notion of solution of (60) and (63) in the context of the so-called unbounded rough drivers, see [2]. We then replace $X$ by smooth vector fields, in which case it is well known that there exists a unique solution of (63) which is smooth provided the coefficients are. We then consider the vector of derivatives $f=(u,\nabla u,\dots,\nabla^{n}u)$ and show that $f$ satisfies a vector valued equation, for which we can find bounds independent of $\dot{X}$ . The equation for $f$ will be solved in the space $L^{2}(\mathbb{R}^{d};\mathbb{R}^{N})$ , thus giving bounds on $u$ in the Sobolev space $H^{n}(\mathbb{R}^{d})$ .

Second, we approximate $\mathbf{X}$ by a sequence of smooth vector fields and show that the corresponding sequence of solutions converge to a meaningful solution of (63). Since the solution is in $H^{n}(\mathbb{R}^{d})$ we can use Sobolev embedding [4, Corollary 9.13] to show the needed spatial regularity to justify the computations in (62).

The techniques used to prove the first step are motivated by [2] and [11], and the main technical tool is the a priori estimate found in [11].

5.1 Unbounded rough drivers

We start by rephrasing (63) in terms of so-called unbounded rough drivers. The main motivation for doing so is the a priori estimate from [11].

Assume that $X$ is a smooth path, then equation (63) is well defined as a PDE. Integrating (63) from $s$ to $t$ we obtain

[TABLE]

Iterating the equation into itself we obtain

[TABLE]

where at least formally,

[TABLE]

and

[TABLE]

By the usual power counting the remainder term $u^{\natural}$ should be regular in time, but we notice that in general it is a distribution in space. Following [2] we call a scale of spaces a quadruple $(E^{n})_{n=0}^{3}$ of Banach spaces such that $E^{n+1}$ is continuously embedded into $E^{n}$ . Let $E^{-n}$ be the topological dual of $E^{n}$ (in general, $E^{-0}\neq E^{0}$ ).

Definition 5.2.

An unbounded $\alpha$ -rough driver on the scale $(E^{n})_{n}$ , is a pair $\mathbf{B}=(B^{1},B^{2})$ of mappings on $E^{n}$ such that

[TABLE]

and Chen’s relation is satisfied,

[TABLE]

We shall write $\|\mathbf{B}\|_{\alpha}$ for the smallest constant dominating the bounds in (66).

We show how to construct an unbounded rough driver given a rough path.

Proposition 5.3.

Let $N\in\mathbb{N}$ and $\mathbf{X}$ satisfy Assumption 5.1 (ii). Define for $\phi\in C^{\infty}_{c}(\mathbb{R}^{d};\mathbb{R}^{N})$

[TABLE]

where $\nabla^{\otimes}_{1}:C_{b}^{3}(\mathbb{R}^{d}\times\mathbb{R}^{d};\mathbb{R}^{d\times d})\rightarrow C_{b}^{2}(\mathbb{R}^{d}\times\mathbb{R}^{d};\mathbb{R}^{d})$ is the linear extension of the map defined on the algebraic tensor as

[TABLE]

Then $\mathbf{B}:=(B^{1},B^{2})$ is an unbounded rough driver on both scales $E_{n}:=W^{n,\rho}(\mathbb{R}^{d};\mathbb{R}^{N})$ , $\rho\geq 1$ , and $E_{n}:=C_{b}^{n}(\mathbb{R}^{d};\mathbb{R}^{N})$ . Moreover, the mapping $\mathbf{X}\mapsto\mathbf{B}$ is continuous in the operator norm.

Proof.

Let $0\leq s\leq\theta\leq t$ . By Chen’s relation for rough paths (10), and (68)

[TABLE]

which gives

[TABLE]

Continuity of the mapping follows immediately from the continuity of $\nabla_{1}^{\otimes}$ . ∎
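Chen's relation for the underlying rough path can be checked numerically on a smooth example. The sketch below is an illustration under stated assumptions and is not part of the paper: the path `path` is hypothetical, and the second level is taken to be $\mathbb{X}_{st}=\int_{s}^{t}(X_{r}-X_{s})\otimes dX_{r}$, a convention which may differ from the one used in (10) by a transposition. It verifies $\mathbb{X}_{st}-\mathbb{X}_{s\theta}-\mathbb{X}_{\theta t}=X_{s\theta}\otimes X_{\theta t}$ as well as the geometric identity $\mathbb{X}_{st}+\mathbb{X}_{st}^{T}=X_{st}\otimes X_{st}$.

```python
import numpy as np

# Gauss-Legendre nodes and weights on [-1, 1]; exact for the polynomial integrands below.
NODES, WEIGHTS = np.polynomial.legendre.leggauss(8)

def path(t):
    # Hypothetical smooth path in R^2, chosen only for illustration.
    return np.array([t, t**2 - t**3])

def dpath(t):
    # Time derivative of the path above.
    return np.array([1.0, 2 * t - 3 * t**2])

def first_level(s, t):
    return path(t) - path(s)

def second_level(s, t):
    """Iterated integral int_s^t (X_r - X_s) (x) dX_r, computed by quadrature."""
    r = 0.5 * (t - s) * NODES + 0.5 * (t + s)
    w = 0.5 * (t - s) * WEIGHTS
    return sum(wk * np.outer(path(rk) - path(s), dpath(rk)) for rk, wk in zip(r, w))

s, theta, t = 0.1, 0.4, 0.9
chen_defect = (second_level(s, t) - second_level(s, theta) - second_level(theta, t)
               - np.outer(first_level(s, theta), first_level(theta, t)))
X2 = second_level(s, t)
geometric_defect = X2 + X2.T - np.outer(first_level(s, t), first_level(s, t))
print(np.abs(chen_defect).max(), np.abs(geometric_defect).max())
```

Both defects vanish up to rounding because the integrands are polynomials of degree at most five, which the eight-point rule integrates exactly.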

We notice that there is no zero order term in the above unbounded rough driver. We include such a term by considering a rough path $\mathbf{X}\in\mathscr{C}^{\alpha}([0,T];C_{b}^{3}(\mathbb{R}^{1+d};\mathbb{R}^{1+d}))$ , i.e. with an additional spatial variable. Then, for $\phi\in C_{c}^{\infty}(\mathbb{R}^{d};\mathbb{R}^{N})$ let

[TABLE]

where we make the convention that summation over repeated indices is over $1\leq j\leq d$ , i.e. excluding the index of the additional variable.

With this in hand we can define the notion of a solution of (60).

Definition 5.4.

A path $\nu:[0,T]\rightarrow\mathcal{M}(\mathbb{R}^{d})\subset(C_{b}(\mathbb{R}^{d}))^{*}$ is a solution to (60) if for all $\phi\in C_{b}^{3}(\mathbb{R}^{d})$ the mapping defined by

[TABLE]

satisfies $|\nu_{st}^{\natural}(\phi)|\lesssim|t-s|^{3\alpha}\|\phi\|_{C_{b}^{3}}$ . Above $\mathbf{B}=(B^{1},B^{2})$ is the unbounded rough driver constructed from $\mathbf{X}$ as in Proposition 5.3.

We see now that, in the special case when $a=\frac{1}{2}\sigma\sigma^{T}$ and $\mathbf{X}=\int\beta_{r}d\mathbf{Z}_{r}$ , existence of solutions follows from the results of Sections 3 and 4.

Proposition 5.5.

Let $\rho\geq 2$ and let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},P)$ be a probability space that supports a $d$ -dimensional Brownian motion $W$ and an $\mathcal{F}_{0}$ -measurable random variable, $\Xi\in L^{\rho}(\Omega;\mathbb{R}^{d})$ such that the push-forward measure $P_{*}(\Xi)=\nu_{0}$ . Let $\mathbf{Z}\in\mathscr{C}_{wg}^{\bar{\alpha}}([0,T];\mathbb{R}^{m})$ be a weakly geometric rough path. Under Assumption 4.2, we have

(i)

$\mathbf{B}$ , generated by the rough path $\int\beta_{r}d\mathbf{Z}_{r}$ as in Proposition 5.3, is an unbounded rough driver as in Definition 5.2.

(ii)

There exists a solution $\nu$ of (60) driven by $\mathbf{B}$ , in the sense of Definition 5.4. This solution is given by $\nu_{t}=\mathcal{L}(x_{t})$ , where, for $P$ -a.e. $\omega\in\Omega$ , $x(\omega)$ is the unique solution to equation (20) with initial condition $\Xi(\omega)$ , driven by the random rough driver $\mathbf{F}$ constructed in Lemma 4.8.

Proof.

From the Sobolev embedding theorem [4, Corollary 9.13], we have $\beta\in\mathscr{D}_{Z}^{2\alpha}([0,T];C_{b}^{3}(\mathbb{R}^{d};\mathbb{R}^{d}))$ . Thus, using the construction (11), we have that $\int\beta_{r}d\mathbf{Z}_{r}$ is a rough path over $C_{b}^{3}(\mathbb{R}^{d};\mathbb{R}^{d})$ . The first claim follows now by Proposition 5.3.

We prove now the second claim. It follows from Proposition 4.13 that the stochastic process $(x_{t})_{t\in[0,T]}$ is adapted. We can thus define $\nu:=\mathcal{L}(x)$ and denote by $\nu_{t}$ the induced time-marginals. From Itô’s formula, Proposition 4.14, we get

[TABLE]

The proof is complete once we show that $\int_{0}^{t}\nu_{r}(\nabla\phi\beta_{r})d\mathbf{Z}_{r}$ has an expansion in terms of the unbounded rough driver. Recall that, from Lemma 4.15, we have

[TABLE]

and this gives, using the sewing lemma 2.1,

[TABLE]

Regrouping the terms we can write

[TABLE]

By definition of $B^{1}$ we get

[TABLE]

which gives

[TABLE]

Moreover

[TABLE]

This shows that we may rewrite the equation for $\nu$ as

[TABLE]

where $\nu^{\natural}\in C_{2}^{3\alpha}([0,T];(C_{b}^{3}(\mathbb{R}^{d}))^{*})$ is a remainder. ∎

5.2 A priori estimates for smooth vector fields

For this section we consider an approximation of equation (64), driven by a smooth (in time) driver,

[TABLE]

where $X$ is smooth. We will find bounds on $u$ in $H^{n}(\mathbb{R}^{d})$ depending only on a canonical unbounded rough driver generated by $X$ . The first step towards this goal is to write $u$ and all the derivatives as a vector in an $L^{2}$ space.

Let $u$ denote the (smooth) solution of (70) and let $f=(u,\nabla u,\dots,\nabla^{n}u)$ denote the vector of gradients, taking values in the truncated tensor algebra $T^{(n)}(\mathbb{R}^{d})=\bigoplus_{q=0}^{n}(\mathbb{R}^{d})^{\otimes q}$ . We will simply write $gv$ for the 1-contractive product

[TABLE]

e.g. for a $g\in(\mathbb{R}^{d})^{\otimes 2}$ and $v\in\mathbb{R}^{d}$ the product $gv$ has component $i$ given by $g_{ij}v_{j}$ .
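The contraction can be made concrete with arrays. The following sketch is an illustration and not part of the paper: it assumes that the 1-contractive product sums over the last index of the first factor and the first index of the second, which is consistent with the example $g_{ij}v_{j}$ above, and the helper name `contract` is hypothetical.

```python
import numpy as np

def contract(g, v):
    """1-contraction: sum over the last index of g and the first index of v."""
    return np.tensordot(g, v, axes=1)

d = 3
rng = np.random.default_rng(0)
g = rng.standard_normal((d, d))   # element of (R^d)^{(x)2}
v = rng.standard_normal(d)        # element of R^d
gv = contract(g, v)               # component i is the sum over j of g[i, j] * v[j]

# Higher-order case: tensors of order 3 and 2 contract to order 3 + 2 - 2 = 3.
g3 = rng.standard_normal((d, d, d))
h2 = rng.standard_normal((d, d))
print(gv.shape, contract(g3, h2).shape)
```

The order count in the last lines is the same bookkeeping used for the sum defining $M_{\dot{X}}^{(q)}$, where factors of order $q-j+1$ and $j+1$ produce a tensor of order $q$.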

Using the Leibniz formula we have

[TABLE]

where $M_{\dot{X}}^{(q)}:T^{(n)}(\mathbb{R}^{d})\rightarrow(\mathbb{R}^{d})^{\otimes q}$ is given by

[TABLE]

We notice that the above sum is in $(\mathbb{R}^{d})^{\otimes q}$ since we are doing a contractive product of $(\mathbb{R}^{d})^{\otimes(q-j+1)}$ and $(\mathbb{R}^{d})^{\otimes(j+1)}$ .

For each $q$ we have

[TABLE]

This gives that $f$ satisfies the $T^{(n)}(\mathbb{R}^{d})$ -valued equation

[TABLE]

where we have set

[TABLE]

Remark 5.6.

We notice that if we replace $X$ above by $X^{\epsilon}$ where $\mathbf{X}^{\epsilon}$ converges to a rough path $\mathbf{X}$ , then the corresponding coefficients $V^{\epsilon}$ , $Y^{\epsilon}$ have canonical rough path lifts, $\mathbf{V}^{\epsilon}$ and $\mathbf{Y}^{\epsilon}$ , with values in $C_{b}^{3}$ which remain bounded uniformly in $\epsilon$ . This comes from the fact that there are canonical iterated integrals between the $C_{b}^{3}$ -valued paths $t\mapsto\int_{0}^{t}a_{r}(x)dr$ and $t\mapsto X_{t}(x)$ ,

[TABLE]

where the first term is simply the Riemann-integral and the second term is defined using integration by parts as before.

Given the previous construction, we consider now a system of equations. We remark that this is not just a vector valued version of the results found in [18], since we are not interested in energy estimates. Indeed, the matrix $a$ is allowed to be degenerate but we require spatial smoothness. We consider the equation

[TABLE]

for given functions $a$ and $\dot{V},\dot{Y}$ smooth in time, and a given initial condition $f_{0}$ . The solution is a vector valued function $f:[0,T]\times\mathbb{R}^{d}\rightarrow\mathbb{R}^{N}$ , and the coefficients are of the form

[TABLE]

We will assume that $a$ is diagonal in (73), so component $l$ reads

[TABLE]

We begin with our main a priori estimate.

Proposition 5.7.

Assume $f$ is a solution of (73). Then there exists a constant $C=C(a,B^{1},B^{2})$ such that

[TABLE]

where $(B^{1},B^{2})$ is an unbounded rough driver depending only on the rough path lift of the path $(V,Y)$ .

Proof.

The finite-dimensional tensor $(f^{\otimes 2})^{n,l}:=f^{n}f^{l}$ then satisfies

[TABLE]

where

[TABLE]

both belong to the space $\mathcal{L}(\mathbb{R}^{N}\otimes\mathbb{R}^{N};\mathbb{R}^{N}\otimes\mathbb{R}^{N})$ . Define now the unbounded rough driver

[TABLE]

and the drift

[TABLE]

for functions $\phi:\mathbb{R}^{d}\rightarrow\mathbb{R}^{N}\otimes\mathbb{R}^{N}$ . This gives the dynamics

[TABLE]

on the scale $(W^{n,\infty}(\mathbb{R}^{d};\mathbb{R}^{N}\otimes\mathbb{R}^{N}))_{n}$ . Let $\phi\in W^{2,\infty}(\mathbb{R}^{d};\mathbb{R}^{N}\otimes\mathbb{R}^{N})$ and write

[TABLE]

which shows that $m^{\otimes 2}$ has bounded variation in $(W^{2,\infty}(\mathbb{R}^{d};\mathbb{R}^{N}\otimes\mathbb{R}^{N}))^{*}$ .

Now, by the a priori bounds, [11, Theorem 2.9], we get

[TABLE]

where $C$ depends on $\|\mathbf{B}^{\otimes 2}\|_{\alpha}$ . Testing $f^{\otimes 2}$ against the $N\times N$ identity matrix $I_{N\times N}$ and using that $a$ is positive semi-definite we get

[TABLE]

Note that $\|(\nabla f^{l})^{T}a\nabla f^{n}\|_{L^{1}}\leq N\|(\nabla f^{n})^{T}a\nabla f^{n}\|_{L^{1}}$ . Indeed, write $a=\frac{1}{2}\sigma\sigma^{T}$ and use the Cauchy-Schwarz inequality

[TABLE]

Summing over $l$ and $n$ gives that the above is bounded by $\frac{N}{2}\sum_{n=1}^{N}|\sigma^{T}\nabla f^{n}|^{2}$ . Integrating w.r.t. $x$ we get the claim.
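The pointwise estimate behind this claim can be sketched as follows. This is a reconstruction under the assumption $a=\frac{1}{2}\sigma\sigma^{T}$ stated above, using the Cauchy-Schwarz inequality followed by the elementary bound $2xy\leq x^{2}+y^{2}$; the intermediate step is our choice and the original display may be organized differently.

```latex
\begin{aligned}
|(\nabla f^{l})^{T}a\nabla f^{n}|
 &= \tfrac{1}{2}\,\big|(\sigma^{T}\nabla f^{l})\cdot(\sigma^{T}\nabla f^{n})\big|
 \leq \tfrac{1}{2}\,|\sigma^{T}\nabla f^{l}|\,|\sigma^{T}\nabla f^{n}|
 \leq \tfrac{1}{4}\big(|\sigma^{T}\nabla f^{l}|^{2}+|\sigma^{T}\nabla f^{n}|^{2}\big),\\
\sum_{l,n=1}^{N}|(\nabla f^{l})^{T}a\nabla f^{n}|
 &\leq \tfrac{1}{4}\cdot 2N\sum_{n=1}^{N}|\sigma^{T}\nabla f^{n}|^{2}
 = \tfrac{N}{2}\sum_{n=1}^{N}|\sigma^{T}\nabla f^{n}|^{2}.
\end{aligned}
```

The factor $2N$ arises because each index appears $N$ times in each of the two sums, and $\frac{1}{2}|\sigma^{T}\nabla f^{n}|^{2}=(\nabla f^{n})^{T}a\nabla f^{n}$ recovers the diagonal terms.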

If we choose $s,t$ such that $CN|t-s|^{\alpha}\leq\frac{1}{2}$ we get

[TABLE]

From the rough Gronwall lemma, [11, Lemma 2.11], the first bound of (75) holds.

For the second inequality we notice that the evolution of $f$ on $W^{n,2}(\mathbb{R}^{d};\mathbb{R}^{N})$ reads

[TABLE]

where $m_{t}=\int_{0}^{t}\operatorname{div}(a_{r}\nabla f_{r})dr$ and we have defined the unbounded rough driver

[TABLE]

Since the operator is self-adjoint it is easy to bound the variation of $m$ in $H^{-2}$ ;

[TABLE]

This gives, using [11, Theorem 2.9],

[TABLE]

where $C$ depends on $\|\mathbf{B}\|_{\alpha}$ and $\|a\|_{W^{1,\infty}}$ . Take now a mollifier $\psi_{\eta}$ and decompose $\phi=\psi_{\eta}*\phi+(I-\psi_{\eta})*\phi$ for any $\eta>0$ and any test function $\phi\in H^{1}(\mathbb{R}^{d};\mathbb{R}^{N})$ . This gives

[TABLE]

and for the smooth part $\psi_{\eta}*\phi$ we use the equation (77) to get

[TABLE]

Choosing $\eta=|t-s|^{\alpha}$ we get the second inequality in (75). ∎

5.3 Existence of a smooth solution

With the previous a priori estimates at hand, we are ready to prove existence of a solution.

Theorem 5.8.

Let Assumption 5.1 hold for $n>6+\frac{d}{2}$ and let $u_{0}\in C_{c}^{\infty}(\mathbb{R}^{d})$ be given. Then there exists a solution to (63) which belongs to $C_{b}^{6}$ and

[TABLE]

holds in $C^{3}_{b}$ in the sense that $u^{\natural}\in C_{2}^{3\alpha}([0,T];C_{b}^{3}(\mathbb{R}^{d}))$ , where $\mathbf{B}=(B^{1},B^{2})$ is the unbounded rough driver constructed from $\mathbf{X}$ as in Proposition 5.3.

Proof.

Denote by $u^{\epsilon}$ the solution of (63) when $X$ is replaced by $X^{\epsilon}$ , which we write

[TABLE]

Setting $f^{\epsilon}=(u^{\epsilon},\dots,\nabla^{n}u^{\epsilon})$ and choosing $N$ large (in fact $N=1+d+\dots+d^{n}$ ) we see that (71) is of the form (73) where $V^{\epsilon}$ and $Y^{\epsilon}$ are defined from $X^{\epsilon}$ using (72). We then build the unbounded rough drivers $\mathbf{B}^{\epsilon,\otimes 2}$ and $\mathbf{B}^{\epsilon}$ from $V^{\epsilon}$ and $Y^{\epsilon}$ according to (76) and (78) respectively.

By the assumptions on $a$ , $\mathbf{X}$ and $u_{0}$ we get

[TABLE]

for some constant $C$ . For $\phi\in H^{n+1}$ , define $\Phi\in L^{2}(\mathbb{R}^{d};T^{(n)}(\mathbb{R}^{d}))$ by $\Phi=(\phi,\nabla\phi,\dots,\nabla^{n}\phi)$ and notice

[TABLE]

Since $H^{n+1}$ and $H^{n-1}$ are dual w.r.t. the inner product on $H^{n}$ , we get $\|\delta u^{\epsilon}\|_{H^{n-1}(\mathbb{R}^{d})}\leq C|t-s|^{\alpha}$ . By similar reasoning we get $\|u^{\natural}_{st}\|_{H^{n-3}(\mathbb{R}^{d})}\leq C|t-s|^{3\alpha}$ using (79).

Since $u^{\epsilon}$ lies in a bounded set of $C^{\alpha}([0,T];H^{n-1}(\mathbb{R}^{d}))\cap C([0,T];H^{n}(\mathbb{R}^{d}))$ , by Arzelà-Ascoli there exists a subsequence $u^{k}:=u^{\epsilon_{k}}$ converging in $C([0,T];H^{n}_{w}(\mathbb{R}^{d}))$ to some element $u$ . Here $H_{w}^{n}(\mathbb{R}^{d})$ denotes $H^{n}(\mathbb{R}^{d})$ equipped with the weak topology. Choosing now $n>6+\frac{d}{2}$ and using Sobolev embedding [4, Corollary 9.13] we get that $u^{\epsilon,\natural}$ is bounded in $C^{3\alpha}_{2}([0,T];C^{3}_{b}(\mathbb{R}^{d}))$ and $u\in C([0,T];C_{b}^{6}(\mathbb{R}^{d}))$ .
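The index count behind the choice $n>6+\frac{d}{2}$ can be spelled out. The display below is a sketch relying on the standard Sobolev embedding on $\mathbb{R}^{d}$, applied once to $u$ in $H^{n}$ and once to the remainder, which was bounded in $H^{n-3}$ above.

```latex
H^{s}(\mathbb{R}^{d})\hookrightarrow C_{b}^{k}(\mathbb{R}^{d})
\quad\text{for } s>k+\tfrac{d}{2},
\qquad\text{so that}\qquad
n>6+\tfrac{d}{2}\ \Longrightarrow\
H^{n}(\mathbb{R}^{d})\hookrightarrow C_{b}^{6}(\mathbb{R}^{d})
\ \text{ and }\
H^{n-3}(\mathbb{R}^{d})\hookrightarrow C_{b}^{3}(\mathbb{R}^{d}).
```

The second embedding uses $n-3>3+\frac{d}{2}$, which is the same condition on $n$.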

It is straightforward to take the limit in (81) and use the uniform bounds on $u^{\epsilon,\natural}$ to obtain (80). ∎

5.4 Uniqueness

Theorem 5.9.

Let Assumption 5.1 hold for $n>6+\frac{d}{2}$ . Then solutions of (60) are unique.

Proof.

Let $\nu$ be a solution to (60), i.e. for all $\phi\in C_{b}^{3}$ we have

[TABLE]

where $\nu^{\natural}\in C_{2}^{3\alpha}([0,T];(C_{b}^{3}(\mathbb{R}^{d}))^{*})$ and $\mathbf{B}=(B^{1},B^{2})$ is the unbounded rough driver constructed from $\mathbf{X}$ . Let $u$ be the solution of the backward equation (61) with final condition $\psi\in C^{\infty}_{c}(\mathbb{R}^{d})$ so that

[TABLE]

holds in $C^{3}_{b}$ . We then have

[TABLE]

where we have defined

[TABLE]

and we have used that the path is geometric which gives $\nu_{s}(B^{1}_{st}B^{1}_{st}u_{s})=\nu_{s}(B^{2}_{st}u_{s})$ . Using the equations for $u$ and $\nu$ we get $\nu^{\sharp}\in C_{2}^{2\alpha}([0,T];(C_{b}^{3}(\mathbb{R}^{d}))^{*})$ and $u^{\sharp}\in C_{2}^{2\alpha}([0,T];C_{b}^{3}(\mathbb{R}^{d}))$ . Using this and analyzing every term in (82) we see that

[TABLE]

and in particular $\nu_{T}(\psi)=\nu_{0}(u_{0})$ . If $\bar{\nu}$ is any other solution with the same initial condition, the same analysis gives $\bar{\nu}_{T}(\psi)=\nu_{0}(u_{0})$ which gives that $\nu_{T}(\psi)=\bar{\nu}_{T}(\psi)$ . Since $\psi$ was arbitrary the result follows. ∎

6 The McKean-Vlasov equation

Let $d,m\in\mathbb{N}$ be fixed. Let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},P)$ be a complete filtered probability space and $W$ be a $d$ -dimensional Wiener process on it. Let $\Xi:\Omega\to\mathbb{R}^{d}$ be an $\mathcal{F}_{0}$ -measurable random variable. Let $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ , for $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ . Moreover let $\alpha\in(\frac{1}{3},\bar{\alpha})$ and $p=\frac{1}{\alpha}$ .

In this section we prove well-posedness of the equation

[TABLE]

We start by defining the notion of solution we shall use.

Definition 6.1.

Let $\rho\geq 1$ and $\alpha\in(\frac{1}{3},\frac{1}{2}]$ . We say that an $(\mathcal{F}_{t})_{t\geq 0}$ -adapted stochastic process $x:\Omega\times[0,T]\to\mathbb{R}^{d}$ is a solution to equation (83) with initial condition $\Xi\in L^{\rho}(\Omega,\mathcal{F}_{0};\mathbb{R}^{d})$ , if

(i)

$\mu_{t}:=\mathcal{L}(x_{t})$ is such that

[TABLE]

and $\mathbf{F}^{\mu}$ defined from $\sigma(\mu)$ and $\beta(\mu)$ as in Lemma 4.8 is a rough driver in the sense of Definition 3.1.

(ii)

$P$ -almost surely, $x$ satisfies

[TABLE]

in the sense of Definition 3.4.

Before proceeding we state the assumptions that will be in force throughout the section.

Assumptions 6.2.

Let $k>\frac{d}{2}+3$ and $\rho\geq 1$ ,

(i)

We assume $\beta\in\mathcal{L}(\mathbb{R}^{m},C_{b}^{3}\otimes H^{k})$ .

(ii)

Let $\sigma:\mathcal{P}_{\rho}(\mathbb{R}^{d})\to\mathcal{L}(\mathbb{R}^{d};H^{k})$ be a measurable function, such that there exists a constant $C_{\sigma}>0$ , with

[TABLE]

We now introduce a suitable space of measures which will be useful for proving well-posedness of (83). The setup is reminiscent of the controlled space as introduced in [17], but tailored for measures on path spaces.

Definition 6.3.

Let $\rho\geq 1$ . We say that a pair $(\mu,\gamma)\in\mathcal{P}_{\rho}(C^{\alpha}_{0}([0,T];\mathbb{R}^{d}))\times C^{\alpha}([0,T];\mathcal{L}(\mathbb{R}^{m};C_{b}^{3}(\mathbb{R}^{d};\mathbb{R}^{d})))$ is controlled by $Z$ provided for every $\phi\in C_{b}^{3}\otimes H^{k}$ we have that

[TABLE]

Here we used the notation

[TABLE]

For $\rho\geq 1$ , we denote by $\mathcal{M}_{Z}^{2\alpha,\rho}$ the set of all such controlled pairs equipped with the metric

[TABLE]

Remark 6.4.

We note that in Definition 6.1 (i) the law, $\mu_{t}=\mathcal{L}(x_{t})$ , of the solution is only defined for the time-marginals, and a priori it is not clear how to construct from this a measure on the path space $C_{0}^{\alpha}([0,T];\mathbb{R}^{d})$ . However, since $x$ satisfies the equation in Definition 6.1 (ii), $x$ is a random variable in $C^{\alpha}([0,T];\mathbb{R}^{d})$ , and letting $h\rightarrow 0$ in (25) and (49) we see that $x$ takes values in $C_{0}^{\alpha}([0,T];\mathbb{R}^{d})$ . Hence it induces the measure $\mathcal{L}(x)$ on $C_{0}^{\alpha}([0,T];\mathbb{R}^{d})$ which clearly has time-marginals $\mu_{t}$ .

Remark 6.5.

Let $\beta$ and $\sigma$ satisfy Assumption 6.2, with $k>\frac{d}{2}+3$ , and let $(\mu,\gamma)\in\mathcal{M}_{Z}^{2\alpha,\rho}$ . Then, $\sigma(\mu)$ and $\mu(\beta),\mu(\nabla_{1}\beta\gamma)$ satisfy Assumption 4.2. Assumption 4.2 (i) is verified by taking $\varphi=\beta^{i}$ , for $i=1,\dots,m$ , in Definition 6.3. Assumption 4.2 (ii) follows trivially from the boundedness in Assumption 6.2 (ii). It only remains to verify Assumption 4.2 (iii). For all $s,t\in[0,T]$ ,

[TABLE]

This gives that $\sigma\in C^{\alpha}H^{k}\subset C^{p-var}H^{k}$ , if $p=\frac{1}{\alpha}$ .

Theorem 6.6.

Suppose $\sigma$ and $\beta$ satisfy Assumption 6.2 and $\rho\geq 2$ . For any $\Xi_{0}\in L^{\rho}(\Omega,\mathcal{F}_{0};\mathbb{R}^{d})$ there exists a unique solution $x$ of (83) in the sense of Definition 6.1.

Proof.

We fix $\sigma,\beta$ satisfying Assumptions 6.2 and construct the following mappings

[TABLE]

and we shall use the notation $\Gamma(\mu,\gamma):=(\mathcal{L}(x),\beta(\mu))$ . By letting $h\rightarrow 0$ in (25) and (49) we see that $\mathcal{L}(x)$ is supported on $C_{0}^{\alpha}([0,T];\mathbb{R}^{d})$ . In Lemma 6.7 and Lemma 6.8 we show that $\Gamma$ is a contraction mapping on a subset of $\mathcal{M}_{Z}^{2\alpha,\rho}$ for a small time parameter $T_{0}\leq T$ . Then, noting that $T_{0}=T_{0}(\rho,\alpha,\sigma,\beta,\mathbf{Z})$ does not depend on the initial condition $\Xi_{0}$ , the solution can be constructed iteratively on the full time interval $[0,T]$ by concatenation of the solutions defined on $[0,T_{0}]$ , $[T_{0},2T_{0}]$ etc. ∎
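The structure of this argument (a contraction on a short interval whose length does not depend on the initial datum, followed by concatenation) can be illustrated on a toy problem. The sketch below is only an analogue and is not the construction of the paper: it assumes a scalar ODE $\dot{y}=f(y)$ with Lipschitz $f$ in place of the map $\Gamma$ on $\mathcal{M}_{Z}^{2\alpha,\rho}$, uses the supremum norm, and the function names are hypothetical.

```python
import numpy as np

def picard_on_interval(f, y0, t_grid, tol=1e-12, max_iter=200):
    """Iterate the Picard map (Gamma y)(t) = y0 + int_{t0}^t f(y(r)) dr to a fixed point."""
    y = np.full_like(t_grid, y0, dtype=float)
    h = t_grid[1] - t_grid[0]
    for _ in range(max_iter):
        fy = f(y)
        # Cumulative trapezoidal rule for the time integral.
        integral = np.concatenate(([0.0], np.cumsum(0.5 * h * (fy[1:] + fy[:-1]))))
        y_new = y0 + integral
        if np.max(np.abs(y_new - y)) < tol:
            return y_new
        y = y_new
    raise RuntimeError("Picard iteration did not converge")

def solve_by_concatenation(f, y0, T, T0, points_per_block=201):
    """Solve on [0, T] by restarting the fixed-point argument on blocks of length T0."""
    n_blocks = int(round(T / T0))
    ts, ys = [np.array([0.0])], [np.array([y0])]
    for k in range(n_blocks):
        grid = np.linspace(k * T0, (k + 1) * T0, points_per_block)
        # The block length T0 is the same for every block, whatever the restart value.
        y_block = picard_on_interval(f, ys[-1][-1], grid)
        ts.append(grid[1:])
        ys.append(y_block[1:])
    return np.concatenate(ts), np.concatenate(ys)

# f = cos has Lipschitz constant 1, so the Picard map contracts for T0 < 1.
t, y = solve_by_concatenation(np.cos, 0.0, T=2.0, T0=0.5)
exact = 2.0 * np.arctan(np.tanh(t / 2.0))  # closed-form solution with y(0) = 0
print(np.max(np.abs(y - exact)))
```

The point mirrored from the proof is that `T0` is fixed once from the Lipschitz constant and reused on every block, so finitely many restarts cover the whole interval.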

Lemma 6.7.

Define

[TABLE]

and the closed subset of $\mathcal{M}_{Z}^{2\alpha,\rho}$ ,

[TABLE]

Assume Assumption 6.2 with $\rho\geq 2$ . There exists a small time $T=T(\rho,\alpha,\sigma,\beta,\mathbf{Z})$ , such that $\Gamma$ leaves $\mathcal{B}_{T}$ invariant.

Proof.

We start by looking at the controlled function,

[TABLE]

To show the bounds on the rough driver, start by noting that, by linearity,

[TABLE]

and thanks to (84), $\|\sigma(\mu)\|_{L_{t}^{\infty}\mathcal{L}(\mathbb{R}^{d};H^{k})}\leq C_{\sigma}.$ This gives that for $(\mu,\gamma)\in\mathcal{B}_{T}$ , we have $L(\sigma(\mu),\beta(\mu),\mathbf{Z})\leq\bar{L}(\sigma,\beta,\mathbf{Z})$ , where $L$ is defined in (36). The previous observation and (49) imply

[TABLE]

for any $\alpha<\bar{\alpha}$ and for any $\rho\geq 1$ and for a random variable $K_{\rho}\in L^{\rho}(\Omega)$ . From the a priori estimates (26) we see that there exists a constant $C>0$ , depending only on $\rho$ (which may change from an inequality to the next), such that

[TABLE]

We may now choose $T\leq(3C\bar{L}^{1+1/\alpha})^{-\frac{2}{\bar{\alpha}-\alpha}}$ such that

[TABLE]

From Lemma 4.16 we get,

[TABLE]

and we choose $T\leq(3(1+C)(1+\bar{L}^{1+1/\alpha}))^{-\frac{2}{\bar{\alpha}-\alpha}}$ such that the above is bounded by $\frac{1}{3}$ . This shows that

[TABLE]

This, together with (87) implies $\Gamma(\mathcal{B}_{T})\subset\mathcal{B}_{T}$ . ∎

Lemma 6.8.

Assume Assumption 6.2 with $\rho\geq 2$ . There exists a constant $0<c<1$ and a small time $T=T(\rho,\alpha,\sigma,\beta,\mathbf{Z})$ , such that, for all $(\mu,\gamma),(\nu,\zeta)\in\mathcal{B}_{T}$ , we have

[TABLE]

Proof.

Let $M=K([\![\sigma]\!]_{p,[s,t]}^{\rho}+1)(L(\sigma,\beta,\mathbf{Z})+L(\theta,\gamma,\mathbf{Z}))$ be defined as in Proposition 4.17. We have seen in the proof of Lemma 6.7 that, for $(\mu,\gamma)\in\mathcal{B}_{T}$ , we have $L(\sigma(\mu),\beta(\mu),\mathbf{Z})\leq\bar{L}(\sigma,\beta,\mathbf{Z})$ . Moreover, from (84), we have $[\![\sigma(\mu)]\!]_{p}^{p}\leq C_{\sigma}T\left(\int_{C^{\alpha}_{0}}[\omega]_{\alpha}^{\rho}d\mu(\omega)\right)^{\frac{1}{\rho}}\leq\frac{T}{3}$ , for $\mu\in\mathcal{B}_{T}$ . Hence $M\leq K\bar{L}$ , for some universal constant $K=K(\alpha,\rho)$ . We estimate the Wasserstein distance of the image laws, as given in (85). From Lemma 4.17, there exist $\bar{\rho}\geq\rho$ and $C(T)>0$ , such that $\lim_{T\to 0}C(T)=0$ and

[TABLE]

We study now the Gubinelli derivative. For all $s,t\in[0,T]$ , we have

[TABLE]

Hence, using $\bar{\alpha}>\alpha$ and $\bar{L}\geq L$ ,

[TABLE]

For the last term in the definition of the metric $d$ , we have, using Proposition 4.17 and proceeding as in (89)

[TABLE]

We now add together (89), (90), and (91) to obtain

[TABLE]

Choosing $T=T(\rho,\alpha,\sigma,\beta,\mathbf{Z})$ small enough, depending on $\bar{L}$ , we conclude the proof. ∎

7 Non local rough PDEs

Let $d,m\in\mathbb{N}$ be fixed. Let $\mathbf{Z}\in\mathscr{C}_{g}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ , for $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ . Moreover let $\alpha\in(\frac{1}{3},\bar{\alpha})$ and $p=\frac{1}{\alpha}$ . Let $\sigma$ and $\beta$ satisfy Assumption 6.2.

We turn to the Fokker-Planck equation induced by the rough diffusion, which formally reads

[TABLE]

We define the notion of a solution in a similar way as in the linear case, Definition 5.4, but where now the unbounded rough driver depends on the solution itself.

Definition 7.1.

We say that a path $\mu:[0,T]\rightarrow\mathcal{P}_{\rho}(\mathbb{R}^{d})$ is a solution of (92) with initial condition $\mu_{0}\in\mathcal{P}_{\rho}(\mathbb{R}^{d})$ provided

(i)

for all $\varphi\in C_{b}^{3}\otimes H^{k}$ ,

[TABLE]

(ii)

$\mu$ satisfies (69) with the unbounded rough driver $\mathbf{B}=\mathbf{B}^{\mu}$ defined from

[TABLE]

as in Proposition 5.3, and $a_{t}=\frac{1}{2}\sigma(\mu_{t})\sigma(\mu_{t})^{T}$ .

Existence of a solution to (92) is relatively straightforward.

Theorem 7.2.

Suppose $\sigma$ and $\beta$ satisfy Assumptions 6.2, $\mu_{0}\in\mathcal{P}_{\rho}(\mathbb{R}^{d})$ for $\rho\geq 2$ and $\mathbf{Z}\in\mathcal{C}_{wg}^{\bar{\alpha}}([0,T];\mathbb{R}^{m})$ for $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$. Let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\in[0,T]},P)$ be a complete probability space that supports a $d$-dimensional Brownian motion $W$ and an $\mathcal{F}_{0}$-measurable random variable $\Xi\in L^{\rho}(\Omega;\mathbb{R}^{d})$ such that the push-forward measure $P_{*}(\Xi)=\mu_{0}$. Then there exists a solution $\mu$ of (92) in the sense of Definition 7.1. This solution is given by $\mu_{t}=\mathcal{L}(x_{t})$, where $x$ is the unique solution to the McKean-Vlasov equation (83) with initial condition $\Xi$, in the sense of Definition 6.1.

Proof.

The proof follows the same steps as that of Proposition 5.5, except that the unbounded rough driver now depends on the solution itself. ∎

The following result will be crucial for proving uniqueness of the non-local Fokker-Planck equation.

Proposition 7.3.

Let $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$, $\alpha\in(\frac{1}{3},\bar{\alpha})$ and let $\mathbf{Z}\in\mathscr{C}_{wg}^{\bar{\alpha}}([0,T],\mathbb{R}^{m})$ be weakly geometric. Define, for $(\mu,\gamma)\in\mathcal{M}_{Z}^{2\alpha,\rho}$ and $\phi\in C_{b}^{3}\otimes H^{k}$,

[TABLE]

Then $\mathbf{X}^{\phi}\in\mathscr{C}_{g}^{\alpha}([0,T];H^{k})$ .

Proof.

We prove this result in two steps. First we show that the controlled path $(\mu(\phi),\mu(\nabla_{1}\phi\gamma))$ can be continuously approximated by controlled paths which take values in a finite-dimensional space. This clearly gives that $\mathbf{X}^{\phi}$ can be approximated by a sequence of finite-dimensional rough paths. In the second step we use that the finite-dimensional rough path is weakly geometric to find a smooth approximation of $\mathbf{X}^{\phi}$.

Step 1. For simplicity we only show this for $\phi\in C^{3}_{b}(\mathbb{R}^{d})\otimes L^{2}(\mathbb{R}^{d})$; the general case follows by replacing $\phi$ by $D^{\beta}_{2}\phi$ for $|\beta|\leq k$. Let $\{e_{n}\}$ be an orthonormal basis of $L^{2}(\mathbb{R}^{d})$ and define

[TABLE]

We now show that $(\phi^{N}(\mu),\nabla\phi^{N}(\mu)\gamma)\rightarrow(\phi(\mu),\nabla\phi(\mu)\gamma)$ in $\mathscr{D}_{Z}^{2\alpha^{\prime}}([0,T];L^{2})$ for any $\alpha^{\prime}\in(\alpha,\bar{\alpha})$ .

Start with the first component.

[TABLE]

Now for fixed $\omega$ , $\theta$ and every $s,t\in[0,T]$ we have the monotone convergence

[TABLE]

as $N\rightarrow\infty$ since $\phi\in C_{b}^{3}(\mathbb{R}^{d})\otimes L^{2}(\mathbb{R}^{d})$ . Moreover, for fixed $N$ , as a function of $s$ and $t$ the above is continuous. By Dini’s theorem we get

[TABLE]

as $N\rightarrow\infty$ . This gives

[TABLE]

by monotone convergence. In a similar way one can show that $\nabla_{1}\phi^{N}(\mu\gamma)$ converges to $\nabla_{1}\phi(\mu\gamma)$ in $C^{\alpha}([0,T];L^{2}(\mathbb{R}^{d}))$ .

To see the convergence of the remainder, $\phi^{N}(\mu)_{st}^{\sharp}:=\delta\phi^{N}(\mu)_{st}-\nabla_{1}\phi^{N}(\mu_{s}\gamma_{s})Z_{st}$, we note first that this term is bounded in $C_{2}^{2\alpha}([0,T];L^{2}(\mathbb{R}^{d}))$. Furthermore, we write

[TABLE]

Using Dini’s theorem and monotone convergence as before we get that for any $\epsilon>0$ there exists $N_{\epsilon}$ such that for all $N\geq N_{\epsilon}$ we have $\sup_{s,t}\|\phi^{N}(\mu)_{st}^{\sharp}-\phi(\mu)_{st}^{\sharp}\|_{L^{2}}<\epsilon$ .

This gives, uniformly in $s,t$

[TABLE]

where we have used the geometric interpolation $a\wedge b\leq a^{1-\kappa}b^{\kappa}$ for any $\kappa\in(0,1)$ . By choosing $\kappa$ correctly we get $\phi^{N}(\mu)^{\sharp}\rightarrow\phi(\mu)^{\sharp}$ in $C_{2}^{2\alpha^{\prime}}([0,T];L^{2}(\mathbb{R}^{d}))$ .
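The interpolation step can be spelled out as follows. This is a generic sketch: $R^{N}_{st}$ stands for the difference of the remainders, while the constant $C$, the exponent $\theta$ and the sequence $\epsilon_{N}\to 0$ are placeholder names for the uniform Hölder-type bound and the uniform convergence obtained above; none of them is notation from the paper.

```latex
\[
  \|R^{N}_{st}\|_{L^{2}} \le C|t-s|^{\theta}
  \quad\text{and}\quad
  \sup_{s,t}\|R^{N}_{st}\|_{L^{2}} \le \epsilon_{N}
  \;\Longrightarrow\;
  \|R^{N}_{st}\|_{L^{2}}
  \le \big(C|t-s|^{\theta}\big)\wedge\epsilon_{N}
  \le C^{1-\kappa}\,\epsilon_{N}^{\kappa}\,|t-s|^{\theta(1-\kappa)} .
\]
```

Under these assumptions the seminorm of order $\theta(1-\kappa)$ of $R^{N}$ is at most $C^{1-\kappa}\epsilon_{N}^{\kappa}$, which tends to zero as $N\rightarrow\infty$; any exponent strictly below $\theta$ is reached by taking $\kappa$ small enough.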

Step 2. We now proceed to prove that $\mathbf{X}^{\phi}$ can be approximated by a smooth path. Let $\epsilon>0$ . From the above continuity we can choose $N$ such that

[TABLE]

where $\mathbf{X}^{\phi^{N}}$ is constructed by replacing $\phi$ with $\phi^{N}$ in (93).

As spelled out in Lemma A.3, there exist $\alpha<\alpha^{\prime}$ and a smooth path $X^{N,\epsilon}$ such that $[\mathbf{X}^{\phi^{N}}-\mathbf{X}^{N,\epsilon}]_{\alpha}<\frac{\epsilon}{2}$. This gives

[TABLE]

∎

Theorem 7.4.

Suppose $\sigma$ and $\beta$ satisfy Assumptions 6.2 for $k>9+d$ and $\mu_{0}\in\mathcal{P}_{\rho}(\mathbb{R}^{d})$ is given with $\rho\geq 2$. Then there exists at most one solution $\mu$ of (92) in the sense of Definition 7.1.

Proof.

Let $\mu$ be a solution of (92). From the assumptions on $\beta$ and $\sigma$ we may construct the time-dependent coefficients $(\sigma(\mu),(\beta(\mu),\nabla_{1}\beta(\beta(\mu)\mu)))$, from which we construct the rough driver $\mathbf{F}^{\mu}$ as in Lemma 4.8. Denote by $x^{\mu}$ the solution of

[TABLE]

i.e. $dx^{\mu}_{t}=\mathbf{F}_{dt}^{\mu}(x^{\mu}_{t})$. From Proposition 4.14 we see that the law $\nu_{t}:=\mathcal{L}(x^{\mu}_{t})$ satisfies

[TABLE]

as in Definition 7.1, where $X_{st}(x)=\int_{s}^{t}\beta(\mu_{r},x)d\mathbf{Z}_{r}$ and $\mathbb{X}_{st}(x,y)=\int_{s}^{t}\beta(\mu_{r},x)\int_{s}^{r}\beta(\mu_{u},y)d\mathbf{Z}_{u}d\mathbf{Z}_{r}$ . From the assumption on $\beta$ , the Sobolev embedding [4, Corollary 9.13] $H^{k}\subset C_{b}^{n+3}(\mathbb{R}^{d};\mathbb{R}^{d})$ for $k>\frac{d}{2}+n+3$ and Proposition 7.3 we see that $\mathbf{X}\in\mathscr{C}_{g}^{\alpha}([0,T];C_{b}^{n+3}(\mathbb{R}^{d};\mathbb{R}^{d}))$ . Now if $n>6+\frac{d}{2}$ , we get from Theorem 5.9 that there exists at most one solution of (94). In particular, we see that $\mu_{t}=\nu_{t}$ which gives that $x^{\mu}$ is a solution of (83). Since this equation is well-posed, this uniquely describes $\mu$ . ∎
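The threshold $k>9+d$ in Theorem 7.4 is consistent with the two regularity requirements used in the proof. The short computation below is only a bookkeeping sketch that combines the Sobolev embedding condition with the condition on $n$ needed for Theorem 5.9.

```latex
\[
  k > \tfrac{d}{2} + n + 3
  \quad\text{and}\quad
  n > 6 + \tfrac{d}{2}
  \;\Longrightarrow\;
  k > \tfrac{d}{2} + \Big(6 + \tfrac{d}{2}\Big) + 3 = 9 + d .
\]
```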

Appendix A Appendix

A.1 Kolmogorov continuity theorem

In this section we prove a Kolmogorov continuity type theorem for rough drivers. The proof is done exactly as in [15, Theorem 3.1], so we only sketch the proof to convince the reader that the steps are the same.

Theorem A.1.

Suppose $\mathbf{F}=(F,\mathbb{F})$ is a random rough driver such that

[TABLE]

for $q$ and $\beta$ such that $q\beta>1$ . Then for every $\alpha\in(0,\beta-\frac{1}{q})$ we have

[TABLE]

and if $\beta-\frac{1}{q}>\frac{1}{3}$ then $\mathbf{F}$ is a rough driver for $\alpha\in(\frac{1}{3},\beta-\frac{1}{q})$.

Proof.

Take $T=1$ for simplicity and denote by $D_{n}$ the uniform partition of $[0,1]$ with mesh $2^{-n}$ and let

[TABLE]

By assumption on $\mathbf{F}$ we get

[TABLE]

Let $s,t\in\bigcup D_{n}$ and choose $m$ such that $|D_{m+1}|<|t-s|\leq|D_{m}|$ . There exists a partition $\{t_{i}\}_{i=0}^{N}$ of $[s,t]$ such that $(t_{i},t_{i+1})\in D_{n}$ for some $n\geq m+1$ , and for each fixed such $n$ there are at most two such intervals from $D_{n}$ . We get

[TABLE]

and using $\mathbb{F}_{st}=\sum_{i=0}^{N-1}\left(\mathbb{F}_{t_{i}t_{i+1}}+\nabla F_{t_{i}t_{i+1}}F_{st_{i}}\right)$, which is easily seen from Chen’s relation, we get

[TABLE]

This gives

[TABLE]

where

[TABLE]

which belongs to $L^{q}(\Omega)$ and $L^{q/2}(\Omega)$ respectively. This proves the claim. ∎
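The chaining argument relies on covering $[s,t]$ by dyadic intervals with at most two intervals of each length. A minimal Python sketch of one such decomposition is given below; the function name `dyadic_decomposition`, the greedy strategy and the sample endpoints are illustrative choices and are not taken from the paper or from [15].

```python
from collections import Counter
from fractions import Fraction


def dyadic_decomposition(s, t, level):
    """Split [s, t] into dyadic intervals, where s < t are multiples of 2**-level.

    Returns a list of (left, right, n) with right - left == 2**-n.
    """
    scale = 2 ** level
    a, b = int(s * scale), int(t * scale)
    pieces = []
    while a < b:
        # Largest p such that 2**p divides a and the block [a, a + 2**p] fits in [a, b].
        p = 0
        while a % (2 ** (p + 1)) == 0 and a + 2 ** (p + 1) <= b:
            p += 1
        pieces.append((Fraction(a, scale), Fraction(a + 2 ** p, scale), level - p))
        a += 2 ** p
    return pieces


s, t = Fraction(3, 64), Fraction(45, 64)
pieces = dyadic_decomposition(s, t, level=6)
# Number of intervals used from each dyadic level n.
per_level = Counter(n for _, _, n in pieces)
```

The block lengths first increase and then decrease along the decomposition, which is why each level contributes at most two intervals; this is what makes the sums over $n$ in the proof geometric.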

A.2 Weakly geometric rough paths

We prove that rough path integration w.r.t. a weakly geometric rough path yields a weakly geometric rough path.

Lemma A.2.

Assume $\mathbf{Z}$ is weakly geometric and $E$ is a separable Hilbert space and $(Y,Y^{\prime})\in\mathscr{D}_{Z}^{2\alpha}([0,T];E)$ . Then the rough path $\mathbf{X}$ defined by

[TABLE]

is also weakly geometric.

Proof.

Let $\{e_{i}\}$ be an orthonormal basis of $E$ and use the component notation

[TABLE]

The components of the integrals may thus be spelled out

[TABLE]

where the above are scalar integrals defined by their local expansions

[TABLE]

respectively. Since $\Xi^{i,j}_{st}-X_{s}^{i}\Xi_{st}^{j}=Y_{s}^{i,l}Y_{s}^{j,k}\mathbb{Z}_{st}^{l,k}$ and by definition of $X$ we get

[TABLE]

which gives

[TABLE]

Now, since $\mathbf{Z}$ is weakly geometric we have

[TABLE]

which gives

[TABLE]

It is straightforward to check that the above left hand side is the increment from $s$ to $t$ of the function $t\mapsto\mathbb{X}_{0t}^{i,j}+\mathbb{X}_{0t}^{j,i}-X_{t}^{i}X_{t}^{j}$ . Since $3\alpha>1$ we get that this function is constant and equal to 0. ∎
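The identity at the end of the proof, $\mathbb{X}^{i,j}_{0t}+\mathbb{X}^{j,i}_{0t}=X^{i}_{t}X^{j}_{t}$, can be checked numerically in the simplest setting. The sketch below is an illustration only: it assumes a piecewise-linear path in $\mathbb{R}^{3}$ started at the origin of its increments and builds its canonical second-level lift by Chen’s relation; the function name `lift_piecewise_linear` and the random test path are hypothetical.

```python
import numpy as np


def lift_piecewise_linear(points):
    """Return (X, XX): total increment and second iterated integral of the
    piecewise-linear path through `points` (array of shape (n + 1, d))."""
    d = points.shape[1]
    X = np.zeros(d)
    XX = np.zeros((d, d))
    for delta in np.diff(points, axis=0):
        # Chen's relation for appending one linear segment with increment `delta`;
        # the segment's own second level is 0.5 * delta (x) delta.
        XX = XX + np.outer(X, delta) + 0.5 * np.outer(delta, delta)
        X = X + delta
    return X, XX


rng = np.random.default_rng(0)
points = np.cumsum(rng.normal(size=(50, 3)), axis=0)
X, XX = lift_piecewise_linear(points)
# Weak geometricity: the symmetric part of XX is determined by the first level.
symmetric_defect = np.max(np.abs(XX + XX.T - np.outer(X, X)))
```

For smooth or piecewise-linear paths the defect vanishes up to rounding; Lemma A.2 shows that the same symmetry survives rough integration against a weakly geometric $\mathbf{Z}$.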

In the next lemma we show how to construct the approximation in Proposition 7.3.

Lemma A.3.

Fix $N,K,d,m>0$ , $\bar{\alpha}\in(\frac{1}{3},\frac{1}{2})$ and let $\mathbf{Z}\in\mathscr{C}_{wg}^{\bar{\alpha}}([0,T];\mathbb{R}^{m})$ be a weakly geometric rough path. Moreover, for $i=1,\dots,d$ , $n=1,\dots,N$ and $k=1,\dots,K$ , let $e_{n}\in L^{2}(\mathbb{R}^{d})$ be an orthonormal basis and $\theta^{i,k,n}\in\mathscr{D}_{Z}^{2\alpha^{\prime}}([0,T],\mathbb{R})$ , for $\alpha^{\prime}\in(\frac{1}{3},\bar{\alpha})$ . Let $\phi=\phi^{i,k}=\sum_{n=1}^{N}\theta^{i,k,n}e_{n}$ and construct $\mathbf{X}^{\phi}$ as in (93). Then, for every $\alpha\in(\frac{1}{3},\alpha^{\prime})$ there exists $\mathbf{X}^{\epsilon}$ such that

[TABLE]

Proof.

We take $(\bar{e}_{i})_{i=1,\dots,d}$ an orthonormal basis of $\mathbb{R}^{d}$ and, for $\bar{\imath}=1,\dots,dN$, we define $\xi^{\bar{\imath}}:=e_{n}\bar{e}_{i}\in L^{2}(\mathbb{R}^{d};\mathbb{R}^{d})$, where $\bar{\imath},i,n$ satisfy the relation

[TABLE]

Let $V^{N}$ be the finite dimensional vector space defined as

[TABLE]

We note that $\dim(V^{N})=dN$ . On this space we construct a rough path as follows, for $\bar{\imath},\bar{\jmath}=1,\dots,dN$ ,

[TABLE]

Here and in the following we always assume that the triples $(\bar{\imath},i,n)$ and $(\bar{\jmath},j,m)$ satisfy relation (95). Moreover, we always use the convention that we are summing over repeated indices, in this case $k,l=1,\dots,K$ . It is immediate to see that $\mathbf{X}^{\phi}=(\sum_{\bar{\imath}=1}^{dN}X^{\bar{\imath}},\sum_{\bar{\imath},\bar{\jmath}=1}^{dN}\mathbb{X}^{\bar{\imath},\bar{\jmath}})$ .

We prove now that $(X,\mathbb{X})$ is geometric, i.e. that the following relation holds

[TABLE]

Let us look in more detail at what the tensor product on the right-hand side is, for $\bar{\imath},\bar{\jmath}=1,\dots,dN$,

[TABLE]

Each of these terms is a tensor product which is mostly zero. Let us now describe each component of (96). We start by introducing the indexes

[TABLE]

From now on we assume that the pairs $(\imath,f)$ and $(\jmath,g)$ always satisfy the previous relation. We obtain

[TABLE]

Similarly, we see that

[TABLE]

The symmetry condition reduces to verifying the scalar equality

[TABLE]

which is satisfied thanks to Lemma A.2.

The rough path $\mathbf{X}^{\phi}$ is thus in $\mathscr{C}_{wg}^{\alpha^{\prime}}([0,T],V^{N})$ . Since $V^{N}$ is a finite dimensional space, we can find a smooth approximation $\mathbf{X}^{\epsilon}$ in $\mathscr{C}^{\alpha}([0,T],V^{N})$ , for some $\alpha\in(\frac{1}{3},\alpha^{\prime})$ . Hence, since $V^{N}\subset L^{2}(\mathbb{R}^{d};\mathbb{R}^{d})$ , this is also an approximation in $\mathscr{C}^{\alpha}([0,T],L^{2}(\mathbb{R}^{d};\mathbb{R}^{d}))$ . ∎

A.3 A separable subspace of the Hölder space

Proposition A.4.

The space $C_{0}^{\alpha}([0,T];E)$ is equal to the closure of $C^{1}([0,T];E)$ with respect to the $C^{\alpha}$ -topology. In particular, $C_{0}^{\alpha}([0,T];E)$ is separable if $E$ is separable.

Proof.

For simplicity we assume $E=\mathbb{R}$ . We clearly have $[f]_{\alpha,h}\leq h^{1-\alpha}\|\nabla f\|_{\infty}$ so that $C^{1}([0,T])\subset C_{0}^{\alpha}([0,T])$ , which shows one inclusion by taking the closure.
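The bound $[f]_{\alpha,h}\leq h^{1-\alpha}\|\nabla f\|_{\infty}$ is easy to observe numerically. The sketch below assumes that $[f]_{\alpha,h}$ denotes the supremum of $|f_{t}-f_{s}|/|t-s|^{\alpha}$ over $0<|t-s|\leq h$, and evaluates a grid version of it; the helper `small_scale_holder`, the grid and the test function $\sin(2\pi t)$ are illustrative choices.

```python
import numpy as np


def small_scale_holder(f_vals, grid, alpha, h):
    """Discrete version of [f]_{alpha,h}: sup over grid pairs with 0 < |t - s| <= h."""
    dt = np.abs(grid[:, None] - grid[None, :])
    df = np.abs(f_vals[:, None] - f_vals[None, :])
    mask = (dt > 0) & (dt <= h)
    return np.max(df[mask] / dt[mask] ** alpha)


alpha = 0.4
grid = np.linspace(0.0, 1.0, 401)
f_vals = np.sin(2 * np.pi * grid)  # a C^1 function with sup|f'| = 2*pi
lipschitz = 2 * np.pi
seminorms = {h: small_scale_holder(f_vals, grid, alpha, h) for h in (0.2, 0.05, 0.0125)}
```

The computed values decrease with $h$ and stay below $h^{1-\alpha}\cdot 2\pi$, in line with the inclusion $C^{1}([0,T])\subset C_{0}^{\alpha}([0,T])$.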

To see the reverse inclusion, we take $f\in C_{0}^{\alpha}([0,T])$, a standard mollifier $\rho_{n}(u)=n\rho(nu)$ and let $f^{n}_{t}=\int_{0}^{T}f_{u}\,\rho_{n}(t-u)du=\int_{t-T}^{t}f_{t-u}\,\rho_{n}(u)du$. Then $f^{n}$ is smooth and we get for $|t-s|\leq h$

[TABLE]

so that $[f^{n}]_{\alpha,h}\leq[f]_{\alpha,h}$ . Let us show that $f^{n}$ converges uniformly to $f$ .

[TABLE]

which converges to 0 uniformly in $t$ .

Now, write

[TABLE]

which gives

[TABLE]

By assumption on $f$ , letting $h\rightarrow 0$ gives that $f^{n}\rightarrow f$ in $C^{\alpha}([0,T])$ . ∎

Bibliography


[1] I. Bailleul, R. Catellier, and F. Delarue. Mean field rough differential equations. arXiv preprint arXiv:1802.05882, 2018.
[2] I. Bailleul and M. Gubinelli. Unbounded rough drivers. Annales de la Faculté des Sciences de Toulouse. Mathématiques, 26(4):795–830, 2017.
[3] I. Bailleul and S. Riedel. Rough flows. To appear in Journal of the Mathematical Society of Japan, arXiv preprint: https://arxiv.org/abs/1505.01692, 2018.
[4] H. Brezis. Functional analysis, Sobolev spaces and partial differential equations. Universitext. Springer, New York, 2011.
[5] R. Carmona, F. Delarue, and D. Lacker. Mean field games with common noise. Ann. Probab., 44(6):3740–3803, 11 2016.
[6] T. Cass, C. Litterer, and T. Lyons. Integrability and tail estimates for Gaussian rough differential equations. Ann. Probab., 41(4):3026–3050, 07 2013.
[7] T. Cass and T. Lyons. Evolving communities with individual preferences. Proc. Lond. Math. Soc. (3), 110(1):83–107, 2015.
[8] M. Coghi and B. Gess. Stochastic nonlinear Fokker-Planck equations. arXiv preprint arXiv:1904.07894, 2019.
