
TL;DR
This paper develops a new rough path framework to analyze nonlinear Fokker-Planck equations driven by rough signals, specifically addressing McKean-Vlasov diffusions with common noise, and establishes their well-posedness.
Contribution
It introduces a self-contained nonlinear rough integration theory and defines solutions for the associated Fokker-Planck equations, proving their well-posedness.
Findings
Established a well-posedness theory for rough McKean-Vlasov equations.
Developed a novel nonlinear rough integration framework.
Provided a solution concept for rough Fokker-Planck equations.
Abstract
We consider a nonlinear Fokker-Planck equation driven by a deterministic rough path which describes the conditional probability of a McKean-Vlasov diffusion with "common" noise. To study the equation we build a self-contained framework of non-linear rough integration theory which we use to study McKean-Vlasov equations perturbed by rough paths. We construct an appropriate notion of solution of the corresponding Fokker-Planck equation and prove well-posedness.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Rough nonlocal diffusions
Michele Coghi , Torstein Nilssen
WIAS Berlin, Mohrenstraße 39, 10117 Berlin. Support from the Berlin Mathematics Research Center MATH+ is gratefully acknowledged.
Institute of Mathematics, Technical University of Berlin, Germany, Financial support by the DFG via Research Unit FOR 2402 is gratefully acknowledged.
Abstract
We consider a nonlinear Fokker-Planck equation driven by a deterministic rough path which describes the conditional probability of a McKean-Vlasov diffusion with "common" noise. To study the equation we build a self-contained framework of non-linear rough integration theory which we use to study McKean-Vlasov equations perturbed by rough paths. We construct an appropriate notion of solution of the corresponding Fokker-Planck equation and prove well-posedness.
MSC Classification Numbers: 60H05, 60H15, 60J60, 35K55.
Key words: Rough paths, Stochastic PDEs, McKean-Vlasov, non-local equations.
Contents
1 Introduction
The term diffusion is sometimes used interchangeably when talking either about the macroscopic (Eulerian) description of the density of a substance occupying some space or the infinitesimal (Lagrangian) description of the particles of the substance. Many physical phenomena are however inherently nonlinear in the sense that the dynamic of the system will depend not only on space but also on the density of the substance itself. In this paper we study this type of nonlinear diffusion from both the Eulerian and Lagrangian perspective when the diffusion is perturbed by a rough path. We are motivated by dynamics that arise from interacting particle systems with common noise;
[TABLE]
Here each particle is influenced by 2 independent sources of noise, the Brownian motion 111Since we will in this paper only consider geometric rough paths, we shall consider Stratonovich integration for this term. is visible for all particles (common noise) and the Brownian motion represents a noise term specific for particle . Since is influencing every particle, taking the limit will only average out the individual noise terms, giving, at least formally, the mean-field dynamics
[TABLE]
We note that the conditional law heuristically satisfies the non-local Fokker-Plank equation
[TABLE]
where we have used the notation etc. and for a matrix valued function . In fact, we can also address the case when is a certain type of Lipschitz nonlinearity on , where denotes the set of probability measures on , see Assumption 6.2. We will only address the case when and are linear in their second argument.
In practice, (2) is difficult to solve since it needs to be formulated on a very large state space, namely where is the underlying probability space. Even when is finite, this space is too large to do analysis since it is difficult to find compact subsets that is used for proving well-posedness of (1) and (2). For a long time, well-posedness for equation (2) was known only for densities, see [20]. A proper well-posedness result in the space of measures was obtain just very recently in [8].
In this paper we take a different approach, namely we study equation (1) for a fixed sample path of the Brownian motion. Our method relies on the theory of rough paths and as such, allows the study of (1) where is replaced by any path that can be lifted to a rough path. In particular, no markovianity or martingale structure is needed for the common noise.
From now on we replace by a (deterministic) rough path , and equation (2) becomes
[TABLE]
The main contribution of this paper is the following.
Theorem** (see Theorems 7.2 and 7.4).**
Given a probability measure on with finite -th moment, for any , there exists a unique measure-valued path , which solves (3) with initial condition .
Moreover we will prove in Theorem 7.2 that the unique solution is given as , namely the law of solution to the McKean-Vlasov equation
[TABLE]
We will show well-posedness of (4) in Section 6.
The strategy to prove uniqueness to equation (3) relies on showing that every solution must be the law of the McKean-Vlasov equation. As it will be clear in the proof of Theorem 7.4, this also necessitates to be able to have well-posedness of the equation
[TABLE]
for given time inhomogeneous functions , and , where the time dependence is induced by the law. Moreover, a common approach to proving well-posedness of (4) is to construct the solution as a fixed point in the space of measures on an appropriate function space. Towards this end one would e.g. define inductively
[TABLE]
Once again, it is necessary to give a meaning to equation (5). If we consider the case and the equation reads
[TABLE]
It is well-known that the above integration does not make sense unless we impose additional structure on , namely that there exists a Taylor-type expansion around the irregular path , which is exactly the notion of controlled rough paths as introduced by Gubinelli in [17]. If one aims to solve a mean-field equation on the form
[TABLE]
where denotes the law of , and is an appropriate function on the space of measures, it is reasonable to expect that has such a decomposition and that one could solve the equation as a fixed-point in an appropriate space of measures.
Following this logic, if we want to consider the equation with added Brownian motion (4) as a fixed-point, this would necessitate being able to solve equation (5). The usual way, see [12], [13] and [14], to study this hybrid rough path and Itô equation is to consider the joint rough path
[TABLE]
and recast the equation on the form of a rough path equation
[TABLE]
Again, one would need to make an expansion of in terms of the path . However, thinking towards the goal of solving mean-field equations, the simplest examples shows that there is no reason to expect that is controlled by a fixed Brownian path in any sense - the law of the solution is an average over all Brownian sample paths.
Instead, if we define as a Wiener-Itô integral and as a rough path integral, then on small time scales one would expect
[TABLE]
to be small, so that one could use and to define a notion of non-linear 222We choose to call the integration non-linear since a mapping is obviously never linear. integration. At the heart of all stochastic integration is the difficulty that the above is not enough to guarantee a canonically defined integration map in the pathwise sense. The most fundamental understanding of the rough path theory is that one can construct integrals once additional information about the driving path is given by some off-line argument e.g. stochastic integration.
Existing literature
The stochastic equation, i.e. (1) and (2) has been studied in [20] and [21] but focusing on the case where the initial condition has a density. The measure-valued case was studied very recently in [8]. Under more restrictive conditions, either on the class of solutions or on the coefficients (like strong parabolicity), the well-posedness of solutions to SPDE of the type (2) had been previously considered by Dawson, Vaillancourt in [10].
McKean-Vlasov equations from a rough path perspective has already been introduced in [7] and, more recently in [1], focusing on the Lagrangian description. In [1] the equation is driven by a general random rough path, which gives the additional difficulty of needing to keep track of the rough path as a -valued path. The latter space is present to consider a probability measure as the law of a random variable and Lions’ approach to calculus for the Wasserstein metric. The approach by Gubinelli on controlled rough paths is then used to solve the equation as a fixed-point in the mixed and -space.
We mention also [5] where the authors study mean-field games in the presence of a common noise as in (1). The authors use tightness arguments along with approximations to prove existence of a (probabilistically) weak solutions. Then, the authors prove a Yamada-Watanabe type principle for these equations to prove existence and uniqueness of (probabilistically) strong solutions.
In Section 3 we build a version of the rough path theory that allow for time dependent coefficients. The results in this section should be compared to [3] where the authors solves equations on this form. There, the main focus is flows build from a non-linear version of the sewing lemma. Very recently, right before the completion of the present paper, the authors of [23] introduce the very same object, here called a nonlinear rough path. The authors use a similar set up as in [17] to solve rough equations with time-dependent coefficients.
The papers [3] and [23] does not contain the same precise estimates as the present paper, which is crucially needed to set up a contraction mapping for the McKean-Vlasov equation (4).
Main contributions
The main contribution of this paper is the formulation and well-posedness of the nonlinear Fokker-Planck equation in terms of the appropriate rough path topology. We believe this is the first paper to study a rough non-local diffusion from both the Lagrangian and Eulerian perspective. Furthermore we believe it is the first work to prove well-posedness of an equation with a nonlinearity in the noise term on this form.
It is plausible that the well-posedness of the McKean-Vlasov equation equation in the present paper can be seen as a particular case of the equation studied in [1] by doing a rough path lift of and as in (6), but now as a rough path with values in an -space. However, our proof of the well-posedness of the nonlinear Fokker-Planck equation necessitate well-posedness of a rough path equation with time-dependent coefficients. As already mentioned, it is not reasonable to expect that the coefficients could be controlled by a single Brownian path thus one could not use [1] for the time dependent case. Moreover, for the same reason, time dependent coefficients are also needed to understand the McKean-Vlasov equation as a fixed point of linear diffusions in an appropriate space of measures.
In addition, we prove a result on existence of a solution to a linear, possibly degenerate, rough PDE which could be of independent interest.
Structure of the paper
The paper is structured as follows. In Section 2 we introduce the necessary concepts from rough path theory, including controlled rough paths, that will be needed for the paper. In Section 3 we introduce the corresponding integration theory to handle non-linear integration and differential equations. In Section 4 we show how to concretely build rough drivers from Itô integration theory and the theory of controlled paths. These examples will also act exactly as the rough drivers needed to formulate the McKean-Vlasov equation as a fixed point. Moreover, this section contains an average, in , Itô formula that allows us to prove that the law of a diffusion solves the Fokker-Planck equation (linear or nonlinear). In Section 5 we prove well-posedness for a linear RPDE with time dependent coefficients. In Section 6 we construct the appropriate space for solving the McKean-Vlasov equation. In Section 7 we prove uniqueness of our main equation, which hinges on the results of the previous sections.
2 Notations and preliminary results
2.1 Hölder and p-variation spaces
For we let denote the simplex . For and a Banach space we denote by the space of all continuous mappings such that
[TABLE]
It can be checked that the above space is independent of , and we will write for simplicity . When it is clear from the context, we will also omit the Banach space , writing and . We let denote the space of all paths such that the increment belongs to . For simplicity we will write . It is well known that local and global Hölder norms are comparable for paths, in the sense that
[TABLE]
for all (see Exercise 4.25 in [15]). It is well known that the Hölder spaces are not separable. However, the subspace
[TABLE]
is separable, as proved in Proposition A.4.
We let be the space of all continuous mappings such that
[TABLE]
where the above supremum is taken over all partitions of . If we define it can be shown that is a control, namely continuous and superadditive i.e. . Moreover, we see that if there exists a control such that , then , so that we could equivalently define
[TABLE]
We will write and when the space is clear from the context we will simply write and .
To see the relationship between Hölder continuity and -variation, notice that for any partition we have
[TABLE]
when , which gives the bound
[TABLE]
Given a control , we construct the greedy partition, following [15, Chapter 11]; for , define the partition as
[TABLE]
so that , for all , and . Define now the integer
[TABLE]
2.2 Rough paths
Assume is a Banach space and equip with the projective tensor norm. We call a pair
[TABLE]
for a rough path provided Chen’s relation,
[TABLE]
holds where we have defined the second order increment operator . We denote by the (non-linear) set of all rough paths which we equip with the subset metric,
[TABLE]
For a path of bounded variation, there is a canonical rough path, where the latter is the iterated integral \big{(}\int Z\otimes dZ\big{)}_{st}=\int_{s}^{t}Z_{sr}\otimes dZ_{r} which is well defined when is of bounded variation. We denote by the set of geometric rough paths, which is the closure of the set of bounded variation paths in the rough path metric.
We notice that if is geometric, then is also weakly geometric which means , and we denote by the set of all such rough paths. When is finite dimensional it is known that (see e.g. [16, Proposition 8.12]) if is weakly geometric, there exists a sequence of smooth paths such that in for all .
Controlled space
Given a path taking values in we denote by the (linear) space of all controlled path, given by pairs of mappings
[TABLE]
such that
[TABLE]
We call the Gubinelli derivative of . The above definition is sometimes better understood in coordinates where we abuse notation and write for the matrix representing the Gubinelli derivative. Above and for the remainder of the paper we shall use the convention of summation over repeated indices. We equip the space of all controlled paths with the norm
[TABLE]
Sewing lemma and rough path integration
We recall here the main result used to obtain estimates in the theory of rough paths, namely the sewing lemma.
Lemma 2.1**.**
Suppose is such that
[TABLE]
for some and . Then there exists a unique pair and such that
[TABLE]
with for depending only on .
In fact, we have and we think of as being an integral with local expansion .
With this in hand we can define the rough path integral. Given a rough path and a controlled path , define the local expansion
[TABLE]
Using Chen’s relation it is straightforward to check that and we shall write .
This construction also gives rise to a new rough path, namely
[TABLE]
where the latter integral is defined by the local expansion
[TABLE]
One can then check that and that this operation is continuous from to . Moreover, at least when is a separable Hilbert space, weak geometricity is preserved under rough path integration as spelled out in Lemma A.2.
We shall also use the sewing lemma to get a priori estimates by a slight (straightforward) generalization of the sewing lemma. Assume that is such that there exists controls and and a positive function such that
[TABLE]
for some . Then there exists a universal constant such that
[TABLE]
2.3 Taylor’s formula
For a path and a function (where is a finite-dimensional vector space) we use the notation
[TABLE]
With this notation at hand the first and second order Taylor’s formula reads
[TABLE]
respectively. We obviously get .
2.4 Wasserstein metric
We shall work with the Wasserstein metric on measures on Hölder spaces, but since separability of the underlying space is required for the Wasserstein metric to give a complete space, we shall use the subspaces . When the dimension is clear from the context we shall simply write . Given two probability measure say that is a coupling of and provided its first (respectively second) marginal is equal to (respectively ). We define the Wasserstein metric
[TABLE]
where the above infimum ranges over all couplings of the measures and . Since is separable we have that is a complete space w.r.t. .
We note that the -th moment of a probability measure can be written where is the Dirac-Delta centered in the path constantly equal to [math].
2.5 Spatial function spaces
We fix . For any multi-index , we set
[TABLE]
and . For and an integer , we let be the Sobolev space of real-valued functions on with finite norm
[TABLE]
Let , be the Sobolev space of square integrable functions over , endowed with the norm . For a Hilbert space , we endow the space of linear functionals with the Hilbert-Schmidt norm
[TABLE]
Moreover, we call the space of -valued, time-continuous, square integrable martingales endowed with the norm
[TABLE]
Let . We denote by the space of continuous functions such that
- (i)
For all , the function . 2. (ii)
For all , the function . 3. (iii)
We have
[TABLE]
We endow the space with the induced norm . Above we have used the Frechet derivative in the first variable and the weak derivative in the second variable.
Contrary to , this space is well suited for the convolution and we see that if .
3 Non linear integration
In this section we build the theory of rough paths to accommodate for time-dependent coefficients. We aim to solve the equation
[TABLE]
for given function which is a distribution in time but regular in space. We shall use the framework akin to the definition by Davie in [9]. To illustrate the set up, assume that is a smooth solution of (17). Integrating the equation and using Taylor’s formula we obtain
[TABLE]
Here we have defined the driver of the equation as follows
[TABLE]
and the remainder as
[TABLE]
With the above notation, we rewrite equation (17) as
[TABLE]
As is usual in rough path theory, we shall now read the definition (18) in the opposite direction - we assume we are given a pair of functions satisfying some compatibility conditions (in Definition 3.1 below), and take this as a definition of the non-linearity . We will then take to be implicitly defined and say that is a solution provided is of high time regularity.
We can read (17) in integral form as and can be regarded as a rough version of the semimartingale integration theory by Kunita in [19].
We shall use a similar definition as in [3], with a noticeable difference that we allow our driver to depend on two spatial points. Moreover, we will not only be dealing with weakly geometric drivers.
Definition 3.1**.**
For , a pair of functions is called a -rough driver provided Chen’s relation,
[TABLE]
holds. The set of all such pairs is equipped with the metrics
[TABLE]
Most of the time we will work on the diagonal of the spatial points and write simply , and we shall also write .
For a pair of functions is called an -rough driver provided (21) holds. The set of all such pairs is equipped with the metric
[TABLE]
Remark 3.2**.**
The reason for using both -variation and -Hölder continuous drivers is that the construction using Kolmogorov continuity theorem (Lemma 4.3, below) gives us more easily bounds in the sense of Hölder continuity. However, to estimate the difference between two solutions we need exponential bounds, and it is well known that even when is a Brownian motion, the random variable is not exponentially integrable. This problem is circumvented by using -variation, more specifically using the local accumulation , see Section 4.2 for the details.
From (8) it is clear that if is an -rough driver, then it is also a -rough driver with . When the notion is clear from the context, we shall simply say that is a rough driver.
Example 3.3**.**
Consider a rough path , where we identify with a subspace 333Since we are on the unbounded domain , we don’t know if one can identify these spaces, but the inclusion is enough for our purposes of so that Chen’s relation reads
[TABLE]
Let now and where is the multiplication of vector fields, i.e. the linear extension of the mapping defined by
[TABLE]
It is straightforward to check that this gives a rough driver, and we notice that the mapping is continuous.
With this at hand we can define the notion of a solution.
Definition 3.4**.**
Let be a rough driver as in Definition 3.1 and . A path is called a solution to (20) provided defined by
[TABLE]
is such that .
Remark 3.5**.**
One drawback with this method compared to linear integration is the lack of "universality" in the Itô-Lyons map; recall that the stochastic equation
[TABLE]
and its corresponding mapping can be factorized into a discontinuous map, and a continuous one . One of the nice features of this decomposition is the fact that is universal in the sense that it does not depend on the vector field driving the equation, which allows to fix a subset for which one can do deterministic analysis on the differential equation.
In our case, however, the subset of will depend on the driving vector fields since we are building a non-linear integration theory depending on the coefficients.
3.1 A priori estimates
Let be a -rough driver and assume is a solution of equation (20) in the sense of Definition 3.4. In this section we use (12) and (13) to deduce a priori estimates. We let be the smallest control such that
[TABLE]
Define the controlled quantity,
[TABLE]
Lemma 3.6**.**
Let , we have the following chain rule, ,
[TABLE]
Proof.
We have from Taylor’s formula
[TABLE]
where
[TABLE]
By the definition of brackets (14), we get
[TABLE]
The result follows. ∎
With this in hand we turn to an a priori estimate for the nonlinear RDE.
Proposition 3.7**.**
Let . There exists constants and depending only on such that for all such that we have
[TABLE]
Proof.
We start with the easily verifiable identity for a function and path
[TABLE]
Using Chen’s relation we get
[TABLE]
We get from Lemma 3.6, provided
[TABLE]
and clearly
[TABLE]
From the sewing lemma there exists a constant such that
[TABLE]
From equations (22) and (23) we have
[TABLE]
and consequently
[TABLE]
If now is such that we get
[TABLE]
which gives
[TABLE]
∎
The above bound translates now to global estimates on the solution itself in the following way.
Lemma 3.8**.**
Assume now that is an -rough driver with . Then we have, for small enough depending on ,
[TABLE]
Moreover, we have the global estimate
[TABLE]
for a constant depending only on .
Proof.
Since is Hölder continuous we have for all . Choose now such that where is as in Proposition 3.7. For we have
[TABLE]
from which (25) follows.
From (7) we get, choosing now ,
[TABLE]
for some universal constant depending only on . ∎
3.2 A priori contractive estimates
Let , and assume , are two -rough drivers. We take two solutions and of equation (20) in the sense of Definition (3.4), with initial conditions and and driven by and respectively.
To illustrate the ideas of this section, we give the following remark.
Remark 3.9**.**
Assume that , , and are smooth in time, so that we can write
[TABLE]
where we have used Gronwall’s inequality in the last step. The purpose of this subsection is to replicate these estimates also for the rough case. The steps are similar to the previous subsection, except we compare two solutions.
We start by writing
[TABLE]
Let and so that the above gives the estimate
[TABLE]
We begin with the analogue of Lemma 3.6 that allows us to estimate nonlinearities of the remainders.
Lemma 3.10**.**
Let . Then using the notation as in Lemma 3.6 we have the estimate
[TABLE]
Proof.
We write
[TABLE]
The first two terms above can be written
[TABLE]
Which gives the bound
[TABLE]
Now write
[TABLE]
We see that
[TABLE]
which gives (28). ∎
Proposition 3.11**.**
Assume that and are -rough drivers with . Then there exists universal constants such that
[TABLE]
Moreover,
[TABLE]
[TABLE]
for all such that . In particular, we have uniqueness for equation (20) and the solution is continuous w.r.t. the initial condition.
Proof.
Using Chen’s relation we get
[TABLE]
Replacing and in (28) we get
[TABLE]
Replacing and in (28) we get
[TABLE]
Use also the estimate
[TABLE]
Let now be such that which gives
[TABLE]
From (12) and (13) we get that there exists a universal constant such that
[TABLE]
Choose now such that , so that
[TABLE]
For the solution we have
[TABLE]
Let now be such that we get
[TABLE]
where . From the rough Gronwall lemma, [11, Lemma 2.11], we get
[TABLE]
and we notice that this holds for all subintervals , i.e. no smallness assumption. Now, choose the finest partition of such that . We have
[TABLE]
and on we get
[TABLE]
provided . An easy induction shows that
[TABLE]
By definition of the greedy partition (9) we get
[TABLE]
Letting and using the bound this shows (29). To see (30) we plug the above into (27) to get
[TABLE]
using in the last step. Using (33) gives
[TABLE]
This gives (30). The bound (31) is proved in a similar way. ∎
Corollary 3.12**.**
Assume that and are -rough drivers with . Then there exists a universal constant such that,
[TABLE]
Proof.
Use bounds on the form for all in inequality (34). This gives the Hölder estimate
[TABLE]
which holds when is such that we have , in particular when .
Let be the constant given by Proposition 3.11 and set . It follows by (7) and Proposition 3.11 that (the value of changes in the following lines, but it only depends on )
[TABLE]
This concludes the proof. ∎
3.3 Well-posedness of nonlinear RDEs
Since uniqueness of equation (20) follows from Proposition 3.11, it is only left to prove existence of a solution. We do so by using a Picard iteration.
Theorem 3.13**.**
Let be a -variation rough driver. There exists a unique solution of equation (20), in the sense of Definition 3.4, with initial condition .
Proof.
Uniqueness is given by Proposition 3.11. We study now existence. Define , and
[TABLE]
which gives
[TABLE]
Consequently, there exists a pair such that
[TABLE]
and we have for some universal constant .
We prove inductively that there exists universal constants and such that for we have and .
Given and we let
[TABLE]
We then get
[TABLE]
which gives
[TABLE]
provided is such that . This gives that there exists such that
[TABLE]
so will do. Provided is such that we also get
[TABLE]
which proves the induction hypothesis.
From Arzelà-Ascoli we get that there exists a subsequence converging in to some element . Clearly we get
[TABLE]
Since all the terms of (35) (or rather, the one with replaced by ) converges, we get that also must converge to a limit denoted . Then and satisfies (22) and from the uniform bounds on we see that indeed is a solution. ∎
4 Rough non-linearities
In this section we show how to construct the rough drivers that are used for solving the McKean-Vlasov equation (4). We start by constructing rough drivers corresponding to Itô theory, i.e. given a vector field and a Brownian motion , we want to define
[TABLE]
where the latter integration is in the sense of Itô. As the following example demonstrates, it is not possible to simply integrate a function to produce a rough driver.
Example 4.1**.**
Let and , then the mapping is -a.s. unbounded as . Indeed, let and and for , then is an i.i.d. Gaussian sequence, which -a.s. diverges.
The above example shows that we need some decay on our vector fields as . We choose to assume that belongs to a Sobolev space where is large enough to use Sobolev embedding to show that is a rough driver. The reason for this choice is the relatively simple and well established theory of Itô integration that is available for Hilbert spaces. We conjecture that this regularity can be significantly lowered (e.g. with decay as in [3, Corollary 9]) and leave this for future investigation.
Let be fixed and let , for . In this section we assume the following
Assumptions 4.2**.**
Let , and ,
- (i)
Let , as in Section 2. 2. (ii)
Let be a continuous function, such that
[TABLE] 3. (iii)
Let , then
[TABLE]
To simplify the following discussion, we introduce the convenient notation
[TABLE]
4.1 Construction of the rough driver
4.1.1 Itô theory
Let be a filtered probability space and let be a -dimensional Wiener process on it. We assume that satisfies Assumption 4.2 (ii), for . We define, for ,
[TABLE]
where the integral is defined in the sense of Itô on Hilbert spaces, see [22, Section 2]. Thanks to Burkholder-Davis-Gundy (BDG) inequality for Hilbert spaces, [22, Theorem 2.4.7], we have for all and ,
[TABLE]
We consider now the time-continuous stochastic process,
[TABLE]
with Hilbert-Schmidt norm (15) bounded as , for all . Using again Itô theory on Hilbert spaces, we have that and we set, for ,
[TABLE]
Applying again BDG inequality and inequality (38), we have for all and ,
[TABLE]
Lemma 4.3**.**
Let be a -dimensional Wiener process on the filtered probability space and let satisfy Assumption 4.2 (ii), with . Let and be defined as in (37) and (39), respectively. Then, for every , for -a.e.
[TABLE]
is a rough driver in the sense of Definition 3.1, and for all , we have
[TABLE]
Moreover, on small time-intervals we have, for ,
[TABLE]
Proof.
We first study the space regularity of . From the choice of , Sobolev’s embedding Theorem [4, Corollary 9.13] and inequalities (38) and (40), we have that
[TABLE]
[TABLE]
By the Kolmogorov continuity theorem A.1, we obtain (41).
We check now that Chen’s relation (21) holds -a.s.. Indeed, we have the following,
[TABLE]
To justify the last equality we call and we note that is an -measurable random variable taking values in the space of linear operators between two Hilbert spaces. Thanks to the fact that the operator is measurable with respect to the left-most point of the integral, one can easily adapt [22, Lemma 2.4.1] to show that it commutes with the stochastic integral.
∎
We shall also need contractive estimates w.r.t. the vector field.
Lemma 4.4**.**
Let and satisfy Assumption 4.2 (ii), with . Let and be rough drivers as constructed in Lemma 4.3 w.r.t. the vector fields and . Then, for all and all , there exists , such that for all ,
[TABLE]
Proof.
The proof follows as an application of Kolmogorov continuity theorem as in Lemma 4.3. ∎
4.1.2 Gubinelli integration
Let , and let satisfy Assumption 4.2 (i), for and . Using Gubinelli’s integration theory (see [15, Chapter 4]) we define, for each ,
[TABLE]
which satisfies (see [15, Theorem 4.10])
[TABLE]
and we have,
[TABLE]
For , we define and we consider , with Gubinelli derivative
[TABLE]
Consequently we can define the integral via the local expansion
[TABLE]
Defining
[TABLE]
we get
[TABLE]
We have the following lemmas of which we omit the proofs as they follow quite easily from the discussion above, standard computations on rough integrals, and Sobolev embedding Theorem [4, Corollary 9.13].
Lemma 4.5**.**
Let and . Assume that satisfies Assumption 4.2 (i), with and . Let and be defined as in (43) and (45), respectively. Then,
[TABLE]
is a rough driver in the sense of Definition 3.1 and we have for time intervals of size ,
[TABLE]
Lemma 4.6**.**
Let and . Assume that and satisfy Assumption 4.2 (i), with and . Let and be rough drivers constructed as in Lemma 4.5. Then, on time intervals of size ,
[TABLE]
Let us show that the above definition coincides with the usual definition of solutions of rough path equations.
Lemma 4.7**.**
Suppose is a solution of in the sense of Definition 3.4. Then also solves the classical rough path equation driven by with coefficient , i.e. satisfies the following equation in the sense of Davie [9],
[TABLE]
where the is also controlled by with Gubinelli derivative .
Proof.
Assume is a solution to the non-linear equation and let us show that it also satisfies
[TABLE]
for some remainder . By definition of we have
[TABLE]
Moreover
[TABLE]
by definition of and . This shows that which proves that the solutions coincide. Notice that the above bounds depend on only. ∎
4.1.3 Mixed Itô and rough path integration
Let be a -dimensional Wiener process on the filtered probability space . Let , . Assume that and satisfy Assumption 4.2 (ii) and 4.2 (i) respectively, for and . Let be defined as in (37) and be defined as in (43). We define
[TABLE]
We remark that the first term on the right hand side of the above equation is random, whereas the second is deterministic. Define heuristically
[TABLE]
The first two terms in the right hand side are defined as in (39) and (45) respectively, we need to make the last two rigorous. For the third term, using the Itô theory in Hilbert spaces as we did is Section 4.1.1, we see that the integral
[TABLE]
is well-defined. Indeed, we have for all . Hence, we can define
[TABLE]
Similarly, we have and . We define
[TABLE]
Lemma 4.8**.**
Let be a -dimensional Wiener process on the filtered probability space . Let and . Assume that and satisfy Assumption 4.2 (ii) and 4.2 (i) respectively,, with and . Let and be defined as in (47) and (48), respectively. Then, for -a.e. ,
[TABLE]
is a rough driver in the sense of Definition 3.1. Moreover, on time intervals of size we have that, for all , there exists , such that, -a.s.,
[TABLE]
where is defined in (36).
Proof.
It is immediate to verify that the couple satisfies Chen’s relation (21). We give now estimates on the first order term (47). As a consequence of the definition of and Lemma 4.5, we have, on an interval of size ,
[TABLE]
We use now Lemma 4.3 to control the first term in the right hand side.
Now we study the regularity of . Using BDG inequality [22, Theorem 2.4.7] and inequality (44), we have for all and , ,
[TABLE]
By Kolmogorov continuity theorem A.1, we obtain that for every there exists , such that
[TABLE]
Similar considerations lead to
[TABLE]
Putting together the last inequalities, Lemma 4.3 and Lemma 4.5 yields
[TABLE]
Inequality (49) follows immediately from the Sobolev embedding theorem [4, Corollary 9.13] . ∎
Lemma 4.9**.**
Let be a -dimensional Wiener process on the filtered probability space . Let and . Assume that satisfy Assumption 4.2 (ii) and that satisfy Assumption 4.2 (i), with and . Let and be nonlinear rough drivers constructed from and as in Lemma 4.8.
Then, for all , there exists , such that for any time interval of size ,
[TABLE]
*where we set and is defined as in (36). *
Proof.
We already have contractive estimates from Lemmas 4.4 and 4.6 for the Itô and Gubinelli terms. We look now at the mixed integrals. For every , we have, for ,
[TABLE]
The same estimates is true for the other mixed term. We can conclude by applying Kolmogorov continuity theorem. ∎
4.2 Integrability of the random rough driver
In this section we are concerned with the study of exponential moments of the random rough driver. We will use the approach introduced by [6] and described in [15, Chapter 11].
Lemma 4.10**.**
Let be the canonical Wiener space with Cameron-Martin space . We define on this space the canonical Wiener process as . Let and . Assume that and satisfy Assumption 4.2, with and , and let be defined as in Lemma 4.8. Let and , such that . Then, there exists and a null set , such that, , and ,
[TABLE]
where, is defined as
[TABLE]
Proof.
The proof of this result follows very closely the proof of [15, Theorem 11.5]. We repeat here the important pieces, where the dependence of the stochastic integrals on the space parameter has to be taken into account. We look at the first order term of . By definition, we have
[TABLE]
For every , the term is constructed as an limit, hence there exists a sequence of partitions and a null set such that
[TABLE]
for every . We call the intersection of over all dyadic times and we note that it is still a null set. Similarly, we can construct a null set such that the function is of bounded -variation for every . Let , we have,
[TABLE]
The first limit on the right hand side exists because of the choice of the null set that we made in (52). The last limit is well defined as a Young integral, since and are of complementary variation, see [15, Section 4.1]. Hence, also the left hand side of 53 converges and is, by definition, .
Hence, we obtain, , , and for all dyadic times ,
[TABLE]
To generalize to any subset , we can use a continuity argument, see [15, Theorem 11.5].
We compute now the -variation in equation (54) and we obtain
[TABLE]
Proceeding similarly for the second order term , we have that there exists a null set such that , and for all times ,
[TABLE]
to obtain the third term on the right hand side, we used stochastic Fubini Theorem as follows
[TABLE]
We compute the -variation for the second order term. Using inequalities of the type , for , we obtain, for all ,
[TABLE]
where is defined in (51). This concludes the proof ∎
For every , we define the control and we construct the greedy partition, following the construction in Section 2.1. Let be defined as in (9), for any . We call the integer-valued random variable given by
[TABLE]
for . For , let
[TABLE]
be the cumulative distribution function of a standard Gaussian random variable and . We include a straightforward Lemma needed to estimate .
Lemma 4.11**.**
Let and . If is a positive random variable such that , for every , then
[TABLE]
Proof.
We use elementary considerations and Fubini theorem, to obtain
[TABLE]
∎
Theorem 4.12**.**
Under the same assumptions of Lemma 4.10, the random variable defined in (55) has a Gaussian tail. Moreover, there exists , such that is bounded when is small and for all ,
[TABLE]
where is defined in (36).
Proof.
The main ingredient, which is still to prove, is that, for -a.e. ,
[TABLE]
where is defined as in (51) and . The proof of this inequality follows from Lemma 4.10 in the same way as the proof of [15, Lemma 11.12]. It follows from [15, Proposition 11.2], that we can take , to obtain
[TABLE]
withe . By assumption, , and are of finite -variation. This implies that is almost surely finite and we can apply the generalized Fernique Theorem [15, Theorem 11.7] as follows. We set and defined as in (51). We must now find such that the following set has positive measure,
[TABLE]
We know from Lemma 4.8 that . From Chebychev inequality, we have (where may change from a term to the next)
[TABLE]
Using the previous estimates, we obtain that,
[TABLE]
where is again allowed to increase in the last inequality. Moreover,
[TABLE]
If we now fix , we have that . From Fernique Theorem [15, Theorem 11.7], we have, for ,
[TABLE]
where and . By our choice of and the monotonicity of , we have that , which is a universal constant depending only on , but can be negative. It follows from (56) that as . We apply Lemma 4.11 that, with (chosen so that ), and as before and .
[TABLE]
The constant is allowed to change again in the last line, but one can easily see that it remains bounded, when is small enough. ∎
4.3 The average Itô formula
In this section we prove a version of the Itô formula which we need to make the connection between (3) and (4). We note that at the present level of knowledge, we don’t know how to make an -a.s. Itô formula, but we only have the chain rule when we average over .
Proposition 4.13**.**
Let a complete filtered probability space and be a -dimensional Wiener process on it. Let , . Assume that and satisfy Assumption 4.2, for and . Let be defined as in Lemma 4.8.
Let be the solution to equation (20) driven by with initial condition , in the sense of Definition 3.4, given by Proposition 3.13.
Let be an -measurable random variable. Then the process is adapted. Moreover, is a random variable with values in .
Proof.
Let and call the restriction of on the interval . We know from Proposition 3.11 that
[TABLE]
is a continuous mapping. Moreover the random variable is -measurable. Hence,
[TABLE]
is -measurable.
In a similar way we see that is a random variable in , since is measurable and is continuous w.r.t. the rough driver. ∎
Proposition 4.14**.**
Under the same assumptions as Proposition 4.13, let . If , endowed with the norm defined in (16), then
[TABLE]
where is controlled by with Gubinelli derivative .
Before we proceed with the proof of Proposition 4.14, we prove two technical lemmas.
Lemma 4.15**.**
Under the same assumptions as Proposition 4.13, let be defined in (23). For any and , we have
[TABLE]
where is defined in (36).
Proof.
Define the random variable as in Proposition 3.7 which gives that for we have Writing gives
[TABLE]
Now trivially by the definition of , we have
[TABLE]
and the result follows from Lemma 4.8. ∎
Lemma 4.16**.**
Under the same assumptions as Proposition 4.13, we have
[TABLE]
with bounds, on a time interval of size ,
[TABLE]
where is defined in (36).
Proof.
We do a first order Taylor expansion to obtain
[TABLE]
We have defined
[TABLE]
We first make some deterministic bounds (i.e. uniformly in )
[TABLE]
Using that is adapted we get so that
[TABLE]
Write now
[TABLE]
and the result follows from Lemma 4.15 with . ∎
Proof of Proposition 4.14.
We do a third order Taylor expansion to obtain, -a.s.,
[TABLE]
Where we have defined
[TABLE]
As in Lemma 4.7 we note that
[TABLE]
is uniformly in bounded by depending only on . Moreover, since is geometric and is a symmetric bilinear mapping we get
[TABLE]
where denotes the symmetric tensor product. This is clearly bounded by .
Using Lemma 4.15 with and and taking the expectation of we obtain the result. ∎
To create the contraction mapping in the appropriate space of measures we shall need to control the difference of two measures induced by two rough SDEs.
Proposition 4.17**.**
Let a complete filtered probability space and be a -dimensional Wiener process on it. Let , . Assume that and satisfy Assumption 4.2, for , and . Let and be nonlinear rough drivers constructed from and as in Lemma 4.8. Moreover, let be an -measurable random variable.
Let and solutions to equation (20) driven by and respectively, with the same initial condition .
If , endowed with the norm defined in (16), we have
[TABLE]
Moreover, there exists and such that , and
[TABLE]
where , is defined in (36) and is a universal constant.
Before proceeding with the proof, we need the next two technical lemmas.
Lemma 4.18**.**
Under the same assumptions of Proposition 4.17, for any , there exists , and , such that and
[TABLE]
Proof.
By applying Corollary 3.12, (50) and (49), we see that there exists and such that -a.s.,
[TABLE]
Taking the norm on both sides we conclude the proof, thanks to Theorem 4.12, which gives
[TABLE]
where is a universal constant. ∎
Lemma 4.19**.**
Under the same assumptions of Proposition 4.17, for any , there exists and , such that and, for all ,
[TABLE]
Proof.
Let where is the constant given in Proposition 3.11. Then, implies,
[TABLE]
and we notice that for some which follows from Lemma 4.8 and the Gaussian integrability of , Theorem 4.12.
We split up which gives
[TABLE]
For the first term above we use the crude (in time) bound
[TABLE]
The result follows from Corollary 3.12, (29) and Theorem 4.12. ∎
Proof of Proposition 4.17.
We write
[TABLE]
We start from the first term on the right hand side of (57),
[TABLE]
We have, as an application of Hölder inequality, for ,
[TABLE]
where, in the last inequality we used Lemma 4.3. Similarly, using Lemma 4.3 and 4.4, we can bound the remaining terms,
[TABLE]
Summing up the previous inequalities, we get
[TABLE]
The second term in (57) is bounded as follows using Lemmas 4.19 and 4.15,
[TABLE]
The third term in (57) is
[TABLE]
We estimate the first term in the right hand side using Lemma 4.5,
[TABLE]
Similarly, using Lemma 4.5 and 4.6,
[TABLE]
We estimate the last term in (58) using equation (44) and Lemma 4.6,
[TABLE]
Thus, there exists (which may increase from a line to the next) such that the remainder satisfies, for all ,
[TABLE]
In the last inequality we used Lemma 3.8 combined with Lemma 4.8, and also . We check now the Gubinelli derivative, for each we have
[TABLE]
Similarly as for the remainder, we obtain the following,
[TABLE]
We conclude by using Lemma 4.18 to estimate . ∎
5 Linear Rough PDE
Let be fixed and let , for . Let and satisfy Assumptions 4.2, for large enough. In this section we prove well-posedness of measure-valued solutions to linear rough partial differential equations, which are formally given as
[TABLE]
To rigorously define the meaning of a solution to equation (59), we take a slightly more general approach, as described below.
Assumptions 5.1**.**
Let and .
- (i)
Let be a measurable path such that for all and . 2. (ii)
Let be a geometric rough path, as described in Section 2.
The examples we have in mind are and , as described in Proposition 5.5. In order to describe the main ideas, we argue now on a formal level assuming smoothness in time of ; rigorous definitions in the rough path case will be given later in the section. We study uniqueness of solutions to the following linear equation
[TABLE]
The proof is based on a backward duality trick; suppose we can show existence of a sufficiently regular solution to the backward PDE
[TABLE]
for a given final condition , then at least formally we have
[TABLE]
which shows that . Now, if is chosen in a class of functions large enough to fully determine , we see that it will be fully determined by and , thus showing uniqueness.
For simplicity only, we write equation (61) on divergence form and as a forward equation as follows
[TABLE]
which can be seen to be equivalent to (61) by replacing by in (63) and then reversing time, i.e. .
The strategy to prove existence of a smooth solution to (63) is as follows. We first show how to give an intrinsic notion of solution of (60) and (63) in the context of the so-called unbounded rough drivers, see [2]. We then replace by smooth vector fields, in which case it is well know that there exists a unique solution of (63) which is smooth provided the coefficients are. We then consider the vector of derivatives and show that satisfies a vector valued equation, for which we can find bounds independent of . The equation for will be solved in the space , thus giving bounds on in the Sobolev-space .
Second, we approximate by a sequence of smooth vector fields and show that the corresponding sequence of solutions converge to a meaningful solution of (63). Since the solution is in we can use Sobolev embedding [4, Corollary 9.13] to show the needed spatial regularity to justify the computations in (62).
The techniques used to prove the first step are motivated by [2] and [11], and the main technical tool is the a priori estimate found in [11].
5.1 Unbounded rough drivers
We start by rephrasing (63) in terms of so called unbounded rough drivers. The main motivation for doing so is the a priori estimate from [11].
Assume that is a smooth path, then equation (63) is well defined as a PDE. Integrating (63) from to we obtain
[TABLE]
Iterating the equation into itself we obtain
[TABLE]
where at least formally,
[TABLE]
and
[TABLE]
By the usual power counting the remainder term should be regular in time, but we notice that in general it is a distribution in space. Following [2] we call a scale of spaces a quadruple of Banach spaces such that is continuously embedded into . Let be the topological dual of (in general, ).
Definition 5.2**.**
An unbounded -rough driver on the scale , is a pair of mappings on such that
[TABLE]
and Chen’s relation is satisfied,
[TABLE]
We shall write for the smallest constant dominating the bounds in (66).
We show how to construct an unbounded rough driver given a rough path.
Proposition 5.3**.**
Let and satisfy Assumption 4.2 (ii). Define for
[TABLE]
where is the linear extension of the map defined on the algebraic tensor as
[TABLE]
Then is an unbounded rough driver on both scales , , and . Moreover, the mapping is continuous in the operator norm.
Proof.
Let . By Chen’s relation for rough paths (10), and (68)
[TABLE]
which gives
[TABLE]
Continuity of the mapping follows immediately from the continuity of . ∎
We notice that there is no zero order term in the above unbounded rough driver. We include such a term by considering a rough path , i.e. with an additional spatial variable. Then, for let
[TABLE]
where we make the convention that summation over repeated indexes are over , i.e. excluding [math].
With this in hand we can define the notion of a solution of (60).
Definition 5.4**.**
A path is a solution to (60) if for all the mapping defined by
[TABLE]
satisfies . Above is the unbounded rough driver constructed from as in Proposition 5.3.
We see now that, in the special case when and , existence of solutions follows from the results of Sections 3 and 4.
Proposition 5.5**.**
Let and let be a probability space that supports a -dimensional Brownian motion and an -measurable random variable, such that the push-forward measure . Let be a a weakly geometric rough path. Under Assumption 4.2, we have
- (i)
, generated by the rough path as in Proposition 5.3, is an unbounded rough driver as in Definition 5.2. 2. (ii)
There exists a solution of (60) driven by , in the sense of Definition 5.4. This solution is given by , where, for -a.e. , is the unique solution to equation (20) with initial condition , driven by the random rough driver constructed in Lemma 4.8.
Proof.
From Sobolev embedding theorem [4, Corollary 9.13] , we have . Thus, using the construction (11), we have that is a rough path over . The first claim follows now by Proposition 5.3.
We prove now the second claim. It follows from Proposition 4.13 that the stochastic process is adapted. We can thus define and denote by the induced time-marginals. From Itô’s formula, Proposition 4.14, we get
[TABLE]
The proof is complete once we show that has an expansion in terms of the unbounded rough driver. Recall that we get from Lemma 4.15, we have
[TABLE]
and this gives, using the sewing lemma 2.1,
[TABLE]
Regrouping the terms we can write
[TABLE]
By definition of we get
[TABLE]
which gives
[TABLE]
Moreover
[TABLE]
This shows that we may rewrite the equation for as
[TABLE]
where is a remainder. ∎
5.2 A priori estimates for smooth vector fields
For this section we consider an approximation of equation (64), driven by a smooth (in time) driver,
[TABLE]
where is smooth. We will find bounds on in depending only on a canonical unbounded rough driver generated by . The first step towards this goal is to write and all the derivatives as a vector in an space.
Let denote the (smooth) solution of (70) and let denote the vector of gradients as taking values in the truncated tensor algebra . We will simply write for the 1-contractive product
[TABLE]
e.g. for a and the product has component given by .
Using Leibniz formula we have
[TABLE]
where is given by
[TABLE]
We notice that the above sum is in since we are doing a contractive product of and .
For each we have
[TABLE]
This gives that satisfies the -valued equation
[TABLE]
where we have set
[TABLE]
Remark 5.6**.**
We notice that if we replace above by where converges to a rough path , then the corresponding coefficients , have canonical rough path lifts, and , with values in which remain bounded uniformly in . This comes from the fact that there are canonical iterated integrals between the -valued paths and ,
[TABLE]
where the first term is simply the Riemann-integral and the second term is defined using integration by parts as before.
Given the previous construction, we consider now a system of equations. We remark that this is not just a vector valued version of the results found in [18], since we are not interested in energy estimates. Indeed, the matrix is allowed to be degenerate but we require spatial smoothness. We consider the equation
[TABLE]
for given functions and smooth in time, and a given initial condition . The solution is a vector valued function , and the coefficients are on the form
[TABLE]
We will assume that is diagonal in (73), so component reads
[TABLE]
We begin with our main a priori estimate.
Proposition 5.7**.**
Assume is a solution of (73). Then there exists a constant such that
[TABLE]
where is an unbounded rough driver depending only on the rough path lift of the path .
Proof.
The finite-dimensional tensor then satisfies
[TABLE]
where
[TABLE]
both belongs to the space . Define now the unbounded rough driver
[TABLE]
and the drift
[TABLE]
for functions . This gives the dynamics
[TABLE]
on the scale . Let and write
[TABLE]
which shows that has bounded variation in .
Now, by the a priori bounds, [11, Theorem 2.9], we get
[TABLE]
where depends on . Testing against the identity matrix and using that is positive semi-definite we get
[TABLE]
Note that . Indeed, write and use the Cauchy-Schwarz inequality
[TABLE]
Summing over and gives that the above is bounded by . Integrating w.r.t. we get the claim.
If we choose such that we get
[TABLE]
From the rough Gronwall lemma, [11, Lemma 2.11], the first bound of (75) holds.
For the second inequality we notice that the evolution of on reads
[TABLE]
where and we have defined the unbounded rough driver
[TABLE]
Since the operator is self-adjoint it is easy to bound the variation of in ;
[TABLE]
This gives, using [11, Theorem 2.9],
[TABLE]
where depends on and . Take now a mollifier and decompose for any and any test function . This gives
[TABLE]
and for the smooth part we use the equation (77) to get
[TABLE]
Choosing we get the second inequality in (75). ∎
5.3 Existence of a smooth solution
With the previous a priori estimates at hand, we are ready to prove existence of a solution.
Theorem 5.8**.**
Let Assumption 5.1 hold for and let be given. Then there exists a solution to (63) which belongs to and
[TABLE]
holds in in the sense that , where is the unbounded rough driver constructed from as in Proposition 5.3.
Proof.
Denote by the solution of (63) when is replaced by , which we write
[TABLE]
Setting and choosing large (in fact ) we see that (71) is on the form (73) where and are defined from using (72). We then build the unbounded rough driver and from and according to (76) and (78) respectively.
By the assumptions on , and we get
[TABLE]
for some constant . For , define by and notice
[TABLE]
Since and are dual w.r.t. to the inner product on , we get . By similar reasoning we get using (79).
Since lies in a bounded set of , by Arzelà-Ascoli there exists a subsequence converging in some element . Here denotes equipped with the weak topology. Choosing now and using Sobolev embedding [4, Corollary 9.13] we get that is bounded in and .
It is straightforward to take the limit in (81) and use the uniform bounds on to obtain (80). ∎
5.4 Uniqueness
Theorem 5.9**.**
Let Assumption 5.1 hold for . Then solutions of (60) are unique.
Proof.
Let be a solution to (60), i.e. for all we have
[TABLE]
where and is the unbounded rough driver constructed from . Let be the solution of the backward equation (61) with final condition so that
[TABLE]
holds in . We then have
[TABLE]
where we have defined
[TABLE]
and we have used that the path is geometric which gives . Using the equations for and we get and . Using this and analyzing every term in (82) we see that
[TABLE]
and in particular . If is any other solution with the same initial condition, the same analysis gives which gives that . Since was arbitrary the result follows. ∎
6 The McKean-Vlasov equation
Let be fixed. Let be a complete filtered probability space and be a -dimensional Wiener process on it. Let be an -measurable random variable. Let , for . Moreover let and .
In this section we prove well-posedness of the equation
[TABLE]
We start by defining the notion of solution we shall use.
Definition 6.1**.**
Let and . We say that an -adapted stochastic process is a solution to equation (83) with initial condition , if
- (i)
* is such that*
[TABLE]
and defined from and as in Lemma 4.8 is a rough driver in the sense of Definition (3.1). 2. (ii)
-almost surely, satisfies
[TABLE]
in the sense of Definition 3.4.
Before proceeding we state the assumptions that will be in force throughout the section.
Assumptions 6.2**.**
Let and ,
- (i)
We assume . 2. (ii)
Let be a measurable function, such that there exists a constant , with
[TABLE]
We now introduce a suitable space of measures in which will be useful for proving well-posedness of (83). The set up is reminiscent of the controlled space as introduced in [17], but tailored for measures on path spaces.
Definition 6.3**.**
Let . We say that a pair is controlled by provided for every we have that
[TABLE]
Here we used the notation
[TABLE]
For , we denote by the set of all such controlled pairs equipped with the metric
[TABLE]
Remark 6.4**.**
We note that in Definition 6.1 (i) the law, , of the solution is only defined for the time-marginals, and a priori it is not clear how to construct from this a measure on the path space . However, since satisfies the equation in Definition 6.1 (ii), is a random variable in , and letting in (25) and (49) we see that takes values in . Hence it induces the measure on which clearly has time-marginals .
Remark 6.5**.**
Let and satisfy Assumption 6.2, with , and let . Then, and satisfy Assumption 4.2. Assumption 4.2 (i) is verified by replacing , for , in Definition 6.3. Assumption 4.2 (ii) follows trivially by the boundedness in Assumption 4.2 (ii). We are only left with verifying 4.2 (iii). For all ,
[TABLE]
This gives that , if .
Theorem 6.6**.**
Suppose and satisfies Assumption 6.2 and . For any there exists a unique solution of (83) in the sense of Definition 6.1.
Proof.
We fix satisfying Assumptions 6.2 and construct the following mappings
[TABLE]
and we shall use the notation . By letting in (25) and (49) we see that is supported on . In Lemma 6.7 and Lemma 6.8 we show that is a contraction mapping on a subset of for a small time parameter . Then, noting that does not depend on the initial condition , the solution can be constructed iteratively on the full time interval by concatenation of the solutions defined on , etc. ∎
Lemma 6.7**.**
Define
[TABLE]
and the closed subset of ,
[TABLE]
Assume Assumption 6.2 with . There exists a small time , such that leaves invariant.
Proof.
We start by looking at the controlled function,
[TABLE]
To show the bounds on the rough driver, start by noting that, by linearity,
[TABLE]
and thanks to (84), This gives that for , we have , where is defined in (36). The previous observation and (49) imply
[TABLE]
for any and for any and for a random variable . From the a priori estimates (26) we see that there exists a constant , depending only on (which may change from an inequality to the next), such that
[TABLE]
We may now choose such that
[TABLE]
From Lemma 4.16 we get,
[TABLE]
and we choose such that the above is bounded by . This shows that
[TABLE]
This, together with (87) implies . ∎
Lemma 6.8**.**
Assume Assumption 6.2 with . There exists a constant and a small time , such that, for all , we have
[TABLE]
Proof.
Let be defined as in Lemma 4.17. We have seen in the proof of Lemma 6.7 that, for , we have . Moreover, from (84), we have , for . Hence , for some universal constant . We estimate the Wasserstein distance of the image laws, as given in (85). From Lemma 4.17, there exists and , such that and
[TABLE]
We study now the Gubinelli derivative. For all , we have
[TABLE]
Hence, using and ,
[TABLE]
For the last term in the definition of the metric , we have, using Proposition 4.17 and proceeding as in (89)
[TABLE]
We now add together (89), (90), and (91) to obtain
[TABLE]
Choosing small enough, depending on , we conclude the proof. ∎
7 Non local rough PDEs
Let be fixed. Let , for . Moreover let and . Let and satisfy Assumption 6.2.
We turn to the Fokker-Planck equation induced by the rough diffusion, which formally reads
[TABLE]
We define the notion of a solution in a similar way as in the linear case, Definition 5.4, but where now the unbounded rough driver depends on the solution itself.
Definition 7.1**.**
We say that a path is a solution of (92) with initial condition provided
- (i)
for all ,
[TABLE] 2. (ii)
* satisfies (69) with the unbounded rough driver defined from*
[TABLE]
as in Proposition 5.3, and .
Existence of a solution to (92) is relatively straightforward.
Theorem 7.2**.**
Suppose and satisfies Assumptions 6.2, for and for . Let be a complete probability space that supports a -dimensional Brownian motion and an -measurable random variable, such that the push-forward measure . Then, there exists a solution of (92), in the sense of Definition 7.1. This solution is given by , where is the unique solution to the McKean-Vlasov equation (83) with initial condition , in the sense of Definition 6.1.
Proof.
The proof is completed by following the same steps as in Proposition 5.5 except the unbounded rough driver depends on the solution itself. ∎
The following result will be crucial for proving uniqueness of the non-local Fokker-Planck equation.
Proposition 7.3**.**
Let , and is weakly geometric. Define for and ,
[TABLE]
Then .
Proof.
We prove this result in two steps. First we show that the controlled path can be continuously approximated by controlled paths which takes values in a finite-dimensional space. This clearly gives that can be approximated by a sequence of finite dimensional rough paths. In the second step we use that the finite dimensional rough path is weakly geometric to find a smooth approximation of .
Step 1. For simplicity we only show this for , the general case follows by replacing by for . Let be an orthonormal basis of and define
[TABLE]
We now show that in for any .
Start with the first component.
[TABLE]
Now for fixed , and every we have the monotone convergence
[TABLE]
as since . Moreover, for fixed , as a function of and the above is continuous. By Dini’s theorem we get
[TABLE]
as . This gives
[TABLE]
by monotone convergence. In a similar way one can show that converges to in .
To see the convergence of the remainder, , we note first that this term is obviously bounded in . Furthermore, writing
[TABLE]
Using Dini’s theorem and monotone convergence as before we get that for any there exists such that for all we have .
This gives, uniformly in
[TABLE]
where we have used the geometric interpolation for any . By choosing correctly we get in .
Step 2. We now proceed to prove that can be approximated by a smooth path. Let . From the above continuity we can choose such that
[TABLE]
where is constructed by replacing with in (93).
As spelled out in Lemma A.3, there exists and a smooth path such that . This gives
[TABLE]
∎
Theorem 7.4**.**
Suppose , satisfies Assumptions 6.2 for and is given with . Then there exists at most one solution of (92) in the sense of Definition 7.1.
Proof.
Let be a solution of (92). From the the assumptions on and we may construct the time-dependent coefficients from which we construct the rough driver as in Lemma 4.8. Denote by the solution of
[TABLE]
i.e. . From Proposition 4.14 we see that satisfies
[TABLE]
as in Definition 7.1, where and . From the assumption on , the Sobolev embedding [4, Corollary 9.13] for and Proposition 7.3 we see that . Now if , we get from Theorem 5.9 that there exists at most one solution of (94). In particular, we see that which gives that is a solution of (83). Since this equation is well-posed, this uniquely describes . ∎
Appendix A Appendix
A.1 Kolmogorov continuity theorem
In this section we prove a Kolmogorov continuity type theorem for rough drivers. The proof is done exactly as in [15, Theorem 3.1], so we only sketch the proof to convince the reader that the steps are the same.
Theorem A.1**.**
Suppose is a random rough driver such that
[TABLE]
for and such that . Then for every we have
[TABLE]
and if then is rough driver for .
Proof.
Take for simplicity and denote by the uniform partition of with mesh and let
[TABLE]
By assumption on we get
[TABLE]
Let and choose such that . There exists a partition of such that for some , and for each fixed such there are at most two such intervals from . We get
[TABLE]
and using , which is easily seen from Chen’s relation, we get
[TABLE]
This gives
[TABLE]
where
[TABLE]
which belongs to and respectively. This proves the claim. ∎
A.2 Weakly geometric rough paths
We prove that rough path integration w.r.t. a weakly geometric rough path yields a weakly geometric rough path.
Lemma A.2**.**
Assume is weakly geometric and is a separable Hilbert space and . Then the rough path defined by
[TABLE]
is also weakly geometric.
Proof.
Let be an orthonormal basis of and use the component notation
[TABLE]
The components of the integrals may thus be spelled out
[TABLE]
where the above are scalar integrals defined by their local expansions
[TABLE]
respectively. Since and by definition of we get
[TABLE]
which gives
[TABLE]
Now, since is weakly geometric we have
[TABLE]
which gives
[TABLE]
It is straightforward to check that the above left hand side is the increment from to of the function . Since we get that this function is constant and equal to 0. ∎
In the next lemma we show how to construct the approximation in Proposition 7.3.
Lemma A.3**.**
Fix , and let be a weakly geometric rough path. Moreover, for , and , let be an orthonormal basis and , for . Let and construct as in (93). Then, for every there exists such that
[TABLE]
Proof.
We take an orthonormal basis of and, for , we define , where satisfy the relation
[TABLE]
Let be the finite dimensional vector space defined as
[TABLE]
We note that . On this space we construct a rough path as follows, for ,
[TABLE]
Here and in the following we always assume that the triples and satisfy relation (95). Moreover, we always use the convention that we are summing over repeated indices, in this case . It is immediate to see that .
We prove now that is geometric, i.e. that the following relation holds
[TABLE]
Let us look more in detail what the tensor product on the right hand side is, for ,
[TABLE]
Each of these terms is a tensor product which is mostly zero. Let us now describe each component of (96). We start by introducing the indexes
[TABLE]
We assume from now that the couple and always assume the previous relation. We obtain
[TABLE]
Similarly, we see that
[TABLE]
The symmetry condition reduces to verify the scalar equality
[TABLE]
which is satisfied thanks to Lemma A.2.
The rough path is thus in . Since is a finite dimensional space, we can find a smooth approximation in , for some . Hence, since , this is also an approximation in . ∎
A.3 A separable subspace of the Hölder space
Proposition A.4**.**
The space is equal to the closure of with respect to the -topology. In particular, is separable if is separable.
Proof.
For simplicity we assume . We clearly have so that , which shows one inclusion by taking the closure.
To see the reversed inclusion, we take , a standard mollifier and let . Then is smooth and we get for
[TABLE]
so that . Let us show that converges uniformly to .
[TABLE]
which converges to 0 uniformly in .
Now, write
[TABLE]
which gives
[TABLE]
By assumption on , letting gives that in . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] I. Bailleul, R. Catellier, and F. Delarue. Mean field rough differential equations. ar Xiv preprint ar Xiv:1802.05882 , 2018.
- 2[2] I. Bailleul and M. Gubinelli. Unbounded rough drivers. Annales de la Faculté des Sciences de Toulouse. Mathématiques. , 26(4):795–830, 2017.
- 3[3] I. Bailleul and S. Riedel. Rough flows. To appear in Journal of the Mathematical Society of Japan, ar Xiv preprint: https://arxiv.org/abs/1505.01692 , 2018.
- 4[4] H. Brezis. Functional analysis, Sobolev spaces and partial differential equations . Universitext. Springer, New York, 2011.
- 5[5] R. Carmona, F. Delarue, and D. Lacker. Mean field games with common noise. Ann. Probab. , 44(6):3740–3803, 11 2016.
- 6[6] T. Cass, C. Litterer, and T. Lyons. Integrability and tail estimates for gaussian rough differential equations. Ann. Probab. , 41(4):3026–3050, 07 2013.
- 7[7] T. Cass and T. Lyons. Evolving communities with individual preferences. Proc. Lond. Math. Soc. (3) , 110(1):83–107, 2015.
- 8[8] M. Coghi and B. Gess. Stochastic nonlinear fokker-planck equations. ar Xiv preprint ar Xiv:1904.07894 , 2019.
