Higher order regularity of nonlinear Fokker-Planck PDEs with respect to the measure component
Alvin Tse

TL;DR
This paper derives a general formula for higher order derivatives of functionals on the Wasserstein space, applied to solutions of nonlinear Fokker-Planck PDEs, with implications for mean-field games and propagation of chaos.
Contribution
It introduces a new formula for higher order derivatives of functionals composed with Fokker-Planck PDE solutions, advancing the understanding of measure-dependent PDEs.
Findings
Derived a general formula for higher order linear functional derivatives.
Established connections with propagation of chaos and mean-field game theory.
Provides tools for analyzing regularity of nonlinear measure-valued PDEs.
Abstract
In this article, we establish a general formula for higher order linear functional derivatives for the composition of an arbitrary smooth functional on the 1-Wasserstein space with the solution of a Fokker-Planck PDE. This formula has important links with the theory of propagation of chaos and mean-field games.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Higher order regularity of nonlinear Fokker-Planck PDEs with respect to the measure component
Alvin Tse This research benefited from the support of the “Chaire Risques Financiers”, Fondation du Risque.
Corresponding e-mail: [email protected]
Université Paris-Est, Cermics (ENPC), INRIA, F-77455 Marne-la-Vallée, France
Abstract
In this article, we establish a general formula for higher order linear functional derivatives for the composition of an arbitrary smooth functional on the 1-Wasserstein space with the solution of a Fokker-Planck PDE. This formula has important links with the theory of propagation of chaos and mean-field games.
Résumé
Dans cet article, nous établissons une formule générale pour les dérivées fonctionnelles linéaires d’ordre supérieur pour la composition d’une fonctionnelle régulière arbitraire sur l’espace 1-Wasserstein avec la solution d’une EDP de Fokker-Planck. Cette formule a des liens importants avec la théorie de la propagation du chaos et des jeux à champ moyen.
**Keywords: Fokker-Planck PDEs, Linear functional derivatives, Propagation of chaos
**
2010 AMS subject classifications: 35R06, 60H30, 65C35
1 Introduction
Let denote the 1-Wasserstein space of probability measures on , where denotes the -dimensional torus. In this paper, we consider nonlinear Fokker-Planck PDEs of the form
[TABLE]
for some function and probability measure . This type of equations has been a rich area of research in the last decades. The case in which does not depend on has been treated in most classical works, such as Chapter 6 of [3]. In [1], this type of equations is considered to construct weak solutions to a class of distribution-dependent SDEs. The case corresponding to probability measures on the path space is considered in [19].
Let be a continuous function (w.r.t. the topology of ). This paper explores the smoothness w.r.t. the measure component for function defined by
[TABLE]
under sufficient regularity of and . The notion of smoothness that we consider, i.e. the linear functional derivative, is widely adopted in the literature of McKean-Vlasov equations and mean-field games, such as [8], [9] and [12]. A continuous function (w.r.t. the product topology of ) is said to be the linear functional derivative of , if for any ,
[TABLE]
We then introduce higher-order derivatives through iteration: for any and ,
[TABLE]
provided that the -th order derivative is well defined. These derivatives are defined up to an additive constant via (1.3) and (1.4). They are normalised by the convention
[TABLE]
The main result of this paper is Theorem 4.5. The definitions of the assumptions are found in Section 1.4.2. The definitions of the higher-order Kolmogorov equations and the multi-indices can be found in (3.4) and Definitions 4.1- 4.3 respectively.
Theorem (Main result)****.
Let . Assume (Int--()), (Lip--()), (TLip--()) and (TReg--()). Then exists and is given by
[TABLE]
In particular, if we also assume (TInt--()), then
[TABLE]
1.1 Links of the main result with the theory of quantitative propagation of chaos
This result has intricate links with the theory of McKean-Vlasov stochastic differential equations (MVSDEs) and mean-field optimal control. Let us consider a probability space equipped with a -dimensional Brownian motion . Denoting the law of random variable by , we consider a -dimensional MVSDE given by
[TABLE]
Lipschitz condition on ensures uniqueness of the solution to (1.7) ([30]) and it can be easily checked that in this case
[TABLE]
MVSDEs provide a probabilistic representation to the solutions of a class of nonlinear PDEs. A particular example of such nonlinear PDEs was first studied by McKean ([27]). These equations describe the limiting behaviour of an individual particle evolving within a large system of particles undergoing diffusive motion and interacting in a ‘mean-field’ sense, as the population size grows to infinity. More precisely, we consider the following system of particles,
[TABLE]
where , are independent -dimensional Brownian motions and , are i.i.d. random variables with the same distribution as . A particular characteristic of the limiting behaviour of the system, is that any finite subset of particles becomes asymptotically independent of each other. This phenomenon is known as propagation of chaos. We refer the reader to [17, 28, 30] for the classical results in this direction and to [6, 15, 21, 24, 29] for an account (non-exhaustive) of recent results. Nonetheless, most results are only qualitative and do not give us a rate of convergence.
For deterministic , it is shown in [11] that under sufficient regularity of and , the weak error between the particle system (1.8) and its mean-field limit (1.7) is given by
[TABLE]
(A more complicated formula is also given in [11] for non-deterministic initial conditions.) To obtain a full expansion of the form
[TABLE]
for some positive constants that do not depend on , one would even need to consider higher order linear derivatives (see [11]).
Note that in most practical applications, the test function being considered is linear, therefore its linear derivatives have simple closed-form formulae. In this case, the advantage of (LABEL:k_order_full_formula) is that it expresses completely in terms of higher order Kolmogorov equations , which are intrinsically Cauchy problems.
Despite being out of the scope of this paper, we remark that it is not difficult to compute the expression for
[TABLE]
by perturbing each of the measures in . This is much simpler than the linearisation procedure performed in this paper, where we perturb measure , which is more cumbersome and technical. Through more sophisticated techniques of global Schauder estimates, it should even be possible to obtain a control of (1.10) that decays over time , which allows us to obtain a uniform estimate of propagation of chaos in , by (1.9). This is a closely related research direction.
1.2 Main method of proof in this paper
The main idea of proof comes from [8], based on their idea of ‘linearising’ a forward-backward mean-field game system by perturbating the measure component. Our strategy follows a similar argument as Proposition 3.4.3 and Corollary 3.4.4 in [8].
To explore regularity of (1.1) along the measure component, we perturb probability measure along direction . Take any smooth test function . We have
[TABLE]
We define
[TABLE]
in the sense of distributions. Then one should expect that
[TABLE]
(In particular, for the linear case when , we have
[TABLE]
which is a consequence of the definition of the linear functional derivative.) Applying (1.12) to (1.11), by differentiating (1.11) w.r.t. at [math], we have
[TABLE]
Note that, in the distribution sense, (1.13) can be rewritten as the linearised forward Kolmogorov equation
[TABLE]
This is what we expect by differentiating (1.1) formally in . To show that this is indeed the case, we consider the difference to prove differentiability of with respect to the measure.
We adopt the approach of Schauder theory and most of the results follow from Theorem 2.2, which is a fundamental result of Schauder estimates on the viscous transport equation. Based on Schauder theory, it is shown in Theorem 2.6 that there exists some constant such that
[TABLE]
under the assumptions (Int--()), (Lip--()), (TLip--()) and (TReg--()), where . Therefore, we can show that
[TABLE]
Nonetheless, to show that indeed has a linear functional derivative, we need to express the integral on the right hand side in terms of the signed measure . Here is where probability theory comes into action. For every and , we consider the decoupled process defined by
[TABLE]
For every and , we define a function such that
[TABLE]
which satisfies the backward Kolmogorov equation
[TABLE]
Note that
[TABLE]
and therefore 111Note that if the law of is equal to the law of , then the law of is also equal to the law of . Therefore, if we are only interested in the law of the process , where is distributed as , then it is proper to adopt the notation .
[TABLE]
By linearisating with respect to in the same way as (1.11) and (1.12), we obtain that
[TABLE]
Consequently, by replacing by , we can deduce from (1.12) the existence of the first order linear derivative of . We repeat the same procedure for higher order linear derivatives of . It is precisely this combination of forward and backward equations that allows us to prove existence of the linear derivatives of .
1.3 Comparison with other approaches in the literature
There are various alternative methods for establishing smoothness of functions of the form (1.2) in the literature, all of which are probabilistic.
The method of Malliavin calculus is adopted in [12]. That paper proves smoothness of , for being in the form
[TABLE]
where is infinitely differentiable with bounded partial derivatives.
The method of parametrix is considered in [13]. We represent in terms of the transition density of (defined above in (1.15)). This method is applied to the case in which and are of the form
[TABLE]
for some functions , and . Nonetheless, it is not clear whether this method can be applied to and with more general forms.
Finally, a ‘variational’ approach is adopted in [7]. The core idea is to prove smoothness of by viewing the lift of (i.e. the map ) as a composition of the map and the lift of (i.e. the map ). In [7], the smoothness of is proven up to the second order, under fairly general conditions on and .
1.4 Notations and main assumptions
1.4.1 Notations
The scalar product between two vectors is denoted by . denotes the space of integrable probability measures and denotes the 1-Wasserstein distance, defined by
[TABLE]
where denotes the set of couplings between and , i.e. all measures on such that and for every .
To write the norms of a Sobolev space and its dual, we use the notations
[TABLE]
Moreover, for dual elements with their arguments, we use the notation
[TABLE]
Denoting , for any and , we use the notation
[TABLE]
denotes, for , the space of measurable functions with spatial generalized derivatives up to order that all belong to . We define
[TABLE]
For functions such that each component function belongs to , we write with
[TABLE]
For any signed measures , we write to denote
[TABLE]
if this iterated integral is well-defined.
Unless otherwise specified, is a constant that only depends on , , , and , whose value varies from line to line.
1.4.2 Main assumptions
Throughout this work, we work with the following assumptions on and . (Int--()) denotes the condition that, for each , ,
[TABLE]
(Lip--()) denotes the condition that, for each , ,
[TABLE]
where
[TABLE]
For the test function , we shall impose the following assumptions. (TLip--()) denotes the condition that, for each ,
[TABLE]
where
[TABLE]
(TReg--()) denotes the condition that, for each and ,
[TABLE]
Finally, (TInt--()) denotes the integrability condition that, for each ,
[TABLE]
1.5 Practical examples of our model
We now give a result of a class of drift terms and test functions that satisfies the above assumptions, followed by practical examples of our model.
Theorem 1.1**.**
Let . Suppose that for each , belongs to and that belongs to . We then define functions and by
[TABLE]
Then satisfies (Int--()) and (Lip--()). Moreover, satisfies (TLip--()), (TReg--()) and (TInt--()).
Proof.
Let be arbitrary. Let
[TABLE]
It can be shown easily by the definition of linear functional derivatives (along with the condition of normalisation) that
[TABLE]
It can be easily checked that
[TABLE]
[TABLE]
and
[TABLE]
Moreover, by the Kantorovich Rubinstein duality (see Remark 6.5 in [31]),
[TABLE]
Similarly,
[TABLE]
This allows us to show that
[TABLE]
[TABLE]
[TABLE]
These calculations show that and satisfy the aforementioned regularity properties in the theorem. Note that is arbitrary in , since the dependence on measure is linear for functions and . ∎
Example 1.2** (Kuramoto model).**
The Kuramoto model is used to describe the behaviour of synchronization for a large set of coupled oscillators and is defined in dimension (see, e.g., [2]):
[TABLE]
Example 1.3** (Aggregation models).**
Aggregation models are commonly used in the analysis of mean-field models in biology, ecology, for space homogeneous granular media (see [4, 5, 10, 20, 26]). In such models, the drift term typically takes the form
[TABLE]
for some smooth functions . According to Theorem 1.1, our analysis would be applicable to functions , where .
2 Regularity of first order linear derivative in measure of
2.1 Analysis of the forward Kolmogorov equation
The first step in the analysis of PDEs is the regularity of . The following result concerns regularity of (1.1) and is standard in the literature.
Lemma 2.1**.**
Suppose that is jointly Lipschitz continuous in the space and measure variables w.r.t. the Euclidean and metrics. Then (1.1) has a unique solution and satisfies
[TABLE]
for some constant .
Proof.
The fact that (1.1) has a unique solution follows from the strong uniqueness of (1.7), by Theorem 1.1 of [30]. The estimate follows from the proof of Lemma 3.1 in [7]. ∎
The following result is a modified version of Proposition 3.4.3 in [8] from Hölder spaces to Sobolev spaces.
Proposition 2.2**.**
Let , and . Then, for any , the Cauchy problem
[TABLE]
has a unique solution in the following space:
[TABLE]
where is the space of real-valued functions (on ) that are continuous in time and space, differentiable in space, and the derivative of which is continuous in time and space, and where is the space of functions such that , , and belong to . The unique solution satisfies
[TABLE]
where only depends on .
Proof.
First Step. We start with uniqueness. Uniqueness of a solution (in ) is a trivial consequence of the solvability of the SDE:
[TABLE]
and, then, of Itô-Krylov’s formula (see for instance [22]), which guarantees that
[TABLE]
Notice that Itô’s formula does not suffice since the solution may just have first order -derivative and second order -derivatives in . Obviously, Itô-Krylov’s formula here applies because of the non-degeneracy of the noise.
Second Step. Existence of a solution with generalized second order derivatives is a well known fact in the literature. The main reference is the monograph of Ladyzenskaja et al., [25]; a more precise application of the results of [25] to our setting may be found in [14], see Theorem 2.1 therein. The latter says that existence of a solution hold in the space defined in the statement.
Third Step. Now, the main point is to prove that the solution satisfies the required bounds. By mollifying the coefficients and in space (using a standard convolution argument), we may easily assume that the coefficients and are smooth in space, and that their derivatives up the order satisfy the same Lipschitz bounds as the original coefficients. If we can prove that the solution associated with the equation with mollified coefficients satisfies the inequality announced in the statement, with a constant therein that remains uniform along the mollification, then we are done: it suffices to observe that the solution associated with the mollified equation converges (as the mollification parameter tends to [math]) to the original by passing to the limit along the stochastic representation (based upon (2.3)–(2.4)).
So, from now on, we assume that the coefficients and are smooth in space, and that their derivatives up the order satisfy the same Lipschitz bounds as the original coefficients. The key point is then to observe that we can differentiate with respect to in the representation formula (2.4), since the solution to (2.3) generates a smooth flow (see for instance [23]). As a by-product, we deduce that, for any ,
[TABLE]
Obviously, the bound of the above left-hand side depends on the (additional) smoothness of and . Now, by expanding by means of Itô’s formula, we get that
[TABLE]
where is the standard heat kernel. We know from Theorem 11 in [16, Chapter 1] that there exists a bounded density on the torus such that, for any with ,
[TABLE]
for a constant only depending on the bound of . Taking the derivative with respect to ,
[TABLE]
where the constant depends on (and is allowed to vary from line to line). By a standard variant of Gronwall’s lemma (see for instance [18, Lemma 7.1.1 and Exercise 1]), we get
[TABLE]
which is exactly the announced result when . Differentiating twice (2.5) (hence differentiating twice the heat kernel in the right-hand side), performing an integration by parts in the resulting second and third terms in the right-hand side of (2.5) and eventually plugging the above bound in the resulting formula, we then get
[TABLE]
where now depends on , and in turn
[TABLE]
which yields, by the same variant of Gronwall’s lemma,
[TABLE]
This is exactly the desired result when .
Now, we can iterate by induction, assuming that the result holds true for a given . It suffices to take derivatives (in ) in the left-hand side of (2.5) and then to use an integration by parts to pass derivatives from the heat kernel onto and in the resulting second and third terms in the right-hand side of (2.5). We then get
[TABLE]
where now depends on . Plugging the bound, we have, for (as given by the induction assumption), we get
[TABLE]
and, then, refined Gronwall’s lemma applies as before. ∎
The core analysis of forward Kolmogorov equations depends heavily on the following fact. The main ideas of the proof follow from the proof of Lemma 3.3.1 in [8].
Theorem 2.3** (Bound for forward Kolmogorov equations).**
Let and . Assume (Int--()). Let r\in L^{\infty}\big{(}[0,T],(W^{n,\infty}(\mathbb{T}^{d}))^{\prime}\big{)}. Then the Cauchy problem defined by
[TABLE]
interpreted as
[TABLE]
for each , has a unique solution in L^{\infty}\big{(}[0,T],(W^{n,\infty}(\mathbb{T}^{d}))^{\prime}\big{)} such that
[TABLE]
for some constant .
Proof.
We consider the space , where . We recall that the norm of is given by
[TABLE]
For , we consider the Cauchy problem
[TABLE]
By Schauder estimates, setting defines a continuous and compact map . (See Step 1 in the proof of Lemma 3.3.1 in [8]). We show the existence of solution to (2.7) by applying the Leray-Schauder theorem, i.e. by showing that the set
[TABLE]
is bounded. To this end, we pick an arbitrary , which satisfies the Cauchy problem
[TABLE]
The estimates rely on the classical argument of duality pairing. Fix and . Let be the solution to the Cauchy problem
[TABLE]
By Theorem 2.2, satisfies
[TABLE]
By the definition of (2.9), we have
[TABLE]
Therefore, by (2.10),
[TABLE]
We now estimate each of the three terms on the right hand side by (2.11). Firstly,
[TABLE]
By (2.11) and (Int--()), we obtain the estimate
[TABLE]
Finally, by (2.11),
[TABLE]
By (2.12), along with estimates (2.13), (2.14) and (2.15), we have
[TABLE]
which concludes by Gronwall’s inequality that
[TABLE]
Now we pick . Then (2.12) becomes
[TABLE]
By combining (2.16) with the argument of (2.14),
[TABLE]
Similarly,
[TABLE]
Therefore, by combining (2.17), (2.18) and (2.19), we have
[TABLE]
Combining with (2.16) gives
[TABLE]
Consequently, by the Leray-Schauder theorem, the map admits a fixed point. This shows the existence of solution to (2.7). For uniqueness, one simply has to apply a Gronwall argument to (2.12). Finally, the estimate for the solution follows by repeating the proof up to (2.16), but with . ∎
Lemma 2.4**.**
Assume (Int--()), where . Then the Cauchy problem defined in (1.14) has a unique solution in L^{\infty}\big{(}[0,T],(W^{n,\infty}(\mathbb{T}^{d}))^{\prime}\big{)}.
Proof.
This is immediate from Theorem 2.3. ∎
For every , , let
[TABLE]
Let . By (1.1) and (1.14), we have
[TABLE]
We rewrite the final two terms as
[TABLE]
Therefore, we obtain that
[TABLE]
In distributional sense, we write
[TABLE]
where
[TABLE]
We first establish the regularity of .
Lemma 2.5**.**
Assume (Int--()) and (Lip--()), where . Then c(\cdot,\mu,\hat{\mu})\in L^{\infty}\big{(}[0,T],(W^{n,\infty}(\mathbb{T}^{d}))^{\prime}\big{)}.
Proof.
For any
[TABLE]
Next, we estimate each of the two terms. By (2.1) and (Int--()), since ,
[TABLE]
Similarly, by (Lip--()),
[TABLE]
Combining (2.24) and (2.25), we have
[TABLE]
which implies that is a bounded operator with its operator norm given by
[TABLE]
∎
The following theorem is a straightforward consequence of the above results.
Theorem 2.6**.**
Assume (Int--()), (Lip--()), (TLip--()) and (TReg--()), where . Then the following statements hold.
- (i)
There exists some constant such that
[TABLE] 2. (ii)
For defined by (1.2),
[TABLE] 3. (iii)
[TABLE]
Proof.
- (i)
This follows from (2.22), estimate (2.26) and Theorem 2.3. 2. (ii)
Let be the optimal transport plan from to . The computation from the proof of Proposition 5.44 from [9] shows that
[TABLE]
By (TLip--()), (2.1) and the fact that
[TABLE]
there exists some constant such that
[TABLE]
By assumption (TReg--()) and part (i), there exists some constant such that
[TABLE]
which completes the proof. 3. (iii)
[TABLE]
by (2.30) and the fact that .
∎
2.2 Analysis of the backward Kolmogorov equation
We observe that, in (2.27), the integral is with respect to the signed measure . To show that indeed has a linear functional derivative, we need to express the integral in terms of the signed measure . To this end, we fix and and introduce the decoupled process by
[TABLE]
For every and , we define a function such that
[TABLE]
Note that
[TABLE]
It is well-known that (see, for example, equation (3.4) in [7])
[TABLE]
Therefore,
[TABLE]
for any . If exists, taking derivative w.r.t. at [math] gives
[TABLE]
Hence, it suffices to study the regularity of . In most of the analysis for , we suppress the parameters and , for simplicity of notations. By the standard Feynman-Kac equation (Kolmogorov backward equation), satisfies the PDE
[TABLE]
Lemma 2.7**.**
Assume (Int--()), where . Suppose that . Then the Cauchy problem defined in (2.34) has a unique solution in L^{\infty}\big{(}[0,t],W^{n+1,\infty}(\mathbb{T}^{d})\big{)}. Moreover, there exists a constant (depending on ) such that for any ,
[TABLE]
Proof.
The fact that v\in L^{\infty}\big{(}[0,t],W^{n+1,\infty}(\mathbb{T}^{d})\big{)} follows from Proposition 2.2. For the second part, take any . Let
[TABLE]
Then satisfies the Cauchy problem
[TABLE]
Using the same argument as (2.24), by (2.1), (Int--()) and Proposition 2.2, there exists a constant such that
[TABLE]
∎
The core analysis of backward Kolmogorov equations depends on the following fact.
Theorem 2.8** (Bound for backward Kolmogorov equations).**
Assume (Int--()), where . Suppose that . Let q\in L^{\infty}\big{(}[0,t],(W^{n,\infty}(\mathbb{T}^{d}))^{\prime}\big{)} and \gamma\in L^{\infty}\big{(}[0,t],W^{n,\infty}(\mathbb{T}^{d})\big{)}. Then the Cauchy problem
[TABLE]
has a unique solution in L^{\infty}\big{(}[0,t],W^{n+1,\infty}(\mathbb{T}^{d})\big{)} such that
[TABLE]
for some constant depending on .
Proof.
By (Int--()) and Proposition 2.2,
[TABLE]
∎
Formal differentiation of (2.34) w.r.t. the measure component gives
[TABLE]
We now study the regularity of .
Lemma 2.9**.**
Assume (Int--()), where . Suppose that . Then the Cauchy problem defined in (2.37) has a unique solution in L^{\infty}\big{(}[0,t],W^{n+1,\infty}(\mathbb{T}^{d})\big{)}. Moreover, satisfies the relation
[TABLE]
Proof.
The first part of the lemma follows directly from Theorem 2.8. For the second part, we note that satisfies
[TABLE]
where the final term uses the normalisation condition of . Integrating both sides w.r.t. with measure , we have
[TABLE]
By comparing (2.37) and (2.38), the result follows by an argument of stability similar to Corollary 3.4.2 of [8]. ∎
As before, we consider the difference
[TABLE]
Then satisfies the Cauchy problem
[TABLE]
where
[TABLE]
The following result is immediate.
Theorem 2.10**.**
Assume (Int--()) and (Lip--()), where . Suppose that . Then exists and is given by
[TABLE]
Proof.
We proceed in the same way as in the proof of Lemma 2.5. By (Int--()), (Lip--()), (2.1), (2.30) and Lemma 2.7, we deduce from (2.40) that
[TABLE]
for some constant depending on . Therefore, by Theorem 2.8,
[TABLE]
Therefore, by Lemma 2.9,
[TABLE]
We conclude the result by the characterisation of linear functional derivatives in Remark 5.47 of [9]. ∎
Corollary 2.11** (Existence of the first order linear derivative).**
Assume (Int--()), (Lip--()), (TLip--()) and (TReg--()), where . Then exists and is given by
[TABLE]
for every .
Proof.
Fix . Firstly, we recall from (2.32) that
[TABLE]
Since satisfies (TLip--()) and (TReg--()), the function
[TABLE]
satisfies (TLip-) and (TReg--()). Moreover,
[TABLE]
lies in . Therefore, by part (iii) of Theorem 2.6 and Theorem 2.10, we differentiate (2.41) w.r.t. at [math], which gives (by (LABEL:v_phi_connection_2))
[TABLE]
Putting , we have
[TABLE]
Finally, by part (iii) of Theorem 2.6, we conclude that exists and is given by
[TABLE]
∎
3 Higher order forward and backward Kolmogorov equations
In this section, we repeat the same procedure in the previous section to establish regularity of higher order Kolmogorov equations. In order to proceed with an iteration argument, we first introduce the following class of multi-indices in the class .
3.1 Definitions and notations for iteration in multi-indices in the class
Definition 3.1** (Class of multi-indices).**
For any , the class contains all multi-indices of the form
[TABLE]
where , and are non-negative integers and , , , , , are positive integers satisfying
- (i)
2. (ii)
3. (iii)
exactly one of and is equal to , 4. (iv)
[TABLE] 5. (v)
for any
[TABLE]
In particular, is called the order of defined by
[TABLE]
Moreover, for any , we define the magnitude of by
[TABLE]
If , for some , we write
[TABLE]
Remark 3.2**.**
This definition is modified accordingly when one of , and is zero. When , we set \lambda:=\big{(}0,\hat{\beta},(\hat{\alpha}_{\ell})_{1\leq\ell\leq\hat{\beta}}\big{)}. On the other hand, when , we set \lambda:=\Big{(}{\hat{n}},(\beta_{j})_{j=1}^{{\hat{n}}},({\alpha_{i,j}})_{\begin{subarray}{c}1\leq i\leq{\hat{n}}\\ 1\leq j\leq\beta_{i}\end{subarray}},0\Big{)}. Finally, when , for some , the column entry of disappears in the array .
Next, we introduce the recurrence map for multi-indices, followed by the sequence of multi-dimensional vectors of elements in .
Definition 3.3** (Recurrence map ).**
Let be given by the form (3.1). We define a recurrence map by
[TABLE]
Definition 3.4** (Multi-dimensional vectors of elements in ).**
We first define
[TABLE]
For every , we define a multi-dimensional vector of elements in by the recurrence relation
[TABLE]
for \lambda_{k}=\big{(}\lambda^{(1)}_{k},\ldots,\lambda^{(m(\lambda_{k}))}_{k}\big{)}.
3.2 Analysis of higher order forward Kolmogorov equations
In this subsection, we consider the following Cauchy problem (defined recursively by (3.5), (3.7), Definition 3.3 and Definition 3.4):
[TABLE]
where, for , For given by (3.1), we define
[TABLE]
Note that can be interpreted as an element in the dual space (under the assumption (Int--())):
[TABLE]
For any , we define
[TABLE]
Theorem 3.5**.**
Let . Assume (Int--()), where . Then (3.6) is well-defined and the Cauchy problem defined by (3.4) has a unique solution in L^{\infty}\big{(}[0,T],(W^{n+k-1,\infty}(\mathbb{T}^{d}))^{\prime}\big{)} and satisfies
[TABLE]
Also, if we assume (Int--()), then
[TABLE]
for any , for some constant .
Proof.
We proceed by strong induction for (3.8). The base step follows clearly from (1.14) and Theorem 2.3, since
[TABLE]
Suppose that (3.8) holds for . Take any and . We first show that (3.6) is well-defined, i.e. is indeed in , for any . Note that , which implies by (3.2) and (3.3) that
[TABLE]
where the final step follows from (Int--()). Therefore, the first statement that the Cauchy problem has a unique solution in L^{\infty}\big{(}[0,T],(W^{n+k-1,\infty}(\mathbb{T}^{d}))^{\prime}\big{)} and (3.8) both follow directly from Theorem 2.3, by the assumption of (Int--()) and the fact that is in .
It remains to prove (3.9) under the stronger assumption (Int--()). Let . Again, we proceed by strong induction. The base step is omitted as it is a special case of the procedure of the induction step. Suppose that (3.9) holds for . Replacing by in (3.4), we have
[TABLE]
On the other hand, we have
[TABLE]
Next, we compute that
[TABLE]
Note that the first term in (3.13) can be rewritten as
[TABLE]
by which we can estimate by the assumption (Int--()). For every , we know that by definition. For and
[TABLE]
By the induction hypothesis, for every ,
[TABLE]
Similarly, by the induction hypothesis, for ,
[TABLE]
Hence, by (3.13), (LABEL:eq:_linear_derivative_higher_order_trick), (3.15), (3.16), (3.17) and the assumption of (Int--()), we obtain that
[TABLE]
Let
[TABLE]
Subtracting (3.11) by (3.12) gives
[TABLE]
Let be an element in the dual space defined by
[TABLE]
Clearly, by (Int--()) and (3.18), it follows from the same argument as Lemma 2.5 to deduce that
[TABLE]
By (LABEL:dk+1_formula) (and replacing by arbitrary test functions ) we note that satisfies the Cauchy problem
[TABLE]
Therefore, by Theorem 2.3 and (Int--()),
[TABLE]
This completes the proof by (3.21). ∎
Theorem 3.6**.**
Let . Assume (Int--()) and (Lip--()), where . Then
[TABLE]
for any , for some constant .
Proof.
We proceed by strong induction. The base case is done in Theorem 2.6. Assume that the theorem holds for , where . Then
[TABLE]
Take . We first recall from the definition of (given in Definition 3.4) that the PDE for is given by
[TABLE]
Recalling the definition of in (3.19), we define
[TABLE]
Subtracting (LABEL:dk+1_formula) by (3.25) (and replacing by arbitrary test functions ), we observe that satisfies the Cauchy problem
[TABLE]
where
[TABLE]
and , are elements in the dual space defined by
[TABLE]
and, by (3.13),
[TABLE]
Note that the term {\Big{\langle}\xi,c_{1}^{(k+1)}(t,\mu,\mu_{1},\ldots,\mu_{k},\mu_{k+1})\Big{\rangle}}_{n+k+1,\infty} can be rewritten as
[TABLE]
By Theorem 3.5, the first term of (3.27) is controlled by
[TABLE]
By the same argument as (LABEL:eq:_linear_derivative_higher_order_trick) and (3.15), the second and third terms of (3.27) are controlled by
[TABLE]
where the estimate for the first term follows from (Lip--()) with the same argument as (2.25). This shows that
[TABLE]
Similarly, by (Int--()), (Lip--()) and Theorem 3.5, along with a similar argument applied to the induction hypothesis (3.24) (as in estimates (3.15), (3.16) and (3.17)), we can show that, for ,
[TABLE]
Therefore,
[TABLE]
Finally, by (Int--()), (3.26) and Theorem 2.3, we conclude that
[TABLE]
∎
3.3 Analysis of higher order backward Kolmogorov equations
In this subsection, we fix and consider the following Cauchy problem (defined recursively by (3.29), Definition 3.3 and Definition 3.4):
[TABLE]
where
[TABLE]
The following theorem gives the regularity of by Schauder estimates.
Theorem 3.7**.**
Let . Assume (Int--()), where . Suppose that . Then the Cauchy problem defined by (3.28) has a unique solution in L^{\infty}\big{(}[0,t],W^{n+1,\infty}(\mathbb{T}^{d})\big{)}.
Proof.
We proceed by strong induction. The base step is proven in Lemma 2.9. For the induction step, we assume that the statement is true for , where . For each , by (Int--()),
[TABLE]
which implies that . This completes the induction step by repeating the same argument as in the proof of Theorem 2.8. ∎
The following theorem is an analogue of Theorem 3.6 for backward Kolmogorov equations. The computations in the proof follow the same ideas as those in the previous subsection, i.e. Theorem 3.5 and Theorem 3.6. Consequently, the proof is omitted for brevity.
Theorem 3.8**.**
Let . Assume (Int--()) and (Lip--()), where . Suppose that . Then
[TABLE]
for any , for some constant .
We now establish the th order linear derivative of in terms of .
Theorem 3.9**.**
Let . Assume (Int--()) and (Lip--()), where . Suppose that . Then
[TABLE]
where the linear derivative is taken with respect to . Consequently, exists and is given by
[TABLE]
Proof.
Replacing by in Theorem 3.8 gives
[TABLE]
It follows from a similar argument as Lemma 2.9 to show that
[TABLE]
This proves the first equality. For the second equality, an inductive argument gives
[TABLE]
∎
3.4 Connection between higher order forward and backward equations
In this section, we follow the same approach as Section 2.2 to show that integrals with respect to the signed measure can be re-expressed in terms of the signed measure .
Theorem 3.10**.**
Let . Assume (Int--()) and (Lip--()), where . Suppose that . We define a sequence of functions , , by the following iteration:
[TABLE]
[TABLE]
for , where is taken with respect to . Then the sequence is well-defined and
[TABLE]
Proof.
By Theorem 3.9, the sequence is well-defined. To prove the equality, we proceed via an induction argument. The base step is established in (LABEL:v_phi_connection_2). For the inductive step, we assume that
[TABLE]
By replacing by in Theorem 3.6, we have
[TABLE]
for any , for some constant . Since , it follows from the proof of Theorem 2.6 to observe that
[TABLE]
On the other hand, by the chain rule of differentiation,
[TABLE]
The proof is complete by combining (3.33) and (3.34). ∎
4 Regularity of higher order derivatives in measure of
4.1 Definitions and notations for iteration in multi-indices in the class
In order to obtain a general formula for the th order linear derivative of , we proceed with another iteration argument. Therefore, we need to introduce another class of multi-indices.
Definition 4.1** (Class of multi-indices).**
For any , the class contains all multi-indices of the form
[TABLE]
where and are non-negative integers and , , , are positive integers satisfying
- (i)
[TABLE] 2. (ii)
[TABLE] 3. (iii)
for any
[TABLE]
In particular, is called the order of defined by
[TABLE]
Moreover, for any , we define the magnitude of by
[TABLE]
If , for some , we write
[TABLE]
Next, we introduce the recurrence map for multi-indices in , followed by the sequence of multi-dimensional vectors of elements in .
Definition 4.2** (Recurrence map ).**
Let be given by the form (4.1). We define a recurrence map by
[TABLE]
Definition 4.3** (Multi-dimensional vectors of elements in ).**
We first define
[TABLE]
For every , we define a multi-dimensional vector of elements in by the recurrence relation
[TABLE]
for \Lambda_{k}=\big{(}\Lambda^{(1)}_{k},\ldots,\Lambda^{(m(\Lambda_{k}))}_{k}\big{)}.
4.2 Analysis of higher order linear derivatives of
We begin by establishing a higher-order analogue of Theorem 2.6.
Lemma 4.4**.**
*Let . Assume (Int--()), (Lip--()) and
(TReg--()), where . Then, for and ,*
[TABLE]
for every
Proof.
Since , the condition (Int--()) implies (Int--()). Similarly, the condition (Lip--()) implies (Lip--()). By Theorem 3.6, we have
[TABLE]
for any , for some constant . On the other hand, the condition (TReg--()) implies (TReg--()). The rest of the proof is identical to the proof of Theorem 2.6. ∎
We are now in a position to prove the main result of the paper. Clearly, one can obtain the minimal condition by setting (as in the introduction).
Theorem 4.5**.**
Let and . Assume (Int--()), (Lip--()), (TLip--()) and (TReg--()). Then exists and is given by
[TABLE]
In particular, if we also assume (TInt--()), then
[TABLE]
Proof.
We proceed by induction on . We first prove the statement for . By Corollary 2.11, we know that exists. Therefore, by (2.28),
[TABLE]
By the normalisation convention of ,
[TABLE]
Therefore, putting gives
[TABLE]
We now assume that this statement holds for . Therefore, for any , we have
[TABLE]
By the chain rule of differentiation,
[TABLE]
Therefore, the right-hand derivative at exists and is given by
[TABLE]
By the assumptions (TReg--()) (which implies (TReg--())) and (TLip--()), we can repeat the same argument as in Theorem 2.6. For any signed measures and ,
[TABLE]
The second part of (LABEL:right_hand_derivative_long_expression) can be computed by Lemma 4.4. Therefore, by (LABEL:right_hand_derivative_long_expression), (4.9) and Lemma 4.4,
[TABLE]
For each and , we define functions by
[TABLE]
By the assumption (TReg--()), it is clear that Therefore, using the notations (3.31) and (3.32), Theorem 3.10 implies that
[TABLE]
which shows that exists and is given by
[TABLE]
By adopting the same normalisation argument as (4.5) and (4.6), formula (4.10) gives
[TABLE]
Finally, if we also assume (TInt--()), then by Theorem 3.5, for any ,
[TABLE]
∎
Acknowledgements
The author is indebted to Prof. Pierre Cardaliaguet and Dr. Łukasz Szpruch for useful suggestions in various occasions, and to Prof. François Delarue for the help in developing the proof of Proposition 2.2.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Viorel Barbu and Michael Röckner. From nonlinear Fokker-Planck equations to solutions of distribution dependent SDE. Annals of Probability, 48(4):1902–1920, 2020.
- 2[2] Lorenzo Bertini, Giambattista Giacomin, and Khashayar Pakdaman. Dynamical aspects of mean field plane rotators and the Kuramoto model. Journal of Statistical Physics , 138(1):270–290, 2010.
- 3[3] Vladimir I Bogachev, Nicolai V Krylov, Michael Röckner, and Stanislav V Shaposhnikov. Fokker-Planck-Kolmogorov Equations , volume 207. American Mathematical Soc., 2015.
- 4[4] François Bolley, Arnaud Guillin, and Florent Malrieu. Trend to equilibrium and particle approximation for a weakly selfconsistent Vlasov-Fokker-Planck equation. ESAIM: Mathematical Modelling and Numerical Analysis, 44(5):867–884, 2010.
- 5[5] François Bolley, Arnaud Guillin, and Cédric Villani. Quantitative concentration inequalities for empirical measures on non-compact spaces. Probability Theory and Related Fields , 137(3-4):541–593, 2007.
- 6[6] Mireille Bossy, Jean-François Jabir, and Denis Talay. On conditional Mc Kean Lagrangian stochastic models. Probability theory and related fields , 151(1 − - 2):319 − - 351, 2011.
- 7[7] Rainer Buckdahn, Juan Li, Shige Peng, and Catherine Rainer. Mean-field stochastic differential equations and associated PD Es. The Annals of Probability, 45(2):824 − - 878, 2017.
- 8[8] Pierre Cardaliaguet, François Delarue, Jean-Michel Lasry, and Pierre-Louis Lions. The master equation and the convergence problem in mean field games. volume 201. Princeton University Press, 2019.
