On the fundamental solution and a variational formulation of a degenerate diffusion of Kolmogorov type
Manh Hong Duong, Hoang Minh Tran

TL;DR
This paper constructs the fundamental solution for a degenerate Kolmogorov diffusion and introduces a variational scheme for its adjoint, leveraging optimal transport and mean squared derivative costs, with proven convergence.
Contribution
It provides a novel fundamental solution and a variational scheme for the adjoint of a degenerate Kolmogorov diffusion, extending previous methods.
Findings
Successfully constructed the fundamental solution.
Developed a convergent variational scheme.
Extended results to more general degenerate diffusions.
Abstract
In this paper, we construct the fundamental solution to a degenerate diffusion of Kolmogorov type and develop a time-discrete variational scheme for its adjoint equation. The so-called mean squared derivative cost function plays a crucial role occurring in both the fundamental solution and the variational scheme. The latter is implemented by minimizing a free energy functional with respect to the Kantorovich optimal transport cost functional associated with the mean squared derivative cost. We establish the convergence of the scheme to the solution of the adjoint equation generalizing previously known results for the Fokker-Planck equation and the Kramers equation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the fundamental solution and a variational formulation for a degenerate diffusion of Kolmogorov type
Manh Hong Duong Corresponding author: M. H. Duong Department of Mathematics, Imperial College London, London SW7 2AZ, UK. Email: [email protected].
Hoang Minh Tran
Data Analytics Department, Esmart Systems, Håkon Melbergs vei 16, 1783 Halden, Norway. Email: [email protected]
Abstract
In this paper, we construct the fundamental solution to a degenerate diffusion of Kolmogorov type and develop a time-discrete variational scheme for its adjoint equation. The so-called mean squared derivative cost function plays a crucial role occurring in both the fundamental solution and the variational scheme. The latter is implemented by minimizing a free energy functional with respect to the Kantorovich optimal transport cost functional associated with the mean squared derivative cost. We establish the convergence of the scheme to the solution of the adjoint equation generalizing previously known results for the Fokker-Planck equation and the Kramers equation. ††footnotetext: 2010 Mathematics Subject Classification. Primary: 49S05; Secondary: 49J40, 35Q84.
Key words and phrases. Generalized gradient flow structure, variational methods, hypo-elliptic PDEs, optimal transport.
1 Introduction
In this paper, we are interested in the following partial differential equation
[TABLE]
and its adjoint equation when
[TABLE]
In the above equations, the unknowns are and where and are time and spatial co-ordinates, respectively. The notations and denote the gradient, the divergence and the Laplacian operators respectively. The subscripts in these operators indicate that they only act on the corresponding variables. The two equations are complemented with initial data. The initial data for (1) is a probability measure, , on . Since the right-hand side of (1) is a divergence form, noting that the summation term can be written as , for each , is also a probability measure on . Since we will be interested in the fundamental solution of (2), the initial data for (2) is a Dirac measure for . Two special cases of (1), that correspond to and respectively, are the Fokker-Planck equation
[TABLE]
and the Kramers equation
[TABLE]
Their corresponding adjoint equations with are, respectively, the diffusion equation and the ultra-parabolic equation
[TABLE]
Equations (1) and (2), particularly the Fokker-Plank equation and the Kramers equation, play an important role in statistical mechanics [Ris84], reaction-rate theory [HTB90] and mathematical finance [LPP02, Pas05]. For instance, in the context of statistical mechanics, (1) has been used to describe the time evolution of the probability density function of a many-particle system. Equation (1) with can also be viewed as a simplified model of a finite Markovian approximation for the generalised Langevin dynamics [OP11, Duo15] or a model of a harmonic chains of oscillators that arises in the context of non-equilibrium statistical mechanics [EH00, BL08, DM10].
Equations (1) and (2) belong to a class of degenerate diffusions of Kolmogorov type where the Laplacian acts only in some of variables. Two main issues that have been getting a lot of research interest in this class is (i) constructing the fundamental solution of (2) and (ii) developing a time-discrete variational scheme for (1). To motivate our work, let us discuss relevant literature on these issues for the Fokker-Planck equation and the Kramers equation. Regarding the first issue, it is a classical result that the diffusion equation has the fundamental solution
[TABLE]
In the seminal paper [Kol34], Kolmogorov show that
[TABLE]
is the fundamental solution to the ultra-parabolic equation (5). It is this equation that was Hörmander’s starting point to develop the hypo-elliptic theory [Hör67], which has become a powerful tool in the theory of partial differential equations, see e.g. [Bra14] for discussions. Since Kolmogorov’s paper there has been a considerable amount of work on extending his result to other hypoelliptic equations including (2), see e.g., [Web51, Kup72, Pol94, FP05, DM10, IB15]. We refer to the mentioned papers and references therein for more information on this direction.
Regarding the second issue, the functions and also play a crucial role in the more recent development of time-discrete variational schemes for the Fokker-Planck equation and the Kramers equation, respectively. In the seminal paper [JKO98], Jordan-Kinderlehrer-Otto prove a remarkable result that the Fokker-Planck equation (3) can be seen as a gradient flow of the free energy, which is the sum of the potential energy and the Boltzmann entropy , with respect to the Wasserstein distance on the space of probability measures with finite second moments. Let be two probability measures on some Euclidean space having finite second moments. Throughout this paper, denotes the set of all probability measures on having and as first and second marginals. The Wasserstein distance between them is defined via
[TABLE]
The main ingredient in [JKO98] is the following variational approximation scheme, which is now known as the JKO-scheme,
JKO-scheme: Let be a time-step. Define . Then, for each , is determined as
[TABLE]
The main result in [JKO98] then states that, after an appropriate interpolation, the sequence constructed from the JKO-scheme converges to the solution of the Fokker-Planck equation (3). This result has sparked a large body of research in the last two decades in the field of partial differential equations linking the field to some other branches of mathematics such as optimal transport theory, geometric measure theory and functional inequalities, see monographs [Vil03, AGS08, Vil09] for great expositions on the development.
Inspired by the JKO-scheme, Huang [Hua00] and then Duong et al. [DPZ14] have established different approximation schemes for the Kramers equation (4). The challenge here is that the techniques in [JKO98] can not be directly applied as the Kramers equation is neither a gradient flow nor a Hamiltonian flow even though these schemes are of the same form as in (9). The free energy functional is the same, but instead of the Wasserstein distance, the following Monge-Kantorovich optimal transport cost has been used in these papers
[TABLE]
where is defined in (1). The cost functional has also been used to construct time-discrete variational schemes for other evolution equations such as the system of isentropic Euler equations [GW09, Wes10] and the compressible Euler equations [CSW14].
Through the above discussions, we notice a similarity between the Fokker-Planck equation and the Kramers equation. That is the role of the cost functions and . They appear both in the discrete-time variational scheme and the fundamental solution. In addition, these cost functions also satisfy the following property: they minimize the velocity and acceleration integrals respectively
[TABLE]
Recently in [DT16] we have studied the minimization problem
[TABLE]
where and the infimum is taken over all curves that satisfy the boundary conditions
[TABLE]
The optimal value is called the mean squared derivative cost function and has been found to be useful in the modelling and design of various real-world systems such as motor control, biometrics and online-signatures and robotics, see [DT16] for further discussion.
Inspired by the role of and for the Fokker-Planck equation and the Kramers equation as discussed above, it is natural to ask
- (Q1)
Is the function
[TABLE]
where is the normalising constant, the fundamental solution to Equation (2)? 2. (Q2)
Can Equation (1) be approximated by a discrete-time variational scheme in the spirit of the JKO-scheme where is replaced by ?
The aim of the present paper is to provide affirmative answers to these questions. We now describe our main results.
1.1 Main results of the present paper
Our first main result is the following theorem about the fundamental solution to (2).
Theorem 1.1**.**
*The function defined in (13) is the fundamental solution to (2). That is, for each y, , as a function of and x, satisfies (2). In addition,
[TABLE]
This theorem together with the explicit formula for the mean squared derivative cost function in [DT16, Theorem 1.2] (see also Theorem 2.1 below) provide an explicit formula for the fundamental solution .
Our second main result is concerned with a variational formulation of (1). We first introduce the approximation scheme in the spirit of the JKO-scheme. Let be given and be the mean square derivative cost function defined in (11). Let and be two probability measures on having finite second moments. The Monge-Kantorovich optimal transport cost between and is defined by
[TABLE]
The variational approximation scheme of this paper is constructed as follows.
Scheme 1.2**.**
Let . For , define as the solution of the minimization problem
[TABLE]
Next we introduce the concept of a weak solution of (1). A function is called a weak solution of equation (1) with initial datum if it satisfies the following weak formulation of (1):
[TABLE]
Throughout the paper we make the following assumptions.
Assumption 1.3**.**
[TABLE]
and there exists a constant such that for all
[TABLE]
The second main result of the paper is about the convergence of the approximation scheme to a weak solution of (1).
Theorem 1.4**.**
Suppose that satisfies Assumption 1.3. Let satisfy . For any sufficiently small, let be the sequence of the solutions of the (16). For any , define the piecewise-constant time interpolation
[TABLE]
Then for any ,
[TABLE]
where is the unique weak solution of Equation (1) with initial value . Moreover
[TABLE]
and as ,
[TABLE]
1.2 Comparison to related work
Our work is twofold: to construct the fundamental solution to equation (2) (Theorem 1.1) and to develop a time-discrete variational scheme for equation (1) which is the adjoint equation of (2) with an additional external force field (Theorem 1.4). The mean squared derivative cost function (11) plays a central role appearing both in the fundamental solution and in the approximation scheme. We now give further comments on these issues.
On the fundamental solution of (2). The fundamental solution to (2) is not new. As mentioned above, it has been shown in [Web51, Hör67, Kup72, Pol94, FP05, DM10, IB15]. In these papers, the fundamental solution is constructed using various methods such as parametrix and Fourier-transform. In this paper, we provide a direct verification based on elementary combinatorial and linear algebra techniques. We use explicit formulas for the mean squared derivative cost function that we obtained in our recent work [DT16]. Our method is closed to [Kup72]. However, this work does not represent the fundamental solution of (2) in terms of the mean squared derivative cost function. The reference [DM10] provides an implicit representation via the controllability property of a differential system but this work does not address the variational formulation of (1).
Variational formulation for equation (1). Theorem 1.4 is a generalisation of the main results of [JKO98, Hua00, DPZ14] for an arbitrary . Even though the standard procedure in these papers can be used to prove the theorem, two additional difficulties arise. The first difficulty is how to select an appropriate perturbation flow to derive the Euler-Lagrange equation for the sequence . The other difficulty is to prove the convergence of the scheme to a solution of (1) which amounts to show the error terms vanish as . These difficulties can be solved by using the explicit formulas involving the derivative mean squared cost function derived in our previous work [DT16]. We also note that [Hua00] and [DPZ14] considered the full Kramers equation (i.e., with ) with an external force field where is a given potential. In this case, the right-hand side of (1) has an additional term . [Hua00] has dealt with this term by adding into the free-energy functional resulting in an unusual scale. In contrast, this term has been encoded in the cost function in [DPZ14]. In the present work for arbitrary number of variables , we have made an assumption that depends only on the last co-ordinate . Due to this assumption Equation (1) resembles the Fokker-Planck equation in the last co-ordinate and the cost function depends only on the -th order derivative of giving rise to a controllable formula. It is not clear to us at the moment how to adapt [Hua00, DPZ14] to deal with a more general case where depends on more than the last co-ordinate or when the co-ordinates are coupled in a more complex way as in [EH00, DM10, OP11]. We leave this issue for future research.
Microscopic interpretation of the variational scheme. The main results in recent papers [ADPZ11, DLR13, DPZ13, EMR15] show that Scheme 1.2 with and for the Fokker-Planck equation and the Kramers equation, respectively, can be derived from large-deviation principles of empirical measures associated to underlying stochastic processes. These results provide, among other things, microscopic interpretations for Scheme 1.2 in these cases. We expect that these results can be extended to the general case , but we will not follow this direction in this paper.
1.3 Organization of the paper
The rest of the paper is organised as follows. In Section 2, we summarize the main properties of the mean squared cost function in [DT16]. In Section 3 and Section 4 we prove Theorem 1.1 and Theorem 1.4, respectively. Finally, Appendix 5 contains proofs of technical lemmas.
2 The mean square derivative cost function
In this section, we collect relevant results on the mean squared derivative cost function in [DT16].
Theorem 2.1**.**
[DT16*, Theorem 1.2]**
The mean square derivative cost function is given by
[TABLE]
where
[TABLE]
and with
[TABLE]
Moreover, the matrix and its inverse have explicit -decompositions given in the following theorem.
Theorem 2.2**.**
[DT16, Theorem 1.3]**
- (1)
, where and are defined as follows
[TABLE] 2. (2)
The inverse of is given by the product of the following two matrices:
[TABLE]
Note that throughout this paper, all the matrices and are of order . Each entry of these matrices should be understood as a -dimensional matrix that is equal to the entry multiplies with the -dimensional identity matrix . For instance, , for should be understood as the matrix . The multiplication of matrices are carried out as the multiplication of block matrices.
In the sequel sections, we also need the following the property of the cost function .
Lemma 2.3**.**
*There exists a constant independent of such that
[TABLE]
Proof.
It follows from the formula for that only the symmetric part of contributes to . We have
[TABLE]
where
[TABLE]
By the definition of b, we have
[TABLE]
Therefore
[TABLE]
where , which implies that .
By Cauchy-Schwarz inequality, for sufficiently small, we have and for some constant . We use the notation to denote a universal constant that may change from line to line. Therefore, we get
[TABLE]
This finishes the proof of this lemma. ∎
3 The fundamental solution of the adjoint equation (2)
In this section we prove Theorem 1.1. The proof consists of two main steps which are Proposition 3.1 and Proposition 3.2 below.
Proposition 3.1**.**
Let be defined as in (13). Then it is a solution of (2) if and only if satisfies the following equation
[TABLE]
Proof.
Let . We compute each term in (2) from the representation of in (13). For the simplicity of notation, in the following computations, we denote .
First, we calculate the time-derivative of .
[TABLE]
Next, we calculate . We have
[TABLE]
from which by taking the sum over from to , we get
[TABLE]
The Laplacian, , with respect to variable is computed analogously
[TABLE]
It follows from (36), (37) and (38) that is a solution of (2) if and only if
[TABLE]
The above equality is equivalent to
[TABLE]
By re-arranging the terms in the above equality and recalling that , we obtain (35). This finishes the proof of the proposition. ∎
The following matrices will play important role in the rest of the paper
[TABLE]
Note that . Each entry of these matrices should be understood as a matrix of order that equals to the entry multiply with the -dimensional identity matrix.
Proposition 3.2**.**
The following assertions hold
- (1)
* is anti-symmetric.* 2. (2)
. 3. (3)
* is anti-symmetric.* 4. (4)
.
Proof.
The assertions of the lemma are proved by using combinatorial techniques and are given in Appendix 5. ∎
The following lemma is elementary but will be used several times in the sequel. We include it for the sake of completeness and reference.
Lemma 3.3**.**
Suppose that is an anti-symmetric matrix, then for all we have,
[TABLE]
Proof.
We have . ∎
We are now ready to prove Theorem 1.1.
Proof of Theorem 1.1.
To prove Theorem 1.1, by Proposition 3.1 it is sufficient to prove (35). According to Theorem 2.1 we have
[TABLE]
where
[TABLE]
where and are given in (39). Therefore, we have
[TABLE]
Next we will verify Proposition 3.1. We need to show that the function satisfies Equation (35).
We compute the time-derivative of first.
[TABLE]
From (44), we have
[TABLE]
Using the matrices and defined in (40), and can be computed as follows
[TABLE]
and similarly
[TABLE]
Therefore, we get
[TABLE]
where to obtain the second equality we have used the fact that .
The Laplacian is then computed via the Trace operator.
[TABLE]
We now verify that
[TABLE]
Substituting the computations from (3) to (48), we need to verify that
[TABLE]
or equivalently, using (41)–(43), we need to verify that
[TABLE]
According to Proposition 3.2 we have
- (i)
and , 2. (ii)
and are anti-symmetric. By Lemma 3.3, we have .
Therefore, using (i)–(ii) above, we obtain
[TABLE]
which is equal to the left-hand side of (3) as required.
Finally, the initial condition (14) follows from the representation of in Theorem 2.1 and the formula of b in (24). ∎
4 The variational formulation of Equation (1)
4.1 Well-posedness of Scheme 1.2 and the Euler-Langrange equation
In this section we prove the well-posedness of Scheme 1.2 and establish the Euler-Lagrange equations for the sequence of its minimizers.
Under Assumption 1.3 the free energy functional
[TABLE]
is well-defined in . The following two lemmas show that Scheme 1.2 is well-defined. Their proofs are now classical, see e.g., [Vil03, Theorem 1.3], [JKO98, Proposition 4.1], and [Hua00, Lemma 4.2]). Hence we will omit them here.
Lemma 4.1**.**
*Let be given. There exists a unique optimal plan such that
[TABLE]
Lemma 4.2**.**
Let be given. If is small enough, then the minimization problem
[TABLE]
has a unique solution.
Next we establish the Euler-Lagrange equation for the sequence of minimizers of Scheme 1.2. We will need two auxiliary lemmas whose proofs are presented in Appendix 5.
Lemma 4.3**.**
* where*
[TABLE]
In particular and for . Note that where should be understood as .
Lemma 4.4**.**
Let . Then
[TABLE]
In particular, and for all . Note also that where should be understood as .
Having these two lemmas, we are now ready to derive the Euler-Lagrange equation for the sequence of minimizers in Scheme 1.2.
Lemma 4.5** (Euler-Lagrange equation for the sequence of minimizers).**
*Let be the sequence of the minimizers of Scheme 1.2. Then we have
[TABLE]
where is the optimal plan in .
Proof.
Let be given and let be the unique solution of the minimization problem
[TABLE]
We will show that
[TABLE]
where is the optimal plan in .
Although establishing the Euler-Lagrange equation for the minimizer has become a well-established route, see e.g., [JKO98] and [Hua00, DPZ13] for that of the Fokker-Planck equation () and of the Kramers equation () respectively, there is one additional difficulty. That is how to select an appropriate perturbation flow from that the Euler-Langrange equation is deduced. We first define a perturbation of by a push-forward under an appropriate flow. Let . We define the flows such that
[TABLE]
Let be the push forward of under the flow , i.e., for any we have
[TABLE]
Since , we have . Taking derivatives with respect to of both sides gives
[TABLE]
We then compute, using (14) and (15), the stationarity condition for following the calculations in [JKO98, Hua00, DPZ13]
[TABLE]
According to (44) we have
[TABLE]
Therefore,
[TABLE]
Let . We choose such that
[TABLE]
where is given in Lemma 4.4 that implies that .
Using Lemmas 4.3 and 4.4, we compute
[TABLE]
(ii) .
(iii) .
Substituting these calculations back into (58) we obtain
[TABLE]
which is the desired equality (55).
Applying (55) for the minimizers of Scheme 1.2 yields the statement of Lemma 4.5. ∎
4.2 A priori estimates
In this section, we derive a priori estimates for the sequence of the minimizers of Scheme 1.2. The proofs of Lemma 4.6, Lemma 4.7 and Lemma 4.8 below are now standard, see e.g. [JKO98, Hua00, DPZ13], hence we omit them.
The following lemma provides an upper bound for the sum . From now on, we denote by the second moment of a probability measure , i.e., .
Lemma 4.6**.**
Let be the sequence of the minimizers of Scheme 1.2 for fixed . Then for any positive integer and sufficiently small , we have
[TABLE]
for some constant independent of .
The next lemma shows boundedness of the second moment and the entropy locally in time.
Lemma 4.7**.**
There exist positive constants , , and , independent of the initial data, such that for any , the solutions for Scheme 1.2 satisfy
[TABLE]
where .
The last lemma of this section extends Lemma 4.7 to any final time .
Lemma 4.8**.**
Let be the sequence of the minimizers of Scheme 1.2 for fixed . For any , there exists a constant depending on and on the initial data such that
[TABLE]
[TABLE]
[TABLE]
for any and , where .
4.3 Proof of Theorem 1.4
Having established the Euler-Lagrange equation and a priori estimates in previous sections, in this section we prove Theorem 1.4. The proof is similar to that of [JKO98, Hua00, DPZ13]. Therefore, we only present the part that is different, that is to prove the convergence of the discrete Euler-Lagrange equations to the weak formulation (17) of Equation (1) as . The key point is to link the Euler-Lagrange equation for the sequence of minimizers obtained in Lemma 4.5 to a time-discretization of Equation (1).
Let be a given final time. For each we set . Let be the sequence of minimizers of Scheme 1.2 and let be the piecewise-constant interpolation (20). By Lemma 4.8 we have
[TABLE]
Since the function has super-linear growth, (66) guarantees that there exists a subsequence, denoted again by , and a function such that
[TABLE]
We now prove that the limit satisfies the weak formulation (17). Let be given. All constants below depend on the parameters of the problem, on the initial datum , and on , but are independent of and of .
Let be the optimal plan for . For any , we have
[TABLE]
where the error term comes from the Taylor expansion of and can be estimated by
[TABLE]
Multiplying (68) with and combining with (54) we get
[TABLE]
where
[TABLE]
It is worthy noting that depends on through the -dependence of . Integrating (70) with respect to from to , we obtain
[TABLE]
Summing this relation from to gives
[TABLE]
where
[TABLE]
By a discrete integration by parts, the left hand side of (72) can be written as
[TABLE]
[TABLE]
Next we show that as . Indeed, we have
[TABLE]
Taking the limit in (75) yields equation (17) proving (21).
The proof of the stronger convergence (22) and of the continuity (23) at follows from the equi-near-continuity estimate, see [JKO98, Hua00, DPZ13]
[TABLE]
where is the Wasserstein distance between and defined in (8). This estimate follows from the inequality (see (34))
[TABLE]
and the estimates (66) and (64).
5 Appendix
This appendix contains proofs of technical lemmas in the previous sections.
5.1 Proof of Proposition 3.2
In this appendix, we prove Proposition 3.2. For the convenience, we recall the relevant matrices here:
[TABLE]
We need several auxiliary lemmas. The first one is an explicit formula for the inverse of the matrix define in (32).
Lemma 5.1**.**
Recall the matrix in (32)
[TABLE]
Then has the following form
[TABLE]
Proof.
Define the matrix by
[TABLE]
We now show that . We consider three cases
If , then
[TABLE]
The last equality is [math] since by the definition of we have for every . 2. 2.
If , then similarly as the case above, we get
[TABLE] 3. 3.
If , then since for and for , we get
[TABLE]
From these three cases, we imply that , therefore which concludes the proof of the lemma. ∎
The next two lemmas show relations between the matrices and .
Lemma 5.2**.**
It holds that
[TABLE]
Proof.
For three matrices , we have
[TABLE]
In particular, applying this identity for and , we get
[TABLE]
where we used the definition of and to obtain and respectively. This finishes the proof of the lemma. ∎
Lemma 5.3**.**
It holds that
[TABLE]
Proof.
Let . Then
[TABLE]
Note that iff and iff . We transform further .
If then . This gives that . 2. 2.
If , then we have
[TABLE]
Consider the function . On the one hand, since , it follows that . On the other hand, we have
[TABLE]
which implies that
[TABLE]
In addition, we have
[TABLE]
Summing up (82) and (83) yields . The term is equal to
[TABLE]
Hence if , then . 3. 3.
Finally, if , then we have
[TABLE]
From these three cases we obtain that
[TABLE]
As a result, we have
[TABLE]
which is (81). This finishes the proof of the lemma. ∎
The following lemma is a purely combinatoric identity. We expect that this identity is not new, but we include a proof here for the sake of completeness.
Lemma 5.4**.**
It holds that
[TABLE]
Proof.
We prove (89) by induction. Obviously, (89) is correct for . Assume that we (89) is valid with some value of .
We will prove (89) is also valid for . We have
[TABLE]
where in the last equality we have used the fact that \left(\begin{array}[]{c}n-1\\ k\end{array}\right)+\left(\begin{array}[]{c}n-1\\ k+1\end{array}\right)=\left(\begin{array}[]{c}n\\ k+1\end{array}\right). This is (89) for . Therefore by the induction principle, (89) is proved. ∎
Lemma 5.5**.**
For all and , we have
[TABLE]
Proof.
The identity (90) has been used and proved in our previous paper [DT16, Proof of Theorem 4.1]. For the convenience of the reader we provide a proof here. Consider the function . On the one hand, we have
[TABLE]
Therefore,
[TABLE]
It follows that
[TABLE]
and by multiplying both sides of this equality with , we get
[TABLE]
On the other hand, according to Leibniz formula, we have
[TABLE]
In particular, we get
[TABLE]
The identity (90) is then followed from (91) and (92). ∎
Having these lemmas, we are ready to prove Proposition 3.2 in the following subsections.
5.1.1 Proof of Proposition 3.2–(1): anti-symmetric
In this section,we prove that is anti-symmetric where
[TABLE]
We have
[TABLE]
Therefore, we get
[TABLE]
where we have used Lemma 5.2 to obtain . Let
[TABLE]
To prove is anti-symmetric, it suffices to prove that is anti-symmetric. Let be the element of . Then can be computed as follows.
[TABLE]
where to obtain the last equality, we have changed variable .
Applying (89) for ane we obtain
[TABLE]
Now applying (100) for and we get
[TABLE]
From (101) and (94) we achieve
[TABLE]
which implies that . Therefore, is anti-symmetric and so is .
5.1.2 Proof of Proposition 3.2–(2):
In this section, we prove that where
[TABLE]
According to Lemma 5.3, we have
[TABLE]
implying that
[TABLE]
Therefore,
[TABLE]
Since , we have
[TABLE]
Therefore, we get
[TABLE]
We will prove that where
[TABLE]
Note that the last equality is because . We compute each element of as follows.
[TABLE]
Therefore . It follows that .
5.1.3 Proof of Proposition 3.2–(3): anti-symmetric:
In this section, we prove that is anti-symmetric where
[TABLE]
According to Lemma 5.2 ; therefore we have
[TABLE]
We define
[TABLE]
To prove is anti-symmetric, it suffices to show that is anti-symmetric.
From (94) we have:
[TABLE]
which implies that (using the fact that )
[TABLE]
Using (102) and Lemma 5.3, we obtain
[TABLE]
Therefore, is anti-symmetric and so is .
5.1.4 Proof of Proposition 3.2–(4):
In this section, we prove that last identity in Proposition 3.2. We will show that
[TABLE]
We recall that all the matrices and are of order . Each entry of these matrix should be understood as a matrix of order that equals to the entry multiply with the -dimensional identity matrix .
Since , we have . Therefore
[TABLE]
Next we show that . According to Theorem 2.2, we have
[TABLE]
where
[TABLE]
Therefore,
[TABLE]
Now we show that
[TABLE]
Applying (90) for and we get
[TABLE]
which implies that
[TABLE]
as desired, that is .
Therefore we get as stated.
5.2 Proof of Lemma 4.3
In this secion we prove Lemma 4.3: we need to show that where and are given in (39) and (52).
We will show that for all .
- •
If , we have if and if . Therefore,
[TABLE]
- •
If , we have if and if . Therefore,
[TABLE]
- •
If we have if and if . Therefore,
[TABLE]
It follows from these cases that .
5.3 Proof of Lemma 4.4
In this section, we prove Lemma 4.4. First we will show that where
[TABLE]
We consider the following cases:
- •
If we have if and if . Therefore,
[TABLE]
- •
If , we have if and if . Therefore,
[TABLE]
- •
If , we have if and if . Therefore,
[TABLE]
Therefore, . We have . Therefore,
[TABLE]
We now transform further the last expression. Let , and . From (103), can be rewritten as follows
[TABLE]
We now compute in an alternative way. We have
[TABLE]
In particular, taking yields
[TABLE]
We now compute the integral on the RHS of (106). We have
[TABLE]
It follows from (105),(106) and (107)) that
[TABLE]
which finishes the proof of Lemma 4.4.
Acknowledgements
The majority of this paper was written when M. H. Duong was at the Mathematics Institute, University of Warwick and was supported by ERC Starting Grant 335120 while H. M. Tran was at the Department of Industrial Engineering, Texas A&M University.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[ADPZ 11] S. Adams, N. Dirr, M. A. Peletier, and J. Zimmer. From a large-deviations principle to the Wasserstein gradient flow: a new micro-macro passage. Communications in Mathematical Physics , 307:791–815, 2011.
- 2[AGS 08] L. Ambrosio, N. Gigli, and G. Savaré. Gradient flows in metric spaces and in the space of probability measures . Lectures in Mathematics. ETH Zürich. Birkhauser, Basel, 2nd edition, 2008.
- 3[BL 08] T. Bodineau and R. Lefevere. Large deviations of lattice hamiltonian dynamics coupled to stochastic thermostats. Journal of Statistical Physics , 133(1):1–27, 2008.
- 4[Bra 14] M. Bramanti. An Invitation to Hypoelliptic Operators and Hörmander’s Vector Fields . Springer, Berlin, 2014.
- 5[CSW 14] F. Cavalletti, M. Sedjro, and M. Westdickenberg. A variational time discretization for the compressible euler equations. http://arxiv.org/abs/1411.1012 , 2014.
- 6[DLR 13] M. H. Duong, V. Laschos, and M. Renger. Wasserstein gradient flows from large deviations of many-particle limits. ESAIM Control Optim. Calc. Var. , 19(4):1166–1188, 2013.
- 7[DM 10] F. Delarue and S. Menozzi. Density estimates for a random noise propagating through a chain of differential equations. J. Funct. Anal. , 259(6):1577–1630, 2010.
- 8[DPZ 13] M. H. Duong, M. A. Peletier, and J. Zimmer. GENERIC formalism of a Vlasov-Fokker-Planck equation and connection to large-deviation principles. Nonlinearity , 26(11):2951–2971, 2013.
