A Lagrangian scheme for the solution of nonlinear diffusion equations using moving simplex meshes
Jos\'e A. Carrillo, Bertram D\"uring, Daniel Matthes, David S., McCormick

TL;DR
This paper introduces a Lagrangian numerical scheme using moving simplex meshes for solving nonlinear degenerate diffusion equations, effectively capturing solution support growth and maintaining energy estimates.
Contribution
It presents a novel Lagrangian scheme based on gradient flow structure and finite linear maps, with proven energy consistency and applicability to complex nonlinear diffusion equations.
Findings
Scheme effectively tracks support growth in porous medium equations
Energy estimates are preserved in the discrete scheme
Numerical experiments demonstrate scheme's robustness and accuracy
Abstract
A Lagrangian numerical scheme for solving nonlinear degenerate Fokker-Planck equations in space dimensions is presented. It applies to a large class of nonlinear diffusion equations, whose dynamics are driven by internal energies and given external potentials, e.g. the porous medium equation and the fast diffusion equation. The key ingredient in our approach is the gradient flow structure of the dynamics. For discretization of the Lagrangian map, we use a finite subspace of linear maps in space and a variational form of the implicit Euler method in time. Thanks to that time discretisation, the fully discrete solution inherits energy estimates from the original gradient flow, and these lead to weak compactness of the trajectories in the continuous limit. Consistency is analyzed in the planar situation, . A variety of numerical experiments for the porous medium equation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A Lagrangian scheme
for the solution of nonlinear diffusion equations using moving simplex meshes
José A. Carrillo
José A. Carrillo, Department of Mathematics, Imperial College London, London, SW7 2AZ, UK
,
Bertram Düring
Bertram Düring, Department of Mathematics, University of Sussex, Pevensey II, Brighton, BN1 9QH, UK
,
Daniel Matthes
Daniel Matthes
Technische Universität München
Zentrum Mathematik
Boltzmannstraße 3
D-85747 Garching
and
David S. McCormick
David S. McCormick, Department of Mathematics, University of Sussex, Pevensey II, Brighton, BN1 9QH, UK
Abstract.
A Lagragian numerical scheme for solving nonlinear degenerate Fokker-Planck equations in space dimensions is presented. It applies to a large class of nonlinear diffusion equations, whose dynamics are driven by internal energies and given external potentials, e.g. the porous medium equation and the fast diffusion equation. The key ingredient in our approach is the gradient flow structure of the dynamics. For discretization of the Lagrangian map, we use a finite subspace of linear maps in space and a variational form of the implicit Euler method in time. Thanks to that time discretisation, the fully discrete solution inherits energy estimates from the original gradient flow, and these lead to weak compactness of the trajectories in the continuous limit. Consistency is analyzed in the planar situation, . A variety of numerical experiments for the porous medium equation indicates that the scheme is well-adapted to track the growth of the solution’s support.
DM was supported by the DFG Collaborative Research Center TRR 109, “Discretization in Geometry and Dynamics”. BD and DSMcC were supported by the Leverhulme Trust research project grant “Novel discretisations for higher-order nonlinear PDE” (RPG-2015-69).
1. Introduction
We study a Lagrangian discretization of the following type of initial value problem for a nonlinear Fokker–Planck equation:
[TABLE]
This problem is posed for the time-dependent probability density function , with initial condition . We assume that the pressure can be written in the form
[TABLE]
for some non-negative and convex and that is a non-negative potential without loss of generality.
Problem (1.1) encompasses a large class of diffusion equations, such as the heat equation (), the porous medium equation () and the fast diffusion equation (), and extends to related problems with given external potentials . To motivate our discretization, we first briefly recall the Lagrangian form of the dynamics: rewriting (1.1) as a transport equation, we obtain
[TABLE]
More general systems can be written in this form. For instance, interaction potentials leading to aggregation equations, relativistic heat equations, -Laplacian equations and Keller-Segel type models can also be included; see Carrillo and Moll [12] and the references therein for a good account of models enjoying this structural form. Here, we reduce to models with nonlinear degenerate diffusion, i.e. , and confining potentials in order to explore our new discretization.
The system (1.3) naturally induces a Lagrangian representation of the dynamics, which can be summarized as follows. Below, we use the notation for the push-forward of under a map ; the definition is recalled in (2.1).
Lemma 1.1**.**
Assume that is a smooth positive solution of (1.1). Let be a given reference density, and let be a given map such that . Further, let be the flow map associated to (1.3b), satisfying
[TABLE]
where and . Then
[TABLE]
at any .
In short, the solution to (1.4) is a Lagrangian map for the solution to (1.1). This fact is an immediate consequence of (1.3a); for convenience of the reader, we recall the proof in Appendix A. Notice that (1.5) can be substituted for in the expression (1.3b) for the velocity, which makes (1.4) an autonomous evolution equation for :
[TABLE]
A more explicit form of (1.6) is derived in (5.2).
There is a striking structural relation between (1.1) and (1.6): it is well-known (see Otto [30] or Ambrosio, Gigli and Savaré [1]) that (1.1) is a gradient flow for the relative Renyi entropy functional
[TABLE]
with respect to the -Wasserstein metric on the space of probability densities on . It appears to be less well known (see Evans et al. [18], Carrillo and Moll [12], or Carrillo and Lisini [11]) that also (1.6) is a gradient flow, namely for the functional
[TABLE]
on the Hilbert space of maps from to , where is a reference measure supported on . We shall discuss these gradient flow structures in more detail in Section 2 below.
The particular spatio-temporal discretization of the initial value problem (1.1) that we study in this paper is based on these facts. Instead of numerically integrating (1.1a) to obtain the density directly, we approximate the Lagrangian map — using a finite subspace of linear maps in space — which then allows us to recover a posteriori via (1.5). Our approximation for is constructed by solving a succession of minimization problems that are naturally derived from the gradient flow structure behind (1.4). Via variational methods, this provides compactness estimates on the discrete solutions.
The use of a finite subspace of linear maps for the Lagrangian maps has a geometric interpretation: the induced densities are piecewise constant on triangles whose vertices move in time. In the sequel, we further assume that in order to prevent the collapse of the images of our Lagrangian maps , as proven below. However, we do not yet know how to prevent (globally) the possible intersection of the images of the approximated Lagrangian maps.
This approach is alternative to the one developed by Carrillo et al. [14, 12], where is obtained by directly solving the PDE (1.6) numerically with finite differences or Galerkin approximation via finite element methods. In fact, the main difference between these two strategies can be summarized as follows: while Carrillo et al. [14, 12] follows the strategy minimize first then discretize, our present approach is to discretize first then minimize. In other words, in Carrillo et al. [14, 12] one minimizes first, obtaining as Euler-Lagrange equations the implicit Euler discretization of (1.6) in [14] approximated by the explicit Euler method in [12], and then discretized in space. In the present approach, we discretize first approximating the space of Lagrangian maps by a suitable finite subspace of linear maps, and then we minimize obtaining a nonlinear system of equations to find the approximated Lagrangian map within that set of linear maps.
Let us mention that other numerical methods have been developed to conserve particular properties of solutions of the gradient flow (1.1). Finite volume methods preserving the decay of energy at the semi-discrete level, along with other important properties like non-negativity and mass conservation, were proposed in the papers [4, 9]. Particle methods based on suitable regularisations of the flux of the continuity equation (1.1) have been proposed in the papers [16, 23, 24, 32]. A particle method based on the steepest descent of a regularized internal part of the energy in (1.7) by substituting particles by non-overlapping blobs was proposed and analysed in Carrillo et al. [10, 13]. Moreover, the numerical approximation of the JKO variational scheme has already been tackled by different methods using pseudo-inverse distributions in one dimension (see [5, 8, 20, 35]) or solving for the optimal map in a JKO step (see [3, 22]). Finally, note that gradient-flow-based Lagrangian methods in one dimension for higher-order, drift diffusion and Fokker-Planck equations have recently been proposed in the papers [17, 27, 28, 29].
There are two main arguments in favour of our taking this indirect approach of solving (1.6) instead of solving (1.1). The first is our interest in structure preserving discretizations: the scheme that we present builds on the non-obvious “secondary” gradient flow representation of (1.1) in terms of Lagrangian maps. The benefits are monotonicity of the transformed entropy functional and an -control on the metric velocity for our fully discrete solutions, that eventually lead to weak compactness of the trajectories in the continuous limit. We remark that our long-term goal is to design a numerical scheme that makes full use of the much richer “primary” variational structure of (1.1) in the Wasserstein distance, that is reviewed in Section 2 below. However, despite significant effort in the recent past — see, e.g., the references [3, 6, 13, 14, 17, 19, 22, 25, 31, 35] — it has not been possible so far to preserve features like metric contractivity of the flow under the discretization, except in the rather special situation of one space dimension (see Matthes and Osberger [25]). This is mainly due to the non-existence of finite-dimensional submanifolds of that are complete with respect to generalized geodesics.
The second motivation is that Lagrangian schemes are a natural choice for numerical front tracking, see, e.g., Budd [7] for first results on the numerical approximation of self-similar solutions to the porous medium equation. We recall that due to the assumed degeneracy of the diffusion in (1.1), solutions that are compactly supported initially remain compactly supported at all times. A numerically accurate calculation of the moving edge of support is challenging, since the solution can have a very complex behavior near that edge, like the waiting time phenomenon (see Vazquez [33]). Our simulation results for — that possess an analytically known, compactly supported, self-similar Barenblatt solution — indicated that our discretization is indeed able to track the edge of support quite accurately.
This work is organized as follows. In Section 2 we present an overview of previous results in gradient flows pertaining our work. Section 3 is devoted to the introduction of the linear set of Lagragian maps and the derivation of the numerical scheme. Section 4 shows the compactness of the approximated sequences of discretizations and we give conditions leading to the eventual convergence of the scheme towards (1.1). Section 5 deals with the consistency of the scheme in two dimensions while Section 6 gives several numerical tests showing the performance of this scheme.
2. Gradient flow structures
2.1. Notations from probability theory
is the space of probability measures on a given base set . We say that a sequence of measures in converges narrowly to a limit in that space if
[TABLE]
for all bounded and continuous functions . The push-forward of a measure under a measurable map is the uniquely determined measure such that, for all ,
[TABLE]
With a slight abuse of notation — identifying absolutely continuous measures with their densities — we denote the space of probability densities on of finite second moment by
[TABLE]
Clearly, the reference density , which is supported on the compact set , belongs to . If is a diffeomorphism onto its image (which is again compact), then the push-forward of ’s measure produces again a density , given by
[TABLE]
2.2. Gradient flow in the Wasserstein metric
Below, some basic facts about the Wasserstein metric and the formulation of (1.1) as gradient flow in that metric are briefly reviewed. For more detailed information, we refer the reader to the monographs of Ambrosio et al. [1] and Villani [34].
One of the many equivalent ways to define the -Wasserstein distance between is as follows:
[TABLE]
The infimum above is in fact a minimum, and the — essentially unique — optimal map is characterized by Brenier’s criterion; see, e.g., Villani [34, Section 2.1]. A trivial but essential observation is that if is a reference density with support , and with a measurable , then (2.2) can be re-written as follows:
[TABLE]
and the essentially unique minimizer in (2.3) is related to the optimal map in (2.2) via .
is a metric on ; convergence in is equivalent to weak- convergence in and convergence of the second moment. Since and hence also are of super-linear growth at infinity, each sublevel set is weak- closed and thus complete with respect to .
As already mentioned above, solutions to (1.1) constitute a gradient flow for the functional from (1.7) in the metric space . In fact, the flow is even -contractive as a semi-group, thanks to the -uniform displacement convexity of (see McCann [26], or Daneri and Savaré [15]), which is a strengthened form of -uniform convexity along geodesics. The -contractivity of the flow implies various properties (see Ambrosio et al. [1, Section 11.2]) like global existence, uniqueness and regularity of the flow, monotonicity of and its sub-differential, uniform exponential estimates on the convergence (if ) or divergence (if ) of trajectories, quantified exponential rates for the approach to equilibrium (if ) and the like.
An important further consequence is that the unique flow can be obtained as the limit for of the time-discrete minimizing movement scheme (see Ambrosio et al. [1] and Jordan, Kinderlehrer and Otto [21]):
[TABLE]
This time discretization is well-adapted to approximate -contractive gradient flows. All of the properties of mentioned above are already reflected on the level of these time-discrete solutions.
2.3. Gradient flow in
Equation (1.6) is the gradient flow of on the space of square integrable (with respect to ) maps (see Evans et al. [18] or Jordan et al. [22]). However, the variational structure behind this gradient flow is much weaker than above: most notably, is only poly-convex, but not -uniformly convex. Therefore, the abstract machinery for -contractive gradient flows in Ambrosio et al. [1] does not apply here. Clearly, by equivalence of (1.1) and (1.6) at least for sufficiently smooth solutions, certain properties of the primary gradient flow are necessarily inherited by this secondary flow, but for instance -contractivity of the flow in the -norm seems to fail.
Nevertheless, it can be proven (see Ambrosio, Lisini and Savaré [2]) that the gradient flow is globally well-defined, and it can again be approximated by the minimizing movement scheme:
[TABLE]
In fact, there is an equivalence between (2.5) and (2.4): simply substitute for and for in (2.4); notice that any can be written as with a suitable (highly non-unique) choice of . This equivalence was already exploited in Carrillo et al. [14, 12]. Thanks to the equality (2.3), the minimization with respect to can be relaxed to a minimization with respect to . Consequently, if , then at all discrete times . However, while the functional in (2.4) is -uniformly convex in along geodesics in , the functional in (2.5) has apparently no useful convexity properties in on .
3. Definition of the numerical scheme
Recall the Lagrangian formulation of (1.1) that has been given in Lemma 1.1. For definiteness, fix a reference density , whose support is convex and compact.
3.1. Discretization in space
Our spatial discretization is performed using a finite subspace of linear maps for the Lagrangian maps . More specifically: let be some (finite) simplicial decomposition of with nodes to and -simplices to . In the case , which is of primary interest here, is a triangulation, with triangles . The reference density is approximated by a density that is piecewise constant on the simplices of , with respective values
[TABLE]
The finite dimensional ansatz space is now defined as the set of maps that are globally continous, affine on each of the simplices , and orientation preserving. That is, on each , the map can be written as follows:
[TABLE]
with a suitable matrix of positive determinant and a vector .
For the calculations that follow, we shall use a more geometric way to describe the maps , namely by the positions of the images of each node . Denote by the space of -tuples of points with the same simplicial combinatorics (including orientation) as the in . Clearly, any is uniquely characterized by the -tuple of its values, and moreover, any defines a .
More explicitly, fix a , with nodes labelled to in some orientation preserving order, and respective image points to . With the standard -simplex given by
[TABLE]
introduce the linear interpolation maps and by
[TABLE]
Then the affine map (3.2) equals to . In particular, we obtain that
[TABLE]
For later reference, we give a more explicit representation for the transformed entropy for , and for the -distance between two maps . Substitution of the special form (3.2) into (1.8) produces
[TABLE]
with the internal energy (recall the definition of from (1.8))
[TABLE]
and the potential energy
[TABLE]
For the -difference of and , we have
[TABLE]
Using Lemma B.1, we obtain on each simplex :
[TABLE]
3.2. Discretization in time
Let a time step be given; in the following, we symbolize the spatio-temporal discretization by , and we write for the joint limit of and vanishing mesh size in .
The discretization in time is performed in accordance with (2.5): we modify from (2.5) by restriction to the ansatz space . This leads to the minimization problem
[TABLE]
For a fixed discretization , the fully discrete scheme is well-posed in the sense that for a given initial map , an associated sequence can be determined by successive solution of the minimization problems (3.7). One only needs to verify:
Lemma 3.1**.**
For each given , there exists at least one global minimizer of .
Remark 3.2**.**
We do not claim uniqueness of the minimizers. Unfortunately, the minimization problem (3.7) inherits the lack of convexity from (2.5), whereas the correspondence between (2.5) and the convex problem (2.4) is lost under spatial discretization. A detailed discussion of ’s (non-)convexity is provided in Appendix C.
Proof of Lemma 3.1.
We only sketch the main arguments. For definiteness, let us choose (just for this proof) one of the infinitely many equivalent norm-induced metrics on the -dimensional vector space of all continuous maps that are piecewise affine with respect to the fixed simplicial decomposition : given with their respective point locations , i.e., for , define the distance between these maps as the maximal -distance of corresponding points , . Clearly, this metric makes a complete space.
It is easily seen that the subset — which is singled out by requiring orientation preservation of the ’s — is an open subset of . It is further obvious that the map is continuous with respect to the metric. The claim of the lemma thus follows if we can show that the sub-level
[TABLE]
is a non-empty compact subset of . Clearly, , so it suffices to verify compactness.
is bounded. We are going to show that there is a radius such that no has a distance larger than to . From non-negativity of , and from the representations (3.5) and (3.6), it follows that
[TABLE]
where . It is now easy to compute a suitable value for the radius .
is a closed subset of . It suffices to show that the limit of any sequence of maps belongs to . By definition of our metric on , global continuity and piecewise linearity of the trivially pass to the limit . We still need to verify that is orientation-preserving. Fix a simplex and consider the corresponding matrices and from (3.2). Since the converge to in the metric, also entry-wise. Now, by non-negativity of , we have for all that
[TABLE]
and since as , it follows that is bounded away from zero, uniformly in . But then also , i.e., the th linear map piece of the limit preserves orientation. ∎
3.3. Fully discrete equations
We shall now derive the Euler-Lagrange equations associated to the minimization problem (3.7), i.e., for each given , we calculate the variations of with respect to the degrees of freedom of . Since that function is a weighted sum over the triangles , it suffices to perform the calculations for one fixed triangle , with respective nodes to , in positive orientation. The associated image points are to . Since we may choose any vertex to be labelled , it will suffice to perform the calculations at one fixed image point .
- •
mass term:
[TABLE]
- •
internal energy: observing that — recall (1.2) —
[TABLE]
we obtain
[TABLE]
where
[TABLE]
is the uniquely determined vector in that is orthogonal to the -simplex with corners to (pointing away from ) and whose length equals the -volume of that simplex.
- •
potential energy:
[TABLE]
Now let be a fixed vertex of . Summing over all simplices that have as a vertex, and choosing vertex labels in accordance with above, i.e., such that in , produces the following Euler-Lagrange equation:
[TABLE]
3.4. Approximation of the initial condition
For the approximation of the initial datum , we require:
- •
converges to narrowly;
- •
is -uniformly bounded, i.e.,
[TABLE]
In our numerical experiments, we always choose , in which case can be taken as the identity on , and we choose accordingly as the identity as well. Hence , which converges to even strongly in . Moreover, since is convex, it easily follows from Jensen’s inequality that
[TABLE]
and therefore,
[TABLE]
In more general situations, in which is not the identity, a sequence of approximations of is needed. Pointwise convergence is more than sufficient to guarantee narrow convergence of to , but the uniform bound (3.11) might require a well-adapted approximation, especially for non-smooth ’s.
4. Limit trajectory
In this section, we assume that a sequence of vanishing discretizations is given, and we study the respective limit of the fully discrete solutions that are produced by the inductive minimization procedure (3.7). For the analysis of that limit trajectory, it is more natural to work with the induced densities and velocities,
[TABLE]
instead of the Lagrangian maps themselves. Note that is only well-defined on the support of — that is, on the image of — and can be assigned arbitrary values outside. Let us introduce the piecewise constant in time interpolations , and as usual,
[TABLE]
Note that and at each .
4.1. Energy estimates
We start by proving the classical energy estimates on minimizing movements for our fully discrete scheme.
Lemma 4.1**.**
For each discretization and for any indices , one has the a priori estimate
[TABLE]
Consequently:
- (1)
* is monotonically decreasing, i.e., for all ;* 2. (2)
* is Hölder--continuous in , up to an error ,*
[TABLE] 3. (3)
* is square integrable with respect to ,*
[TABLE]
Proof.
By the definition of as a minimizer, we know that for any , and in particular for the choice , which yields:
[TABLE]
Summing these inequalies for , recalling that by (1.8) and that by (2.3), produces (4.1).
Monotonicity of in time is obvious.
To prove (4.2), choose such that and . Notice that . If , the claim (4.2) is obviously true; let in the following. Combining the triangle inequality for the metric , estimate (4.1) above and Hölder’s inequality for sums, we arrive at
[TABLE]
Finally, changing variables using in (4.4) yields
[TABLE]
and summing these inequalities from to yields (4.3). ∎
4.2. Compactness of the trajectories and weak formulation
Our main result on the weak limit of is the following.
Theorem 4.2**.**
Along a suitable sequence , the curves convergence pointwise in time, i.e., narrowly for each , towards a Hölder--continuous limit trajectory .
Moreover, the discrete velocities possess a limit such that in , and the continuity equation
[TABLE]
holds in the sense of distributions.
Remark 4.3**.**
The Hölder continuity of implies that satisfies the initial condition (1.1b) in the sense that narrowly as .
Proof of Theorem 4.2.
We closely follow an argument that is part of the general convergence proof for the minimizing movement scheme as given in Ambrosio et al. [1, Section 11.1.3]. Below, convergence is shown for some arbitrary but fixed time horizon ; a standard diagonal argument implies convergence at arbitrary times.
First observe that by estimate (4.2) — applied with — it follows that is bounded, uniformly in and in . Since further converges narrowly to by our hypotheses on the initial approximation, we conclude that all densities belong to a sequentially compact subset for the narrow convergence. The second observation is that the term on the right hand side of (4.2) simplifies to in the limit . A straightforward application of the “refined version” of the Ascoli-Arzelà theorem (Proposition 3.3.1 in Ambrosio et al. [1]) yields the first part of the claim, namely the pointwise narrow convergence of towards a Hölder continuous limit curve .
It remains to pass to the limit with the velocity . Towards that end, we define a probability measure on the set as follows:
[TABLE]
for every bounded and continuous function . For brevity, let be the -marginals of , that have respective Lebesgue densities on . Thanks to the result from the first part of the proof, converges narrowly to a limit , which has density . On the other hand, the estimate (4.3) implies that
[TABLE]
We are thus in the position to apply Theorem 5.4.4 in Ambrosio et al. [1], which yields the narrow convergence of towards a limit . Clearly, the -marginal of is . Accordingly, we introduce the disintegration of with respect to , which is well-defined -a.e.. Below, it will turn out that ’s -barycenter,
[TABLE]
is the sought-for weak limit of . The convergence and the inheritance of the uniform -bound (4.3) to the limit are further direct consequences of Theorem 5.4.4 in Ambrosio et al. [1].
The key step to establish the continuity equation for the just-defined is to evaluate the limit as of
[TABLE]
for any given test function in two different ways. First, we change variables in the second integral, which gives
[TABLE]
For the second evaluation, we write
[TABLE]
and substitute accordingly in the second integral, leading to
[TABLE]
The error term above is controlled via Taylor expansion of and by using (4.3),
[TABLE]
Equality of the limits for both evaluations of for arbitrary test functions shows the continuity equation (4.5). ∎
Unfortunately, the convergence provided by Theorem 4.2 is generally not sufficient to conclude that is a weak solution to (1.1), since we are not able to identify as from (1.3b). The problem is two-fold: first, weak- convergence of is insufficient to pass to the limit inside the nonlinear function . Second, even if we would know that, for instance, , we would still need a -independent a priori control on the regularity (e.g., maximal diameter of triangles) of the meshes generated by the to justify the passage to limit in the weak formulation below.
The main difficulty in the weak formulation that we derive now is that we can only use “test functions” that are piecewise affine with respect to the changing meshes generated by the . For definiteness, we introduce the space
[TABLE]
Lemma 4.4**.**
Assume is such that . Then:
[TABLE]
Proof.
For all sufficiently small , let . By definition of as a minimizer, we have that . This implies that
[TABLE]
We discuss limits of the three terms under the integral for . For the metric term:
[TABLE]
and since is bounded, the last term vanishes uniformly on for . For the internal energy, since , and recalling (3.8),
[TABLE]
Since the piecewise constant function has a positive lower bound, the convergence as is uniform on . Finally, for the potential energy,
[TABLE]
Again, the convergence is uniform on . Passing to the limit in the integral (4.8) yields
[TABLE]
The same inequality is true with in place of , hence this inequality is actually an equality. Since , a change of variables produces (4.7). ∎
Corollary 4.5**.**
In addition to the hypotheses of Theorem 4.2, assume that
- (1)
* in ;* 2. (2)
each is injective; 3. (3)
as , all simplices in the images of under have non-degenerate interior angles and tend to zero in diameter, uniformly w.r.t. .
Then satisfies the PDE
[TABLE]
in the sense of distributions.
Proof.
Let a smooth test function be given. For each and each , a with can be constructed in such a way that
[TABLE]
uniformly on , and uniformly in as . This follows from our hypotheses on the -uniform regularity of the Lagrangian meshes: inside the image of , one can simply choose as the affine interpolation of the values of at the points . Outside, one can take an arbitrary approximation of that is compatible with the piecewise-affine approximation on the boundary of ’s image; one may even choose at sufficient distance to that boundary. The uniform convergences (4.10) then follow by standard finite element analysis.
Further, let be given. For each , substitute into (4.7). Integration of these equalities with respect to yields
[TABLE]
We pass to the limit in these integrals. For the first, we use that by hypothesis, for the last, we use Theorem 4.2 above. Since any test function can be approximated in by linear combinations of products as above, we thus obtain the weak formulation of
[TABLE]
In combination with the continuity equation (4.5), we arrive at (4.9). ∎
Remark 4.6**.**
In principle, our discretization can also be applied to the linear Fokker-Planck equation with and . In that case, one automatically has thanks to Theorem 4.2. Corollary 4.5 above then provides an a posteriori criterion for convergence: if the Lagrangian mesh does not deform too wildly under the dynamics as the discretization is refined, then the discrete solutions converge to the genuine solution.
5. Consistency in 2D
In this section, we prove consistency of our discretization in the following sense. Under certain conditions on the spatial discretization , any smooth and positive solution to the initial value problem (1.1) projects to a discrete solution that satisfies the Euler-Lagrange equations up to a controlled error. We restrict ourselves to dimensions.
5.1. Smooth Lagrangian evolution
First, we derive an alternative form of the velocity field from (1.3b) in terms of .
Lemma 5.1**.**
For with a smooth diffemorphism , we have
[TABLE]
Consequently, the Lagrangian map — relative to the reference density — for a smooth solution to (1.1) satisfies
[TABLE]
Proof.
On the one hand,
[TABLE]
and on the other hand, by definition of the push forward,
[TABLE]
Hence
[TABLE]
Observing that (1.2) implies that , we conclude (5.2) directly from (1.3b). ∎
5.2. Discrete Euler-Lagrange equations in dimension
In the planar case , the Euler-Lagrange equation (3.10) above can be rewritten in a more convenient way.
In the following, fix some vertex of the triangulation, which is indicent to precisely six triangles. For convenience, we assume that these are labelled to in counter-clockwise order. Similarly, the six neighboring vertices are labeled to in counter-clockwise order, so that has vertices and , where we set .
Using these conventions and recalling Lemma B.2, the expression for the vector in (3.9) simplifies to
[TABLE]
Summing the Euler-Lagrange equation (3.10) over to , we obtain
[TABLE]
where the momentum term and the impulse , respectively, are given by
[TABLE]
We shall now prove our main result on consistency. The setup is the following: a sequence of triangulations on , parametrized by , and a sequence of time steps are given. We assume that there is an -independent region on which the are almost hexagonal in the following sense: each node of has precisely six neighbors — labelled to in counter-clockwise order — and there exists a rotation such that
[TABLE]
for .
Now, let be a given smooth solution to the Lagrangian evolution equation (5.2), and fix a time . For all sufficiently small , we define maps by linear interpolation of the values of and , respectively, on . That is, and , at all nodes in . Theorem 5.2 below states that the pair is an approximate solution to the discrete Euler-Lagrange equations (5.3) at all nodes of the respective triangulation that lie in .
The hexagonality hypothesis on the is strong, but some very strong restriction of ’s geometry is apparently necessary. See Remark 5.4 following the proof for further discussion.
Theorem 5.2**.**
Under the hypotheses and with the notations introduced above, the Euler-Lagrange equation (5.3) admits the following asymptotic expansion:
[TABLE]
as , uniformly at the nodes of the respective .
Remark 5.3**.**
Up to an error , the geometric pre-factor equals to one third of the total area of the hexagon with vertices to , and is thus equal to the integral of the piecewise affine hat function with peak at .
Proof of Theorem 5.2.
Throughout the proof, let be fixed; we shall omit the -index for and . First, we fix a node of . Thanks to the equivariance of both (5.2) and (5.3) under rigid motions of the domain, we may assume that in (5.7) is the identity, and that .
We collect some relations that are helpful for the calculations that follow. Trivially,
[TABLE]
Moreover, we have that
[TABLE]
On the other hand, by definition of in (3.1), it follows that
[TABLE]
Combining (5.10) and (5.11) yields
[TABLE]
In accordance with the definition of and from detailed above, let and , and define , for in the analogous way. Further, we introduce , , .
To perform an expansion in the momentum term, first observe that
[TABLE]
for each , and so, using that by hypothesis,
[TABLE]
Using (5.12) and then (5.9) yields
[TABLE]
This is (5.8a).
For the impulse term, we start with a Taylor expansion to second order in space:
[TABLE]
We combine this with the observation that to obtain:
[TABLE]
where
[TABLE]
Plugging this in leads to
[TABLE]
where we have use the auxiliary algebraic results from Lemma B.2, Lemma B.3, and Lemma B.4.
For the remaining part of the impulse term, a very rough approximation is sufficient:
[TABLE]
holds for any that is a convex combination of , where the implicit constant is controlled in terms of the supremum of and on . With that, we simply have, using again (5.12):
[TABLE]
Together, this yields (5.8b). ∎
Remark 5.4**.**
The hypotheses of Theorem (5.2) require that the are almost hexagonal on . This seems like a technical hypothesis that simplifies calculations, but apparently, some strong symmetry property of the is necessary for the validity of the result.
To illustrate the failure of consistency — at least in the specific form considered here — assume that and , and consider a sequence of triangulations for which there is a node such that (5.7) holds with the being replaced by a different six-tuple of vectors . Repeating the steps of the proof above, it is easily seen that , with an -independent constant in place of , and that
[TABLE]
with
[TABLE]
If a result of the form (5.8b) — with replaced by — was true, then this implies in particular that
[TABLE]
holds with some constant for arbitrary matrices of positive determinant and tensors that are symmetric in the second and third component. A specific example for which (5.13) is not true is given by
[TABLE]
in combination with , and a that is zero except for two ones, at the positions and . In Lemma B.5, we show that the left-hand side in (5.13) equals to ; on the other hand, the right-hand side is clearly zero.
Note that this counter-example is significant, insofar as the skew (in fact, degenerate) hexagon described by the in (5.14) corresponds to a popular method for triangulation of the plane.
6. Numerical simulations in
6.1. Implementation
The Euler-Lagrange equations for the -dimensional case have been derived in (5.3). We perfom a small modification in the potential term in order to simplify calculations with presumably minimal loss in accuracy:
[TABLE]
with the short-hand notation
[TABLE]
On the main diagonal, the Hessian amounts to
[TABLE]
Off the main diagonal, the entries of the Hessian are given by
[TABLE]
The scheme consists of an inner (Newton) and an outer (time stepping) iteration. We start from a given initial density and define the solution at the next time step inductively by applying Newton’s method in the inner iteration. To this end we initialise with , the solution at the th time step, and define inductively
[TABLE]
where the update is the solution to the linear system
[TABLE]
The effort of each inner iteration step is essentially determined by the effort to invert the sparse matrix . As soon as the norm of drops below a given stopping threshold, define as approximate solution in the st time step.
In all experiments the stopping criterion in the Newton iteration is set to .
6.2. Numerical experiments
In this section we present results of our numerical experiments for (1.1) with a cubic porous-medium nonlinearity and different choices for the external potential ,
[TABLE]
Numerical experiment 1: unconfined evolution of Barenblatt profile
As a first example, we consider the “free” cubic porous medium equation, that is (6.1) with . It is well-known (see, e.g., Vazquez [33]) that in the long-time limit , arbitrary solutions approach a self-similar one,
[TABLE]
where is the associated Barenblatt profile
[TABLE]
where is chosen to normalize ’s mass to unity.
In this experiment, we are only interested in the quality of the numerical approximation for the self-similar solution (6.2). To reduce numerical effort, we impose a four-fold symmetry of the approximation: we use the quarter circle as computational domain , and interprete the discrete function thereon as one of four symmetric pieces of the full discrete solution. To preserve reflection symmetry over time, homogeneous Neumann conditions are imposed on the artificial boundaries. This is implemented by reducing the degrees of freedom of the nodes along the - and -axes to tangential motion.
We initialize our simulation with a piecewise constant approximation of the profile of from (6.3) at time . We choose a time step and the final time . In Figure 1, we have collected snapshots of the approximated density at different instances of time. The Barenblatt profile of the solution is very well pertained over time.
Remark 6.1**.**
It takes less than 2 minutes to complete this simulation on standard laptop (Matlab code on a mid-2013 MacBook Air 11” with 1.7 GHz Intel Core i7 processor).
Figure 2 shows surface plots of the discrete solution at different times in comparison with the Barenblatt profile at the respective time. By construction of the scheme, the initial mass is exactly conserved in time as the discrete solution propagates. The left plot in Figure 3 shows the decay in the energy and gives quantitative information about the difference of the discrete solution to the analytical Barenblatt solution. The numerical solution shows good agreement with the analytical energy decay rate .
We also compute the -error of the discrete solution to the exact Barenblatt profile and observe that it remains within the order of the fineness of the triangulation. The mass of the discrete solution is perfectly conserved, as guaranteed by the construction of our method.
To estimate the convergence order of our method, we run several experiments with the above initial data on different meshes. We fix the ratio and compute the -error at time on triangulations with We expect the error to decay as a power of . The double logarithmic plot should reveal a line with its slope indicating the numerical convergence order. The right plot in Figure 3 shows the result, the estimated numerical convergence order which is obtained from a least-squares fitted line through the points is equal to . This indicates first order convergence of the scheme with respect to the spatial discretisation parameter .
Numerical experiment 2: Asymptotic self-similarity
In our second example, we are still concerned with the free cubic porous medium equation, (6.1) with . This time, we wish to give an indication that the discrete approximation of the self-similar solution from (6.2) from the previous experiment might inherit the global attractivity of its continuous counterpart. More specifically, we track the discrete evolution for the initial datum
[TABLE]
until time and observe that it appears to approach the self-similar solution from above. Snapshots of the simulation are collected in Figure 4.
Numerical experiment 3: two peaks merging into one under the influence of a confining potential
In this example we consider as initial condition two peaks, connected by a thin layer of mass, given by
[TABLE]
We choose a triangulation of the square and initialise the discrete solution piecewise constant in each triangle, with a value corresponding to (6.5), evaluated in the centre of mass of each triangle. We solve the porous medium equation with a confining potential, i.e. (1.1) with and . The time step is and the final time is
Figure 5 shows the evolution from the initial density. As time increases the peaks smoothly merge into each other. As the thin layer around the peaks is also subject to the potential the triangulated domain shrinks in time. Even if we do not know how to prevent theoretically the intersection of the images of the discrete Lagrangian maps, this seems not to be a problem in practice.
As time evolves, the discrete solution approaches the steady state Barenblatt profile given by
[TABLE]
where is chosen as the mass of the density. The plot in Figure 6 shows the exponential decay of the -distance of the discrete solution to the steady state Barenblatt profile (6.6). We observe that the decay agrees very well with the analytically predicted decay until . For larger times, one would monitor triangle quality numerically, and re-mesh, locally coarsening the triangulation where necessary.
Numerical experiment 4: one peak splitting under the influence of a quartic potential
We consider as the initial condition
[TABLE]
We choose a triangulation of the unit circle and initialise the discrete solution piecewise constant in each triangle, with a value corresponding to (6.7), evaluated in the centre of mass of each triangle. We solve the porous medium equation with a quartic potential, i.e. (1.1) with and . The time step is and the final time is
Figure 7 shows the evolution of the initial density. As time increases the initial density is progressively split, until two new maxima emerge which are connected by a thin layer. For larger times, when certain triangles become excessively distorted, one would monitor triangle quality numerically, and re-mesh, locally refining the triangulation where necessary.
Appendix A Proof of the Lagrangian representation
Proof of Lemma 1.1.
We verify that the density function given by on is constant with respect to time ; the identity (1.5) then follows since
[TABLE]
Firstly, from the definition of the inverse,
[TABLE]
for all , differentiating with respect to time yields
[TABLE]
and so, using (1.4) and (1.3b),
[TABLE]
Now, let be a smooth test function, and consider
[TABLE]
As was arbitrary, is constant with respect to time. ∎
Appendix B Technical lemmas
Lemma B.1**.**
Given , then
[TABLE]
Proof.
Thanks to the symmetry of the integral with respect to the exchange of the components , the left-hand side of (B.1) equals to
[TABLE]
We calculate the integrals, using Fubini’s theorem. First integral:
[TABLE]
Second integral:
[TABLE]
Third integral:
[TABLE]
Substitute this into (B.2):
[TABLE]
Collecting terms yields the right-hand side of (B.1). ∎
Lemma B.2**.**
For each , we have .
Proof.
This is verified by direct calculation:
[TABLE]
Lemma B.3**.**
With defined as in (5.7), we have that
[TABLE]
Proof.
With the abbreviations and :
[TABLE]
Lemma B.4**.**
Let the scheme of eight numbers be symmetric in the last two indices, . With defined as in (5.7), we have that
[TABLE]
Proof.
In principle, this lemma can be verified by a direct calculation, by writing out the six terms in the sum explicitly and using trigonometric identities. Below, we give a slightly more conceptual proof, in which we use symmetry arguments to reduce the number of expressions significantly.
For the matrix involving , we obtain
[TABLE]
while clearly
[TABLE]
The sum of the diagonal entries of the matrix product are easily calculated,
[TABLE]
with the trigonometric expressions
[TABLE]
To key step is to calculate the sum over of the products of with the respective vector
[TABLE]
Several simplifications of this sum can be performed, thanks to the particular form of the and elementary trigonometric identities. First, observe that , and hence that . Since further , it follows that
[TABLE]
Second, can be evaluated explicitly for :
[TABLE]
Third, since and , as well as and , we obtain that
[TABLE]
By putting this together, we arrive at
[TABLE]
where if is even, and if is odd. By elementary computations,
[TABLE]
and so the final result is:
[TABLE]
which is (B.3). ∎
Lemma B.5**.**
With defined as in (5.14), and with such that except for , we have that
[TABLE]
Proof.
This is a slightly tedious, but straightforward calculation. First, by the choice of ,
[TABLE]
and so, by definition of the in (5.14),
[TABLE]
For the inverse matrices S_{k}:=\big{(}\sigma_{k}^{\prime}\big{|}\sigma_{k+1}^{\prime}\big{)}^{-1}, we obtain
[TABLE]
For the traces T_{k}:=\operatorname{tr}\big{[}S_{k}\beta_{k}\big{]}, we thus obtain the values:
[TABLE]
In conclusion,
[TABLE]
which is (B.7). ∎
Appendix C Lack of convexity
Below, we discuss why the minimization problem (3.7) is not convex. More precisely, we show that is not convex as a function of on the affine ansatz space . Since is a convex combination of the expressions \mathbb{H}_{m}\big{(}(A_{m}|b_{m});(\hat{A}_{m}|\hat{b}_{m})\big{)}, it clearly suffices to discuss the convexity of the latter.
We consider a curve and evaluate the second derivatives of the components of the functional at . First,
[TABLE]
Second,
[TABLE]
If we assume that , then we obtain for the sum of these two contributions that
[TABLE]
For the remaining term, however, we obtain — using the abbreviations and — that
[TABLE]
Now observe that is a non-negative, and is a non-positive function. Thus, from the two terms in the final sum, the first one is generally non-negative whereas the second one is of indefinite sign. Choosing
[TABLE]
the sum is obviously negative.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] L. Ambrosio, N. Gigli, and G. Savaré. Gradient flows in metric spaces and in the space of probability measures . Lectures in Mathematics ETH Zürich. Birkhäuser Verlag, Basel, second edition, 2008.
- 2[2] L. Ambrosio, S. Lisini, and G. Savaré. Stability of flows associated to gradient vector fields and convergence of iterated transport maps. manuscripta mathematica , 121(1):1–50, 2006.
- 3[3] J.-D. Benamou, G. Carlier, Q. Mérigot, and E. Oudet. Discretization of functionals involving the monge–ampère operator. Numerische Mathematik , pages 1–26, 2015.
- 4[4] M. Bessemoulin-Chatard and F. Filbet. A finite volume scheme for nonlinear degenerate parabolic equations. SIAM J. Sci. Comput. , 34(5):B 559–B 583, 2012.
- 5[5] A. Blanchet, V. Calvez, and J. A. Carrillo. Convergence of the mass-transport steepest descent scheme for the sub-critical Patlak-Keller-Segel model. SIAM J. Numer. Anal. , 46(2):691–721, 2008.
- 6[6] A. Blanchet, V. Calvez, and J. A. Carrillo. Convergence of the mass-transport steepest descent scheme for the subcritical Patlak-Keller-Segel model. SIAM J. Numer. Anal. , 46(2):691–721, 2008.
- 7[7] C. J. Budd, G. J. Collins, W. Z. Huang, and R. D. Russell. Self-similar numerical solutions of the porous-medium equation using moving mesh methods. R. Soc. Lond. Philos. Trans. Ser. A Math. Phys. Eng. Sci. , 357(1754):1047–1077, 1999.
- 8[8] V. Calvez and T. O. Gallouët. Particle approximation of the one dimensional Keller-Segel equation, stability and rigidity of the blow-up. Discrete Contin. Dyn. Syst. Ser. A , 36(3):1175–1208, 2015.
