Nonlinear Fokker-Planck equations with reaction as gradient flows of the   free energy

Stanislav Kondratyev; Dmitry Vorotnikov

arXiv:1706.08957·math.FA·August 13, 2019

Nonlinear Fokker-Planck equations with reaction as gradient flows of the free energy

Stanislav Kondratyev, Dmitry Vorotnikov

PDF

TL;DR

This paper interprets a class of nonlinear Fokker-Planck equations with reaction as gradient flows on the space of measures using the Hellinger-Kantorovich distance, establishing convergence to equilibrium and applications in ecology.

Contribution

It introduces a new gradient flow framework for nonlinear Fokker-Planck equations with reaction terms, without requiring convexity of the entropy, and proves convergence results.

Findings

01

Proves entropic exponential convergence to equilibrium.

02

Establishes new dissipation inequalities controlling entropy.

03

Provides existence of weak solutions under mild conditions.

Abstract

We interpret a class of nonlinear Fokker-Planck equations with reaction as gradient flows over the space of Radon measures equipped with the recently introduced Hellinger-Kantorovich distance. The driving entropy of the gradient flow is not assumed to be geodesically convex or semi-convex. We prove new generalized dissipation inequalities, which allow us to control the relative entropy by its production. We establish the entropic exponential convergence of the trajectories of the flow to the equilibrium. Along with other applications, this result has an ecological interpretation as a trend to the ideal free distribution for a class of fitness-driven models of population dynamics. Our existence theorem for weak solutions under mild assumptions on the nonlinearity is new even in the absence of the reaction term.

Equations441

\partial_{t} u

\partial_{t} u

u \frac{\partial f}{\partial ν}

u

f \in C^{2} (\overline{Ω} \times (0, \infty)) \cap L_{loc}^{1} (\overline{Ω} \times [0, \infty))

f \in C^{2} (\overline{Ω} \times (0, \infty)) \cap L_{loc}^{1} (\overline{Ω} \times [0, \infty))

u f, u f_{x} \in C (\overline{Ω} \times [0, \infty))

f_{u} < 0,

u \to \infty lim sup f (x, u) < 0 \forall x \in \overline{Ω},

u \to + 0 lim inf f (x, u) > 0 \forall x \in \overline{Ω},

∣ f (x, u) ∣ + u ∣ f_{u} (x, u) ∣ + u ∣ f_{xu} (x, u) ∣ \leq g (u) a. \leavevmode \nobreak a. u > 0; g \in L_{loc}^{1} [0, \infty),

\displaystyle(uf_{x})\big{|}_{u=0}=0.

f_{x} = 0

f_{x} = 0

f_{x} = 0

f (x, m (x)) = 0.

f (x, m (x)) = 0.

Φ (x, u) = - \int_{0}^{u} ξ f_{u} (x, ξ) d ξ, Ψ (x, u) = \int_{0}^{u} Φ (x, ξ) d ξ .

Φ (x, u) = - \int_{0}^{u} ξ f_{u} (x, ξ) d ξ, Ψ (x, u) = \int_{0}^{u} Φ (x, ξ) d ξ .

Φ (x, 0) = Ψ (x, 0) = 0, Φ_{u} = - u f_{u}, Φ_{x} = - \int_{0}^{u} ξ f_{xu} (x, ξ) d ξ, Ψ_{u} = Φ.

Φ (x, 0) = Ψ (x, 0) = 0, Φ_{u} = - u f_{u}, Φ_{x} = - \int_{0}^{u} ξ f_{xu} (x, ξ) d ξ, Ψ_{u} = Φ.

\partial_{t} u = ΔΦ - div (Φ_{x} + u f_{x}) + u f,

\partial_{t} u = ΔΦ - div (Φ_{x} + u f_{x}) + u f,

\partial_{t} \int_{Ω} Ψ d x = - \int_{Ω} ∣\nablaΦ ∣^{2} d x + \int_{Ω} (Φ_{x} + u f_{x}) \cdot \nablaΦ d x + \int_{Ω} u f Φ d x .

\partial_{t} \int_{Ω} Ψ d x = - \int_{Ω} ∣\nablaΦ ∣^{2} d x + \int_{Ω} (Φ_{x} + u f_{x}) \cdot \nablaΦ d x + \int_{Ω} u f Φ d x .

W (u) = \int_{Ω} Ψ (x, u (x)) d x

W (u) = \int_{Ω} Ψ (x, u (x)) d x

\int_{0}^{T} \int_{Ω} (u \partial_{t} φ + (- \nablaΦ + Φ_{x} + u f_{x}) \cdot \nabla φ + f u φ) d x d t = \int_{Ω} u^{0} (x) φ (x, 0) d x

\int_{0}^{T} \int_{Ω} (u \partial_{t} φ + (- \nablaΦ + Φ_{x} + u f_{x}) \cdot \nabla φ + f u φ) d x d t = \int_{Ω} u^{0} (x) φ (x, 0) d x

E (x, u) = - \int_{m (x)}^{u} f (x, ξ) d ξ .

E (x, u) = - \int_{m (x)}^{u} f (x, ξ) d ξ .

E (u) = \int_{Ω} E (x, u (x)) d x .

E (u) = \int_{Ω} E (x, u (x)) d x .

\partial_{t} E (u) = - \int_{Ω} u (f^{2} + ∣\nabla f ∣^{2}) d x .

\partial_{t} E (u) = - \int_{Ω} u (f^{2} + ∣\nabla f ∣^{2}) d x .

u ∣\nabla f ∣^{2} = \frac{1}{u} ∣ - \nablaΦ + Φ_{x} + u f_{x} ∣^{2} (u > 0) .

u ∣\nabla f ∣^{2} = \frac{1}{u} ∣ - \nablaΦ + Φ_{x} + u f_{x} ∣^{2} (u > 0) .

D E (u) = \int_{Ω} u f^{2} d x + \int_{[u > 0]} \frac{1}{u} ∣ - \nablaΦ + Φ_{x} + u f_{x} ∣^{2} d x,

D E (u) = \int_{Ω} u f^{2} d x + \int_{[u > 0]} \frac{1}{u} ∣ - \nablaΦ + Φ_{x} + u f_{x} ∣^{2} d x,

\partial_{t} E (u) = - D E (u) .

\partial_{t} E (u) = - D E (u) .

\partial_{t}\mathcal{W}(u)\leq\int_{\Omega}\big{(}-|\nabla\Phi|^{2}+(\Phi_{x}+uf_{x})\cdot\nabla\Phi+uf\Phi\big{)}\,\mathrm{d}x

\partial_{t}\mathcal{W}(u)\leq\int_{\Omega}\big{(}-|\nabla\Phi|^{2}+(\Phi_{x}+uf_{x})\cdot\nabla\Phi+uf\Phi\big{)}\,\mathrm{d}x

\partial_{t} E (u) \leq - D E (u) .

\partial_{t} E (u) \leq - D E (u) .

\displaystyle-\int_{0}^{T}\chi^{\prime}(t)\mathcal{W}(u)\,\mathrm{d}t\leq\iint_{Q_{T}}\chi(t)\big{(}-|\nabla\Phi|^{2}+(\Phi_{x}+uf_{x})\cdot\nabla\Phi+uf\Phi\big{)}\,\mathrm{d}x\,\mathrm{d}t,

\displaystyle-\int_{0}^{T}\chi^{\prime}(t)\mathcal{W}(u)\,\mathrm{d}t\leq\iint_{Q_{T}}\chi(t)\big{(}-|\nabla\Phi|^{2}+(\Phi_{x}+uf_{x})\cdot\nabla\Phi+uf\Phi\big{)}\,\mathrm{d}x\,\mathrm{d}t,

\int_{0}^{T} χ^{'} (t) E (u) d t \geq \int_{0}^{T} χ (t) D E (u) d t .

u \in U in f ∥ u ∥_{L^{1} (Ω)} > 0.

u \in U in f ∥ u ∥_{L^{1} (Ω)} > 0.

E (u) \leq C_{U} D E (u) (u \in U) .

E (u) \leq C_{U} D E (u) (u \in U) .

∥ u ∥_{L^{\infty} (Ω \times (0, \infty))} \leq in f {ξ \geq 0 : x \in Ω sup f (x, ξ) \leq - x \in Ω ess sup f^{-} (x, u^{0} (x))};

∥ u ∥_{L^{\infty} (Ω \times (0, \infty))} \leq in f {ξ \geq 0 : x \in Ω sup f (x, ξ) \leq - x \in Ω ess sup f^{-} (x, u^{0} (x))};

t \to + 0 ess lim sup W (u (t)) \leq W (u^{0});

t \to + 0 ess lim sup W (u (t)) \leq W (u^{0});

t > 0 ess sup E (u (t)) \leq E (u^{0});

t > 0 ess sup E (u (t)) \leq E (u^{0});

∥ u (t) ∥_{L^{1} (Ω)} \geq ∥ min (u^{0}, m) ∥_{L^{1} (Ω)} a. \leavevmode \nobreak a. t > 0.

∥ u (t) ∥_{L^{1} (Ω)} \geq ∥ min (u^{0}, m) ∥_{L^{1} (Ω)} a. \leavevmode \nobreak a. t > 0.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Nonlinear Fokker-Planck equations with

reaction as gradient flows of the free energy

Stanislav Kondratyev

CMUC, Department of Mathematics, University of Coimbra, 3001-501 Coimbra, Portugal

[email protected]

and

Dmitry Vorotnikov

CMUC, Department of Mathematics, University of Coimbra, 3001-501 Coimbra, Portugal

[email protected]

Abstract.

We interpret a class of nonlinear Fokker-Planck equations with reaction as gradient flows over the space of Radon measures equipped with the recently introduced Hellinger-Kantorovich distance. The driving entropy of the gradient flow is not assumed to be geodesically convex or semi-convex. We prove new generalized dissipation inequalities, which allow us to control the relative entropy by its production. We establish the entropic exponential convergence of the trajectories of the flow to the equilibrium. Along with other applications, this result has an ecological interpretation as a trend to the ideal free distribution for a class of fitness-driven models of population dynamics. Our existence theorem for weak solutions under mild assumptions on the nonlinearity is new even in the absence of the reaction term.

Keywords: functional inequalities, optimal transport, Hellinger-Kantorovich distance, geodesic non-convexity

MSC [2010] 26D10, 35Q84, 49Q20, 58B20

1. Introduction

1.1. Setting

Let $\Omega$ be an open connected bounded domain in $\mathbb{R}^{d}$ with sufficiently smooth boundary and let $\nu$ be the outward unit normal along $\partial\Omega$ . We are interested in nonnegative solutions of

[TABLE]

Here $u$ is the unknown function, $f=f(x,u(x,t))$ is a known nonlinear function of $x$ and $u$ , equation (1.2) is the no-flux boundary condition and the initial data $u^{0}$ are nonnegative. We refer to Section 1.3 for the motivation and background.

When considering problem (1.1)–(1.3), we always make the following assumptions concerning the function $f\colon\Omega\times(0,\infty)\to\mathbb{R}$ :

[TABLE]

When needed, we also assume that

[TABLE]

*Remark 1.1**.*

We make comfortable assumptions about the smoothness of $f$ . We do not insist that $f$ should be defined for $u=0$ so as not to exclude the interesting cases such as $f=-(\log u+V(x))$ (which corresponds to the linear Fokker-Planck equation, cf. [15, 20]) and $f=u^{\alpha}-1$ , $-1<\alpha<0$ , (the fast diffusion, cf. [40]). However, we assume in (1.5) that the functions $uf$ and $uf_{x}$ admit continuous extensions to $\overline{\Omega}\times[0,\infty)$ . This ensures that the terms in (1.1) make sense. Moreover, we assume (1.10) to avoid certain complications with the entropy production to be defined below.

*Remark 1.2**.*

Assumption (1.6) is essential, it ensures the parabolicity of (1.1). The equation may become degenerate or singular only if $u=0$ or $u$ is large. The latter does not bother us as we only consider bounded solutions in what follows.

*Remark 1.3**.*

Assumptions (1.7), (1.8) ensure the existence of a positive equilibrium, see below.

*Remark 1.4**.*

Estimate (1.9) ensures that the entropy and energy of the equation are well-defined and well-behaved. Note that at least some restrictions on the growth of $f_{u}$ as $u\to 0$ are inevitable, as the related very fast diffusion equation is known to behave abnormally [39].

*Remark 1.5**.*

Conditions (1.11) and (1.12) are convenient technical assumptions needed for $L^{\infty}$ -bounds (hence for the existence theorem) and for controlling the energy for large $u$ in the proof of Theorem 1.8. However, they are not necessary everywhere, so we explicitely mention them when the need arises.

*Remark 1.6**.*

The results of the paper remain valid if $\Omega$ is the periodic box $\mathbb{T}^{d}$ .

It follows from (1.6)–(1.8) that for any $x\in\overline{\Omega}$ there exists a unique $m(x)>0$ such that

[TABLE]

Clearly, $m\in C^{2}(\overline{\Omega})$ . It is a stationary solution of (1.1), (1.2). As we will see, all non-zero solutions of the problem converge to $m$ .

1.2. Energy and entropy

Now we will introduce the energy and entropy functionals for equation (1.1) as well as the notion of weak solution.

Put

[TABLE]

It is easy to see that

[TABLE]

Observe that both $\Phi$ and $\Psi$ are nonnegative and strictly increase with respect to $u$ .

Note that if $u$ is a nonnegative function of $x$ and possibly of $t$ , an $L^{\infty}$ -bound on $u$ is translated into an $L^{\infty}$ -bound on $\Phi(\cdot,u(\cdot))$ , i. e., the superposition operator associated with $\Phi$ is $L^{\infty}$ -bounded. The same is true of $\Psi$ .

Let $u$ be a classical solution of (1.1)–(1.3). Equation (1.1) can be cast in the equivalent form

[TABLE]

where we write $\Phi$ for $\Phi(x,u(x,t))$ , etc. Multiplying by $\Phi(x,u(x,t))$ and integrating over $\Omega$ , we obtain

[TABLE]

We call the functional

[TABLE]

the energy of problem (1.1)–(1.3) and equation (1.14), the energy identity. Thus, any classical solution of (1.1)–(1.3) satisfies the energy identity (1.14).

For our purposes, the energy identity is useful because it allows us to control the integral $\iint_{Q_{T}}|\nabla\Phi|^{2}\,\mathrm{d}x\,\mathrm{d}t$ . In particular, we can define the weak solution of (1.1)–(1.3) in a class of functions $u$ such that $\Phi(\cdot,u(\cdot))\in L^{2}(0,T;H^{1}(\Omega))$ . It is easier to exploit this assumption in the case of equation (1.13). Thus, we define the weak solution as follows:

Definition 1.7.

Let $u^{0}\in L^{\infty}(\Omega)$ . A function $u\in L^{\infty}(Q_{T})$ is called a weak solution of (1.1)–(1.3) on $[0,T]$ if $\Phi(\cdot,u(\cdot))\in L^{2}(0,T;H^{1}(\Omega))$ and

[TABLE]

for any function $\varphi\in C^{1}(\overline{\Omega}\times[0,T])$ such that $\varphi(x,T)=0$ . A function $u\in L^{\infty}_{\text{loc}}([0,\infty);L^{\infty}(\Omega))$ is called a weak solution of (1.1)–(1.3) on $[0,\infty)$ if for any $T>0$ it is a weak solution on $[0,T]$ .

Now, let us address the entropy of the problem. Define

[TABLE]

It follows from (1.9) that $E$ is well-defined and continuous on $\overline{\Omega}\times[0,\infty)$ . As $f$ decreases with respect to $u$ and $f(x,m(x))=0$ , it is clear that $E\geq 0$ and $E(x,u)=0$ if and only if $u=m(x)$ . The relative entropy of equation (1.1) is the functional

[TABLE]

Observe that it is well-defined at least for $u\in L^{\infty}_{+}(\Omega)$ as the superposition operator $u\mapsto E(\cdot,u(\cdot))$ is bounded in the spaces $L^{\infty}_{+}\to L^{\infty}_{+}$ .

A straightforward computation shows that for a positive classical solution of (1.1)–(1.3) we have

[TABLE]

Equation (1.17) is called the entropy dissipation identity and the integral on the right-hand side of (1.17) is called the entropy production. However, the term $\int_{\Omega}u|\nabla f|^{2}\,\mathrm{d}x$ may make no sense for vanishing or non-smooth $u$ . In order to generalise the definition of the entropy production, we use the identity

[TABLE]

Given a function $u\in L_{+}^{\infty}(\Omega)$ such that $\Phi(\cdot,u(\cdot))\in H^{1}(\Omega)$ , the right-hand side of the last identity is a nonnegative measurable function on $[u>0]$ , so we can define the entropy production for such functions by the formula

[TABLE]

where the second integral on the right-hand side may be infinite. Thus, we see that any positive classical solution of (1.1)–(1.3) satisfies the entropy dissipation identity

[TABLE]

As usual, in the case of weak solutions we establish not the identities (1.14) and (1.18) but rather corresponding inequalities, viz. the energy inequality

[TABLE]

and the entropy dissipation inequality

[TABLE]

For functions $u\in L_{+}^{\infty}(\Omega)$ such that $\Phi\in L^{2}(0,T;H^{1}(\Omega))$ we understand (1.19) and (1.20) in the sense of measures, i. e., that for any smooth nonnegative compactly supported function $\chi\colon(0,T)\to\mathbb{R}$ we respectively have

[TABLE]

If (1.20) holds in the sense of measures, the derivative $\partial_{t}\mathcal{E}(u)$ is a nonpositive distribution and hence a measure, while the entropy $\mathcal{E}(u)$ itself a. e. coincides with a non-increasing function.

An important question is whether the entropy can be controlled by the entropy production, since this would imply the exponential stability of the equilibrium. It turns out that this is true provided that the $L^{1}$ -norm of $u$ is bounded away from [math]. Specifically, we have

Theorem 1.8 (Entropy-entropy production inequality).

Suppose that $f$ satisfies (1.4)–(1.10) as well as (1.11). Let $U\subset L_{+}^{\infty}(\Omega)$ be a set of functions such that for any $u\in U$ , we have $\Phi(\cdot,u(\cdot))\in H^{1}(\Omega)$ and

[TABLE]

Then there exists $C_{U}$ such that

[TABLE]

Theorem 1.8 is a consequence of a fairly general functional inequality established in Section 2.

Theorem 1.9 (Existence of weak solutions).

Suppose that $f$ satisfies (1.4)–(1.10) as well as (1.11) and (1.12). Then for any $u^{0}\in L^{\infty}_{+}(\Omega)$ there exists a nonnegative weak solution $u\in L^{\infty}(\Omega\times(0,\infty))$ of problem (1.1)–(1.3) enjoying the following properties:

(1)

(upper $L^{\infty}$ -bound)

[TABLE] 2. (2)

$u$ * satisfies the energy inequality (1.19) in the sense of measures and*

[TABLE] 3. (3)

$u$ * satisfies the entropy dissipation inequality (1.20) in the sense of measures and*

[TABLE] 4. (4)

(lower $L^{1}$ -bound)

[TABLE]

*Remark 1.10**.*

Theorem 1.9, mutatis mutandis, is also valid in the case of the pure Fokker-Planck equation (1.29). Even in this case, our conditions on the nonlinearity $f$ are more relaxed than the ones available in the literature, see, e.g., [1, 4, 22, 23, 13, 40, 7, 3] and the references therein.

*Remark 1.11**.*

In the general case, uniqueness of solutions cannot be expected due to the non-Lipschitz reaction term. However, our weak solutions are unique provided the initial data is bounded away from zero, see Theorem 3.9.

*Remark 1.12**.*

Under the hypotheses of Theorem 1.9, the right-hand side of (1.23) is always finite (see Remark 3.5). Moreover, if $u^{0}$ satisfies an estimate $\|u^{0}\|_{L^{\infty}(\Omega)}\leq a$ , inequality (1.23) provides an estimate $\|u\|_{L^{\infty}(\Omega\times(0,\infty))}\leq C_{a}$ .

The next theorem shows that the solutions that we have constructed exponentially converge to $m$ . Note that (1.12) is not needed for the long-time convergence.

Theorem 1.13 (Convergence to equilibrium).

Assume (1.11) and suppose that a weak solution $u$ of (1.1)–(1.3) with the initial data $u^{0}\not\equiv 0$ satisfies the entropy dissipation inequality (1.20), inequality (1.25), and the lower $L^{1}$ -bound (1.26). Then $u$ exponentially converges to $m$ in the sense of entropy:

[TABLE]

where $\gamma>0$ can be chosen uniformly over initial data satisfying

[TABLE]

with some $c>0$ .

Theorems 1.8, 1.9, and 1.13 are proved in Section 3.3.

1.3. Motivation and background

The nonlinear Fokker-Planck equation

[TABLE]

is intended to express the behaviour of stochastic systems coming from various branches of physics, chemistry and biology, see [15, 38, 21, 5]. In order to take into account the creation and annihilation of mass, the general drift-diffusion-reaction equation (1.1) was suggested in [14]. In the considerations of [14] (cf. also [15]), the crucial role is played by the free energy functional that up to an additive constant coincides with our relative entropy functional $\mathcal{E}$ from (1.16). We opt for this change of terminology (though for thermodynamists the free energy involves the (physical) entropy, the internal energy, and the temperature) because in mathematical analysis it is convenient to refer to the basic Lyapunov functional of a system as the entropy, cf. [41, p. 270].

On the other hand, equation (1.1) is a general nonlinear model for the spatial dynamics of a population that is tending to achieve the ideal free distribution [17, 16] (the distribution that happens if everybody is free to choose its location) in a heterogeneous environment. The dispersal strategy is determined by a local intrinsic characteristic of organisms called fitness (see, e.g., [10, 11]). The fitness manifests itself as a growth rate, and simultaneously affects the dispersal as the species move along its gradient towards the most favorable environment. In (1.1), $u(x,t)$ is the density of organisms, and $f(x,u)$ is the fitness. The equilibrium $u(x)\equiv m(x)$ when the fitness is constantly zero corresponds to the ideal free distribution. The original model [32, 10] assumes a linear logistic fitness

[TABLE]

but in general it can be any nonlinear function of the spatial variable and the density, cf. [11]. The assumptions (1.6), (1.7), (1.8) are natural as they simply mean that the fitness is decreasing with respect to the population density (as the resources are limited), being positive for very small densities and negative for very large densities. Our Theorem 1.13 indicates that the populations converge to the ideal free distribution with an exponential rate.

The existence of weak solutions for the fitness-driven dispersal model (1.1)–(1.3) with the logistic fitness (1.30) was shown in [12], and the entropic exponential convergence to $m$ was established in [25]. The same kind of results for cross-diffusion systems involving several interacting populations (with logistic fitnesses) can be found in [24]. Related two-species models were investigated in [6, 31], where one population uses the fitness-driven dispersal strategy and the other diffuses freely or does not move at all. A system of two interacting populations with a particular nonlinear fitness function has recently been considered in [43], which is the only existing mathematical treatment of a non-logistic fitness model that we are aware of.

But perhaps our main motivation to study (1.1) is that it is a gradient flow of the entropy functional $\mathcal{E}$ with respect to the intriguing recently introduced distance on the space of Radon measures, which is related to the unbalanced optimal transport (i.e., failing to preserve the total transported mass), and that is referred to as the Hellinger-Kantorovich distance or the Wasserstein-Fisher-Rao distance [25, 8, 30, 29, 9]. This distance endows the set of Radon measures with a formal (infinite dimensional) Riemannian metric $\langle\cdot,\cdot\rangle$ , and provides first- and second-order differential calculus [25] in the spirit of Otto [35, 41, 42]. In particular, one can compute the metric gradients of the functionals of the form

[TABLE]

by the formula

[TABLE]

where $\frac{\delta F}{\delta u}=\partial_{u}F(x,u)$ stands for the first variation with respect to $u$ and $\nabla=\nabla_{x}$ is the usual gradient in space. We refer to [25] for further details and explanations. Since $f=-\partial_{u}E$ , we can recast (1.1) as a gradient flow

[TABLE]

The entropy dissipation identity (1.17), which by the way was already known to Frank [14], is then nothing but the archetypal property of gradient flows

[TABLE]

In this connection, we recall that for the metric gradient flows like (1.32), the geodesic convexity of the driving entropy functional (or at least semi-convexity, i.e., $\lambda$ -convexity with a negative constant $\lambda$ ) makes a difference [35, 2, 41, 42, 36]. The presence of convexity allows one to apply minimizing movement schemes [2, 20] to construct solutions to the gradient flow. Moreover, $\lambda$ -convexity with $\lambda$ strictly positive enables the Bakry-Emery procedure that usually yields the exponential convergence of the relative entropy to zero. Minimizing movement schemes for Hellinger-Kantorovich gradient flows of geodesically convex functionals and for related reaction-diffusion equations were suggested in [19, 18].

Our entropy $\mathcal{E}$ is geodesically $(-1/2)$ -convex with respect to the Hellinger-Kantorovich structure if $f=1-u^{\alpha}$ , $\alpha>0$ , but fails to be semi-convex for $f=u^{\alpha}-1$ , $\alpha<0,$ and for $f=-\log u$ (the latter option corresponds to the interesting case of the Boltzmann entropy). The spatial heterogeneity further complicates the situation. The quadratic (logistic) multicomponent entropy considered in [24, 26] is not even semi-convex. All this can be observed by computing the Hessian of the entropy, cf. [25, Section 3.4]; the non-convexity of the Boltzmann entropy with respect to the Hellinger-Kantorovich metric was also mentioned in [19, 18, 30, 29]. We refer to [28] for a more detailed discussion of examples of $f$ and the corresponding geodesic non-convexity. However, Santambrogio [36] emphasizes that the lack of geodesic convexity is not a universal obstacle for the study of gradient flows; our results in the current paper and in [24, 26, 25, 27, 28, 37] illustrate this idea.

2. Generalized dissipation inequalities

2.1. Setting

Motivated by the expressions for the entropy and entropy production, we forget for a while problem (1.1)–(1.3) and consider the integrals

[TABLE]

on their own right. Here $\Omega$ a domain in $\mathbb{R}^{d}$ ; $p\geq 1$ ; the functions

[TABLE]

are fixed, and $u$ varies over a set $U$ of functions $\Omega\to(0,\infty)$ . Observe that the nonnegativity of $E$ and $g$ ensures the existence of the integrals (2.1) and (2.2), although they need not be finite.

The functions $f$ and $E$ introduced in Section 1.2 are, of course, prototypes for the ones appearing in (2.1) and (2.2), but we assume no formal relationship between them. In particular, in this section we do not suppose that $f$ satisfies (1.4)–(1.12).

We would like to know whether (2.1) can be controlled by (2.2) uniformly with respect to $u\in U$ . In general, this is not the case, cf. a related discussion in [27]. However, we show that under suitable assumptions on the functions $E$ , $f$ , and $g$ , (2.2) does indeed control (2.1) provided that the set $U$ of admissible $u$ is separated from [math] in some sense.

For simplicity, we concentrate on the regular case. Section 2.4 contains a discussion of possible generalisations.

Theorem 2.1.

Let $\Omega$ be a bounded, connected, open domain in $\mathbb{R}^{d}$ admitting the relative isoperimetric inequality. Let $p\geq 1$ . Suppose that functions $E,g\in C(\Omega\times(0,\infty))$ and $f\in C^{1}(\Omega\times(0,\infty))$ satisfy

[TABLE]

Finally, suppose that a set $U\subset C^{1}(\Omega)$ consisting of strictly positive functions contains no sequence $\{u_{n}\}$ such that $\{E(\cdot,u_{n}(\cdot))\}$ is bounded in $L^{1}(\Omega)$ and $\{u_{n}\}$ converges to [math] in measure. Then there exists a constant $C=C(\Omega,p,E,g,f,U)$ such that

[TABLE]

*Remark 2.2**.*

The isoperimetric inequality for $\Omega$ reads

[TABLE]

where $P(A;\Omega)$ denotes the relative perimeter of a Lebesgue measurable set $A$ of locally finite perimeter with respect to $\Omega$ , cf. [33, Remark 12.39], [34]. We recall that the relative perimeter is defined as

[TABLE]

where $\mu_{A}:=\nabla 1_{A}$ is the Gauss-Green measure associated with $A$ . The support of $\mu_{A}$ is contained [33] in the topological boundary of $A$ .

*Remark 2.3**.*

If $E\in C(\overline{\Omega}\times\mathbb{R}_{+})$ , condition (2.4) is automatically true. If the set $\{(x,u)\in\overline{\Omega}\times\mathbb{R}_{+}\colon E(x,u)=0\}$ is compact, the right-hand side of (2.6) is simplified to $\max_{E(x,u)=0}f(x,u)$ and likewise, if $f\in C(\overline{\Omega}\times\mathbb{R}_{+})$ , the left-hand side of (2.6) can be written as $\min_{x}f(x,0)$ . As for (2.5), it is more tricky. In Section 2.4 we show that it always holds in a particular setting relevant for gradient flows (Theorem 2.9).

*Remark 2.4**.*

The infimum in (2.5) depends on $\varepsilon$ and may tend to zero as $\varepsilon\to 0$ , otherwise the claim would be trivial.

2.2. Strategy of the proof of Theorem 2.1

Before starting the proof of Theorem 2.1, we would like to informally outline the underlying ideas.

For simplicity, we will opt for an argument by contradiction. Of course, a direct proof could be presented (as we have recently done in [27] for a related inequality), and a quantitative constant could be derived from it. However, this would be much more cumbersome, and the constant obtained in this way would anyway not be optimal. Any discussion of quantitative constants lies beyond the scope of this article.

It easily follows from (2.5) that $g$ controls $E$ from above unless $u$ is small. Moreover, we infer (Lemma 2.5) that if the constant in (2.7) blows up, the sets where either $u$ or $E$ are small tend to grow and together occupy nearly all of $\Omega$ , while the ‘transitional annulus’—where neither is small—collapses. At this point we must be prepared to face the situation where the integral

[TABLE]

is controlled neither by

[TABLE]

(because (2.5) is not applicable), nor by

[TABLE]

(because $g$ may be small), nor by

[TABLE]

(because the ‘annulus’ is too small).

This is where the term with the gradient comes into play. The crucial observation is that the total variation of $f$ over the ‘annulus’ can be estimated from below. Actually, condition (2.6) gives a universal lower bound on the variation of $f$ between the ‘inner boundary’ of the annulus (say, where $u$ is small) and its ‘outer boundary’ (where $E$ is small). All in all, the integral (2.9) is controlled by the area of the set $[u\ll 1]$ (due to (2.4)), which is controlled by the perimeter of this set (by the isoperimetric inequality), which is in turn controlled by the total variation of $f$ over the ‘annulus’. This eventually leads to a contradiction. Naturally, when this idea is implemented in Lemma 2.7 and the subsequent reasoning, we must relate the total variation and the integral $\int_{\Omega}u|\nabla f|^{p}\,\mathrm{d}x$ . Then we use the coarea formula and estimate the total variation of $f$ by the perimeters of its superlevel sets.

2.3. Proof of Theorem 2.1

Here we prove Theorem 2.1. We start with the following observations.

Under the hypotheses of Theorem 2.1, integral (2.1) is finite for $u\in U$ whenever so is

[TABLE]

Indeed, according to (2.4) we can choose $\varepsilon>0$ such that

[TABLE]

By (2.5), we have

[TABLE]

(possibly $B=\infty$ ). Then $E(x,u)\leq g(x,u)/B$ whenever $u>\varepsilon$ , so

[TABLE]

as claimed.

Take sequences $\{\varepsilon_{n}\}$ and $\{\xi_{n}\}$ such that $\varepsilon_{n}>0$ , $\varepsilon_{n}\to 0$ ,

[TABLE]

(this is possible according to (2.5)), and $\xi_{n}\to 0$ .

Assume that Theorem 2.1 is not true. Then there exists a sequence of functions $\{u_{n}\}\subset U$ such that

[TABLE]

where

[TABLE]

Clearly, $E_{n},g_{n}\in C(\Omega)$ and $f_{n}\in C^{1}(\Omega)$ . Moreover, it easily follows from (2.3)–(2.6) that

[TABLE]

and according to the choice of $\xi_{n}$ , we have

[TABLE]

We want to show that the sequence $\{E_{n}\}$ is bounded in $L^{1}(\Omega)$ and $u_{n}\to 0$ in measure, thus obtaining a contradiction.

We use (2.10) to estimate

[TABLE]

Thus, we have

[TABLE]

For large $n$ , the first term on the right-hand side is negative, so we conclude that

[TABLE]

From (2.15) we get

[TABLE]

and by (2.12), the last expression is bounded uniformly with respect to $n$ . Hence the sequence $\{E_{n}\}$ is bounded in $L^{1}(\Omega)$ .

Lemma 2.5.

Given $a>0$ ,

[TABLE]

Proof.

Using (2.17), we have:

[TABLE]

where we have taken into account (2.12), so (2.18) is proved. ∎

Lemma 2.6.

Given $a>0$ , for large $n$ we have

[TABLE]

Proof.

Using the estimate

[TABLE]

obtained in the proof of Lemma 2.5, we get

[TABLE]

and the lemma follows. ∎

It follows from (2.13) that we can choose $a>0$ , $\alpha$ , and $\beta$ , all independent of $n$ , such that for large $n$ we have

[TABLE]

We can assume that the limit

[TABLE]

exists. It follows from (2.20) that for large $n$ the sets $[u_{n}\leq\varepsilon_{n}]$ and $[E_{n}\leq a]$ are disjoint, so in view of Lemma 2.5 we have

[TABLE]

Thus, we actually face three logical possibilities:

[TABLE]

As $\varepsilon_{n}\to 0$ , (2.22) clearly implies $u_{n}\to 0$ in measure, a contradiction.

In what follows we show that (2.23) and (2.24) are in fact impossible. The following lemma is crucial.

Lemma 2.7.

We have

[TABLE]

Proof.

We have

[TABLE]

Using the coarea formula, we get:

[TABLE]

Fix $t\in(\alpha,\beta)$ . Evoking the definition of the relative perimeter, we have

[TABLE]

where $\mu_{[f_{n}>t]}$ is the Gauss-Green measure. Obviously, we have

[TABLE]

for any $t\in(\alpha,\beta)$ . It follows from (2.20) that

[TABLE]

so

[TABLE]

and continuing (2.29), we obtain

[TABLE]

Combining this with (2.27) and (2.28), we obtain (2.25). ∎

Let us show that (2.23) is impossible. Assume that it holds.

If at a point $x$ we have $f_{n}(x)>t$ , $t\in(\alpha,\beta)$ , (2.20) guarantees that $E_{n}(x)>a$ . Hence, $[f_{n}>t]\subset[E_{n}>a]$ . It follows from (2.23) and (2.21) that $[|E_{n}\leq a]|\to|\Omega|$ , and thus $|[E_{n}>a]|\to 0$ , so we conclude that $|[f_{n}>t]|$ is uniformly in $t$ small when $n$ is large. For such large $n$ we can apply the isoperimetric inequality:

[TABLE]

Now it follows from (2.20) that $[u_{n}\leq\varepsilon_{n}]\subset[f_{n}>t]$ , so we have

[TABLE]

Plugging this estimate into (2.25), we obtain

[TABLE]

Estimating

[TABLE]

by virtue of (2.19), we obtain

[TABLE]

where $C$ is independent of $n$ .

Combining obtained estimate with (2.16), we get:

[TABLE]

whence

[TABLE]

as $\xi_{n}\to 0$ and the suprema are bounded by (2.12). This contradicts the fact that the left-hand side is a positive constant independent of $n$ . Thus, (2.23) is impossible.

It remains to show that (2.24) is also impossible. Assume that it holds.

It is easy to check that in this case we have

[TABLE]

where $p_{0}>0$ is independent of $t$ and $n$ . Indeed, we have the inclusions

[TABLE]

and as in our case the measure of the first and third terms goes to $\mu_{0}$ as $n\to\infty$ , we also have

[TABLE]

Now it suffices to apply the isoperimetric equality to $[f_{n}>t]$ if $\mu_{0}<1/2$ and to $[f_{n}\leq t]$ otherwise.

Plugging (2.30) into (2.25), we get

[TABLE]

Comparing this with (2.16), we obtain

[TABLE]

As $n\to\infty$ , the left-hand side remains bounded away from 0, while the right-hand side goes to 0, a contradiction.

2.4. Generalisations and specialisations

We start with the remark that Theorem 2.1 can often be applied if $U$ is a subset of a space $X$ of functions defined on $\Omega$ provided that $C^{1}(\Omega)$ is dense in $X$ and the integrals (2.1) and (2.2) are continuous with respect to the topology of $X$ . Indeed, if $U_{1}=U\cap C^{1}(\Omega)$ is dense in $U$ , we apply the theorem to $U_{1}$ and proceed by density to make sure that the same constant works for $U$ as well. On the other hand, if $U_{1}$ is not dense in $U$ , we replace $U$ with its small enlargement $\widetilde{U}$ in the cone of nonnegative functions in $X$ and apply the same reasoning to $\widetilde{U}$ . A more complicated density argument is used in the proof of Theorem 1.8 given in Section 3.3.

Another question is whether the constant $C$ can be chosen uniformly with respect to $(E,g,f)$ if the latter triple is allowed to vary over a set $\mathcal{X}$ . It turns out that Theorem 2.1 can be easily extended to handle this case. Specifically, if the suprema and infima in (2.4)–(2.6) are additionally taken over $(E,g,f)\in\mathcal{X}$ , the constant $C$ can be chosen independently of $(E,g,f)$ . The proof remains essentially the same. Assuming the converse, we have violating sequences $\{(\widetilde{E}_{n},\tilde{g}_{n},\tilde{f}_{n})\}\subset\mathcal{X}$ and $\{u_{n}\}\subset U$ such that (2.10) holds with

[TABLE]

Moreover, the functions $E_{n}$ , $g_{n}$ , and $f_{n}$ satisfy (2.11)–(2.14). The rest of the proof can be reused verbatim.

It should also be noted that the bare $u$ on the right-hand side of (2.7) can be replaced by a nonnegative function $v(x,u(x))$ . Of course, in this case it no longer makes sense to require that $U$ should consist exclusively of positive functions. The separation from [math] should be taken in the sense that no sequence $\{v(\cdot,u_{n}(\cdot))\}$ , where $u_{n}\in U$ and the sequence $\{E_{n}(\cdot,u_{n}(\cdot))\}$ is bounded in $L^{1}(\Omega)$ , converges to [math] in measure. However, if $v$ is, for example, an increasing function vanishing at [math], this new condition is clearly equivalent to the original one.

Again, the proof remains essentially unchanged, the sets $[u_{n}>\varepsilon_{n}]$ and $[u_{n}\leq\varepsilon_{n}]$ being replaced by $[v_{n}>\varepsilon_{n}]$ and $[v_{n}\leq\varepsilon_{n}]$ , respectively (here $v_{n}(x)=v(x,u_{n}(x))$ ).

Summarising, we have the following strengthened version of Theorem 2.1:

Theorem 2.8.

Let $\Omega$ be a bounded, connected, open domain in $\mathbb{R}^{d}$ admitting the relative isoperimetric inequality. Let $p\geq 1$ and $I$ be an interval (possibly unbounded). Let $\mathcal{X}=\{(E,g,f,v)\}$ be a set of tuples such that $E,g,v\in C(\Omega\times I)$ , $f\in C^{1}(\Omega\times I)$ , and

[TABLE]

Finally, suppose that a set $U\subset C^{1}(\Omega;I)$ satisfies the following requirement: for any sequences $\{(E_{n},g_{n},f_{n},v_{n})\}\subset\mathcal{X}\}$ and $\{u_{n}\}\subset U$ such that the sequence $\{E_{n}(\cdot,u_{n}(\cdot))\}$ is bounded in $L^{1}(\Omega)$ , the sequence $\{v_{n}(\cdot,u_{n}(\cdot))\}$ does not converge to [math] in measure. Then there exists a constant $C$ depending only on $\Omega$ , $p$ , $U$ and $\mathcal{X}$ such that

[TABLE]

The proof is left to the reader.

Another option would be to allow for nonnegative instead of strictly positive $u$ in Theorem 2.1. In this case one assumes that $E\in C(\Omega\times[0,\infty))$ and that the supremum in (2.4) is taken over $0\leq u\leq\varepsilon$ and $x\in\Omega$ . The resulting inequality differs from (2.7) in that the integral on the right-hand side is taken over $[u>0]$ . The only modification needed in the proof is that whenever $g$ or $u|\nabla f|^{p}$ are integrated over $\Omega$ , the domain of integration should be changed to $[u>0]$ . Note that this does not fit into the previous theorem because $f$ can be undefined on $[u=0]$ .

We conclude by showing that Theorem 2.1 is applicable in a situation relevant for gradient flows. In the subsequent formulation, $f_{u}$ and $E_{u}$ denote the derivatives of the functions $f$ and $E$ , respectively, with respect to their second argument.

Theorem 2.9.

Suppose that functions $E\in C(\overline{\Omega}\times[0,\infty))$ , $f\in C^{1}(\overline{\Omega}\times(0,\infty))$ , and $m\in C(\overline{\Omega})$ satisfy

[TABLE]

and let $U\subset C^{1}(\Omega)$ be a set of strictly positive functions having the property that no sequence $\{u_{n}\}\subset U$ such that $\{E(\cdot,u_{n}(\cdot))\}$ is bounded in $L^{1}(\Omega)$ , converges to [math] in measure. Finally, let $\sigma\in(0,\min_{\overline{\Omega}}m)$ and

[TABLE]

Then we have

[TABLE]

where $C>0$ depends on $\Omega$ , $f$ , $\sigma$ , and $U$ .

*Remark 2.10**.*

Observe that under the hypotheses of Theorem 2.9, the functions $E$ and $m$ are uniquely determined by $f$ . Indeed, if $x\in\Omega$ is fixed, $E(x,u)$ as a function of $u$ attains its minimum at $m(x)>0$ , so $E_{u}(x,m(x))=0$ , i. e., $f(x,m(x))=0$ , according to (2.38). This uniquely defines $m(x)$ , as it follows from (2.39) that $f(x,u)$ strictly decreases with respect to $u$ . Now, $E(x,u)$ is the antiderivative of $-f(x,u)$ with respect to $u$ vanishing at $m(x)$ .

Proof.

We check the hypotheses of Theorem 2.8 with $I=(0,\infty)$ , $p=2$ , $g(x,u)=v_{\sigma}(u)(f(x,u))^{2}$ , and the set $\mathcal{X}$ consisting of the single tuple $(E,g,f,v_{\sigma})$ . Clearly, we have (2.31), while (2.32)–(2.34) are equivalent to (2.4)–(2.6).

Recalling Remark 2.3, we see that (2.4) holds.

Let us check (2.6). Fix $x\in\Omega$ . The function $E(x,u)$ is strictly convex in $u$ and attains its zero minimum only at $u=m(x)$ . As $f(x,m(x))=0$ , we see that

[TABLE]

On the other hand, as $f$ decreases with respect to $u$ , we have

[TABLE]

so (2.6) indeed holds.

It remains to check (2.5). Without loss of generality, assume that $\varepsilon>0$ is such that

[TABLE]

By Cauchy’s mean value theorem, for any $x\in\Omega$ , $u>\sigma$ , $u\neq m(x)$ , we have

[TABLE]

where $\xi_{x,u}$ is some point between $u$ and $m(x)$ .

By uniform continuity, there exists $\delta\in(0,\min_{\overline{\Omega}}m-\sigma)$ such that

[TABLE]

implies

[TABLE]

Then from (2.44) and (2.41) we see that

[TABLE]

Further, using (2.45) and (2.42), we have

[TABLE]

whence, recalling that $f_{u}$ is negative and $f$ is decreasing, we conclude

[TABLE]

Now, if $|u-m(x)|<\delta$ , the point $\xi_{x,u}$ also satisfies $|\xi-m(x)|<\delta$ , so we use (2.46) to conclude from (2.43) that

[TABLE]

If $u\geq m(x)+\delta$ , then either $m(x)<\xi_{x,u}<m(x)+\delta$ and we again obtain (2.48), or $\xi_{x,u}\geq m(x)+\delta$ and then we use (2.47) to get

[TABLE]

Thus,

[TABLE]

since the function $g/E$ is continuous and positive on the compact set

[TABLE]

We have showed that (2.5) holds.

Thus, the hypotheses of Theorem 2.8 are fulfilled and the inequality follows. ∎

3. Technicalities

3.1. Positive classical solutions

Let

[TABLE]

be the Heaviside step function.

Lemma 3.1.

If nonnegative $u,\hat{u}\in C^{\infty}(\overline{\Omega})$ satisfy the no-flux boundary condition (1.2), then

[TABLE]

where $f$ and $\hat{f}$ stand for $f(x,u(x))$ and $f(x,\hat{u}(x))$ , respectively.

Proof.

Without loss of generality, the functions $u$ and $\hat{u}$ are defined and smooth on $\mathbb{R}^{d}$ . Consider the set $\Upsilon:=[u-\hat{u}>0]$ . First let us assume that [math] is a regular value of the function $u-\hat{u}$ , then the boundary of $\Upsilon$ is smooth. Employing de Giorgi’s Gauss-Green formula [33, Theorem 15.9] and the formula for the Gauss-Green measure of an intersection [33, Theorem 16.3], we compute

[TABLE]

where $\nu_{\Upsilon\cap\Omega}$ is the measure-theoretic outward unit normal vector along the reduced boundary $\partial^{*}(\Upsilon\cap\Omega)$ of the intersection [33]. Due to the no-flux boundary condition, the last two integrals vanish. On $\partial\Upsilon\cap\Omega$ , we have $u=\hat{u}$ and consequently, $f=\hat{f}$ . Thus, we can write

[TABLE]

Due to the monotonicity of $f$ , we have $\Upsilon=[f-\hat{f}<0]$ . We see then that whenever $\nabla(f-\hat{f})\neq 0$ on $\partial\Upsilon$ , $\nabla(f-\hat{f})$ is an outward normal vector along $\partial\Upsilon$ . Thus, $\nabla(f-\hat{f})\cdot\nu_{\Upsilon}\geq 0$ and equality (3.2) gives (3.1).

In the general case, take a decreasing sequence $\varepsilon_{n}\to 0$ such that [math] is a regular value of $u-\hat{u}-\varepsilon_{n}$ . Set

[TABLE]

By the above, we have

[TABLE]

As $\theta$ is left-continuous, we have

[TABLE]

moreover, it is clear that

[TABLE]

Passing to the limit in (3.3), we obtain (3.1). ∎

Lemma 3.2 ( $L^{1}$ -contraction for positive classical solutions).

Let $u$ and $\hat{u}$ be classical solutions of (1.1)–(1.3) on $[0,T]$ with different initial data. Suppose that $u$ and $\hat{u}$ satisfy

[TABLE]

with some $\kappa>0$ and let $L_{\kappa}>0$ be such that

[TABLE]

Then for a. a. $t>0$ ,

[TABLE]

Proof.

We have:

[TABLE]

where $f$ and $\hat{f}$ stand for $f(x,u(x,t))$ and $f(x,\hat{u}(x,t))$ , respectively. By Lemma 3.1, we have $I_{1}\geq 0$ . To estimate $I_{2}$ , we use (3.4) and the observation that the integrand vanishes where $u-\hat{u}<0$ , thus obtaining

[TABLE]

Inequality (3.5) follows. ∎

For $c\in\mathbb{R}$ , define $u_{c}\in C^{2}(\overline{\Omega})$ by

[TABLE]

As $f$ is monotonous in $u$ , we see that the function $u_{c}$ is unique, but it does not need to exist for a given $c$ . Note that $u_{0}=m$ .

*Remark 3.3**.*

There is a simple formula for the $L^{\infty}$ -norm of $u_{c}$ :

[TABLE]

It follows from the fact that due to the monotonicity of $f$ , the inequality $\xi\geq\|u_{c}\|_{L^{\infty}(\Omega)}$ or, equivalently, $\xi\geq u_{c}(x)$ for all $x\in\Omega$ , holds if and only if $f(x,\xi)\leq f(x,u_{c}(x))\equiv c$ for all $x\in\Omega$ , i. e., when

[TABLE]

*Remark 3.4**.*

If (1.11) holds, for any $u\in L^{\infty}_{+}(\Omega)$ the function $u_{c}$ with

[TABLE]

is well-defined and $u\leq u_{c}$ a. e. in $\Omega$ . Indeed, if the second alternative in (1.11) holds, for any $x\in\overline{\Omega}$ , the function $f(x,\xi)$ assumes all the values in the interval $(-\infty,0]$ as $\xi$ varies over $[m(x),\infty)$ ; in particular, $f(x,\xi)$ attains the value $c$ . If, on the other hand, the first alternative in (1.11) holds, take $\xi_{1}\geq\|u\|_{L^{\infty}}$ such that $c_{1}:=f(x,\xi_{1})$ is independent of $x$ and negative. Clearly, for any fixed $x\in\overline{\Omega}$ , the function $f(x,\xi)$ takes all the values in the interval $[c_{1},0]$ as $\xi$ varies over $[m(x),\xi_{1}]$ . Now it suffices to observe that due to the monotonicity of $f$ , we have $c\in[c_{1},0]$ . One can prove in the same way that if (1.12) holds, for any function $u$ essentially bounded away from [math] on $\Omega$ , there exists $u_{c}$ such that $u\geq u_{c}$ a. e. in $\Omega$ , and $c\geq 0$ .

*Remark 3.5**.*

It follows from Remarks 3.4 and 3.3 that if (1.11) holds, the right-hand side of (1.23) is finite for any $u\in L^{\infty}_{+}(\Omega)$ .

Lemma 3.6 (Restricted $L^{1}$ -contraction).

Let $u$ be a classical solution of (1.1)–(1.3) on $[0,\infty)$ . Then for $c\leq 0$ we have

[TABLE]

and likewise, for $c\geq 0$ we have

[TABLE]

provided that $u_{c}$ exists.

Proof.

Let us prove (3.9) for $c\leq 0$ . Computing the derivative of the left-hand side, for a. a. $t>0$ we get

[TABLE]

As $\nabla f(x,u_{c}(x))\equiv 0$ , we can use Lemma 3.1 to get $I_{1}\geq 0$ . Now, the integrand of $I_{2}$ can only be non-zero where $u>u_{c}$ , in which case $f\leq c\leq 0$ due to the monotonicity of $f$ ; consequently, $I_{2}\leq 0$ . Thus, we have

[TABLE]

and (3.9) follows. Inequality (3.10) is proved in much the same way. ∎

Lemma 3.7.

Suppose that $f$ satisfies (1.11) and (1.12). Then for any smooth $u^{0}\colon\overline{\Omega}\to(0,\infty)$ satisfying the non-flux boundary condition, problem (1.1)–(1.3) has a classical solution.

Proof.

Equation (1.1) can be cast in the form

[TABLE]

If we show that a classical solution is a priori bounded and stays away from 0, we can ignore the fact that the coefficient $-uf_{u}$ can be degenerate or singular at $u=0,\infty$ and infer the existence of the solution from the classical theory of quasilinear parabolic equations.

Indeed, according to Remark 3.4, we can find $u_{c_{1}}$ and $u_{c_{2}}$ such that $c_{2}\leq 0\leq c_{1}$ and

[TABLE]

Then it follows from Lemma 3.6 that

[TABLE]

providing the required bounds. ∎

3.2. Positive initial data

If the initial data (1.3) is bounded away from [math], we approximate it with smooth functions and prove the existence and uniqueness of weak solutions to (1.1)–(1.3) stated in Theorem 3.9 below.

Lemma 3.8.

Suppose that $u\in L_{+}^{\infty}(Q_{T})$ satisfies the energy inequality (1.19) in the sense of measures; then

[TABLE]

where $C>0$ is determined by an upper bound on $\|u\|_{L^{\infty}(\Omega)}$ .

Proof.

The function

[TABLE]

has a non-positive derivative in the sense of measures, so it a. e. coincides with a non-increasing function. In other words, for a. a. $t_{0},t_{1}\in(0,T)$ , $t_{0}<t_{1}$ , we have

[TABLE]

An upper bound on $\|u\|_{L^{\infty}(Q_{T})}$ defines essential upper bounds on $uf$ , $\Phi=\Phi(x,u(x,t))$ , $\Phi_{x}$ , and $uf_{x}$ , so for a. a. $t\in(t_{0},t_{1})$ we can estimate

[TABLE]

whence

[TABLE]

Passing to the essential upper limit as $t_{0}\to 0$ and estimating $t_{1}-t_{0}\leq T$ , we obtain

[TABLE]

whence (3.11) and (3.12) follow. ∎

Theorem 3.9 (Solvability for positive data).

Suppose that $f$ satisfies (1.4)–(1.10) as well as (1.11) and (1.12). Then for any $u^{0}\in L_{+}^{\infty}$ such that

[TABLE]

with some constant $\kappa>0$ , there exists a unique weak solution

[TABLE]

satisfying the following properties: i) the upper bound (1.23) and lower bound (1.26); ii) the energy and entropy dissipation inequalities as well as (1.24) and (1.25); iii) the restricted contraction

[TABLE]

whenever $u_{c}$ is defined; iv) if $\hat{u}$ is another such solution with the initial data $\hat{u}^{0}$ , the $L^{1}$ -contraction holds:

[TABLE]

where $L_{\kappa}$ is defined by (3.4).

Proof.

Let $\{u_{n}^{0}\}$ be a sequence of smooth functions satisfying the no-flux boundary condition such that

[TABLE]

and

[TABLE]

Let $u_{n}$ be the classical solution of (1.1)–(1.3) on $[0,\infty)$ with the initial data $u^{0}_{n}$ . For any $T>0$ , it follows from Lemma 3.2 that

[TABLE]

so $\{u_{n}\}$ is a Cauchy sequence in $C([0,T];L^{1}(\Omega))$ . As $T$ is arbitrary, we see that $\{u_{n}\}$ converges in $C([0,\infty);L^{1}(\Omega))$ to some function $u$ . We claim that it is the sought-for solution.

By Remark 3.4, there exists $u_{c}$ ( $c\leq 0$ ) such that $u_{c}\geq 1/\kappa$ ; then $u_{c}$ dominates the initial data $u^{0}_{n}$ and thus, the solutions $u_{n}$ as well, which follows from Lemma 3.6. Consequently, the sequence $\{u_{n}\}$ is bounded in $L^{\infty}(\Omega\times(0,\infty))$ , so it converges to $u$ weakly* in this space, whence $u\in L^{\infty}(\Omega\times(0,\infty))$ .

Put

[TABLE]

Fix $T>0$ . As the sequence $\{u_{n}\}$ is bounded in $L^{\infty}(Q_{T})$ , so are the sequences $\{u_{n}f_{n}\}$ , $\{u_{n}f_{xn}\}$ , $\{\Phi_{n}\}$ , $\{\Phi_{xn}\}$ , $\{\Psi_{n}\}$ , and $\{E_{n}\}$ . Thus, there is no loss of generality in assuming

[TABLE]

where we write $\Phi$ for $\Phi(\cdot,u(\cdot))$ , etc. It follows from (3.18) that $\nabla\Phi_{n}\to\nabla\Phi$ in the sense of distributions. The approximate solutions satisfy the energy inequality and (1.24) while their initial energy is bounded, so we see from (3.12) that the sequence $\nabla\Phi_{n}$ is bounded in $L^{2}(Q_{T})$ . Consequently, $\Phi\in L^{2}(0,T;H^{1}(\Omega))$ and

[TABLE]

Let us check that $u$ is a weak solution of (1.1)–(1.3) on $[0,T]$ . Take an admissible test function $\varphi$ . Writing the weak setting for the approximate solution, we have

[TABLE]

It follows from (3.17), (3.18), and (3.19) that we can pass to the limit in (3.20) and obtain (1.15) for $u$ . Thus, $u$ is indeed a weak solution.

Let us show that $u$ satisfies the energy inequality on $[0,T]$ in the sense of measures. Taking a smooth nonnegative test function $\varphi\in C^{\infty}$ vanishing outside of $[0,T]$ , we write the energy inequality in the sense of measures for the approximate solutions:

[TABLE]

Convergences (3.18) ensure that we can pass to the limit in all the terms but for the first one on the right-hand side. As for the latter, it follows from (3.19) that $\sqrt{\varphi}\,\nabla\Phi_{n}\to\sqrt{\varphi}\,\nabla\Phi$ weakly in $L^{2}(Q_{T})$ , whence

[TABLE]

and the energy inequality follows.

Let us check (1.24). The approximate solutions satisfy

[TABLE]

so by virtue of (3.11) we obtain

[TABLE]

It follows from (3.17) and (3.18) that

[TABLE]

so we get

[TABLE]

Now sending $\varepsilon\to 0$ we recover (1.24).

Let us show that $u$ satisfies the entropy dissipation inequality on $[0,T]$ in the sense of measures. Let $\varphi\in C^{\infty}$ be a smooth nonnegative test function vanishing outside of $[0,T]$ . The approximate solutions satisfy the entropy dissipation inequality in the sense of measures, so we have

[TABLE]

Consequently, for any $\delta>0$ we have

[TABLE]

Observe that

[TABLE]

We claim that

[TABLE]

Then, taking into account (3.18), we can pass to the limit in (3.21) obtaining

[TABLE]

On the set $\{(x,t)\in Q_{T}\colon u(x,t)=0\}$ we have $uf_{x}=0$ (by virtue of (1.10)), $\Phi_{x}=0$ and $\Phi=0$ , whence also $\nabla\Phi=0$ a. e. on this set. Thus, we can write

[TABLE]

Letting $\delta\to 0$ , by Beppo Levi’s theorem we obtain the energy inequality.

To prove the technical claim (3.27), we use a variant of the Banach-Alaoglu theorem in varying $L^{2}(\mathrm{d}\mu^{n})$ spaces:

Lemma 3.10.

Let $\mathcal{O}\subset\mathbb{R}^{N}$ be an open set, $\mu_{n}$ a sequence of finite non-negative Radon measures narrowly converging to $\mu$ , and $v_{n}$ a sequence of vector fields on $\mathcal{O}$ . If

[TABLE]

then there exists $v\in L^{2}(\mathcal{O},\mathrm{d}\mu)$ such that, up to extraction of some subsequence,

[TABLE]

and

[TABLE]

The proof of this fact by optimal transport techniques can be found in [2]; this lemma also follows from a variant of the Banach-Alaoglu theorem [25, Proposition 5.3]. We will apply this lemma with $\mathcal{O}=Q_{T}$ , $v_{n}$ from (3.26), and the sequence of measures $\mathrm{d}\mu_{n}(t,x):=\frac{\varphi(t)}{\max(u_{n},\delta)}\,\mathrm{d}x\,\mathrm{d}t$ , which converges narrowly to $\mathrm{d}\mu(t,x)=\frac{\varphi(t)}{\max(u,\delta)}\,\mathrm{d}x\,\mathrm{d}t$ due to the strong convergence (3.25). Extracting a subsequence if needed, we see that there is a vector-field $v\in L^{2}(\mathcal{O},\mathrm{d}\mu)$ verifying (3.28) and (3.29). On the other hand, by (3.25) and (3.26),

[TABLE]

weakly in $L^{1}(Q_{T})$ . Evoking (3.28), we find that

[TABLE]

for all test functions $\boldsymbol{\zeta}$ . By density, we conclude that $v=-\nabla\Phi+\Phi_{x}+uf_{x}$ in $L^{2}(\mathcal{O},\mathrm{d}\mu)$ , and (3.27) follows from (3.29).

Inequality (1.25) is proved in the same way as (1.24) given that it holds for the approximate solutions.

Inequalities (3.13)–(3.15) follow from correspondent inequalities for approximate solutions (Lemmas 3.2 and 3.6), as we obviously have

[TABLE]

where the approximations $\hat{u}_{n}$ are constructed in the same way as $u_{n}$ .

Contraction (3.15) implies the uniqueness of $u$ .

To obtain the upper bound (1.23), we define $c\leq 0$ by (3.8) and thus have $u^{0}\leq u_{c}$ on $\Omega$ , whence in view of contraction (3.13),

[TABLE]

Recalling the formula (3.7) for the norm of $u_{c}$ , we obtain the upper bound.

To obtain the lower $L^{1}$ -bound (1.26), we take $u_{c}=m$ in (3.14), obtaining

[TABLE]

as required. ∎

3.3. Nonnegative initial data

If initial data (1.3) is only nonnegative, we approximate it with positive functions and reuse the proof of Theorem 3.9 to establish the existence of solutions to (1.1)–(1.3) as stated in Theorem 1.9 (but not uniqueness, owing to the loss of contraction).

Proof of Theorem 1.9.

Take a decreasing sequence $\varepsilon_{n}\to 0$ and set

[TABLE]

By Theorem 3.9, there exists a weak solution $u_{n}$ of (1.1)–(1.3) with the initial data $u^{0}_{n}$ . Contraction (3.15) ensures the comparison principle for this sequence of solutions, whence $u_{n+1}\leq u_{n}$ a. e. in $\Omega\times(0,\infty)$ . Consequently, there exists the monotone limit $u\in L^{\infty}(\Omega\times(0,\infty))$ and moreover, we obviously have the convergences (3.18). From this moment on, the proof copies that of Theorem 3.9 except that (3.13) and (3.14) hold almost everywhere rather then everywhere. ∎

We conclude by proving Theorems 1.8 and 1.13.

Proof of Theorem 1.8.

Let $D=\left\{(x,\Phi(x,u))\colon x\in\Omega,u>0\right\}$ and consider the function $\Xi\colon D\to[0,\infty)$ implicitly defined by the equation

[TABLE]

As $\Phi$ is monotonous with respect to its second argument, $\Xi$ is uniquely defined. Clearly, $\Xi$ is $C^{2}$ .

Fix $u\in U$ . We claim that there exists a sequence of functions $\Phi_{n}\in C(\overline{\Omega})\cap C^{\infty}(\Omega)$ such that

[TABLE]

Indeed, take a sequence $\{\delta_{n}\}$ , where $\delta_{n}>0$ and $\delta_{n}\to 0$ , put $\widetilde{\Phi}_{n}(x)=\Phi(x,u(x))+\delta_{n}$ , and let $\widetilde{\Phi}_{n}^{\varepsilon}$ be the mollification of $\widetilde{\Phi}_{n}$ . Observe that $\widetilde{\Phi}_{n}$ is strictly positive and so is $\widetilde{\Phi}_{n}^{\varepsilon}$ . It suffices to show that for any $n$ sufficiently large there exists $\varepsilon_{n}>0$ such that whenever $\varepsilon<\varepsilon_{n}$ , we have

[TABLE]

If the second alternative in (1.11) holds, for every $x\in\Omega$ we have

[TABLE]

as $u\to+\infty$ . This implies that $D=\Omega\times(0,\infty)$ , so (3.30) obviously holds with any $\varepsilon$ .

Assume the first alternative in (1.11). Take $\xi_{0}\geq\|u\|_{L^{\infty}(\Omega)}$ such that $f(x,\xi)$ does not depend on $x$ if $\xi\geq\xi_{0}$ and set

[TABLE]

We have:

[TABLE]

Thus, for large $n$ we have

[TABLE]

Upon mollification,

[TABLE]

For a fixed $n$ , the function $\Phi(\cdot,\xi_{0}+1)$ is continuous on $\overline{\Omega}$ , so the mollifications $\Phi^{\varepsilon}(\cdot,\xi_{0}+1)$ converge to it uniformly on $\overline{\Omega}$ as $\varepsilon\to 0$ . Consequently,

[TABLE]

for all $x\in\Omega$ , proving (3.30).

Taking a sequence $\{\Phi_{n}\}$ as above, we can set $u_{n}(x)=\Xi(x,\Phi_{n}(x))$ , so that $\Phi_{n}(x)=\Phi(x,u_{n}(x))$ . Clearly, $u_{n}\in C^{2}(\Omega)$ and $u_{n}>0$ . Further, the sequence $\{u_{n}\}$ is bounded in $L^{\infty}(\Omega)$ because so is $\{\Phi_{n}\}$ , and due to the continuity of $\Xi$ we have

[TABLE]

As a consequence, for $f_{n}=f(x,u_{n}(x))$ and $E_{n}=E(x,u_{n}(x))$ we have

[TABLE]

where we write $f$ for $f(\cdot,u(\cdot))$ , etc. In particular, there is no loss of generality in assuming a lower bound

[TABLE]

(positivity by virtue of (1.21)), where $c$ is obviously independent not only of $u_{n}$ but of $u$ as well.

Define

[TABLE]

By Theorem 2.9, there exist a function

[TABLE]

where $\sigma>0$ , and a constant $C>0$ such that

[TABLE]

In particular, as $u_{n}\in\widetilde{U}$ , we see that

[TABLE]

where $v_{n}=v(u_{n}(x))$ .

Let us check that we can pass to the limit in (3.33). First, it follows from (3.32) that

[TABLE]

Next, note that we clearly have

[TABLE]

and thus, again using (3.32), we obtain

[TABLE]

Finally, as $u_{n}$ is smooth and positive, we can write

[TABLE]

On the set $[u=0]$ we have $uf_{x}=0$ by (1.10), $\Phi_{x}=0$ , and $\Phi=0$ , the last equality implying $\nabla\Phi=0$ a. e. on $[u=0]$ . Thus, we can write

[TABLE]

To sum up, we have

[TABLE]

which is even stronger than (1.22). ∎

Proof of Theorem 1.13.

Let $U\subset L^{\infty}_{+}$ be the set of functions such that for any $v\in U$ , we have $\Phi(\cdot,v(\cdot))\in H^{1}(\Omega)$ and $\|v\|_{L^{1}(\Omega)}\geq c$ . By Theorem 1.8 we have the entropy-entropy production inequality (1.22) for $U$ .

Let $u$ be a weak solution of (1.1)–(1.3) with the initial data satisfying (1.28). It follows from the lower $L^{1}$ -bound (1.26) that $u(t)\in U$ for a. a. $t>0$ . Combining the entropy dissipation and entropy-entropy production inequalities, we obtain

[TABLE]

Letting $e(t)=\mathcal{E}(u(t))\mathrm{e}^{C_{U}^{-1}t}$ , we see that $\partial_{t}e(t)\leq 0$ in the sense of measures, whence $e$ a. e. coincides with a nonincreasing function. Moreover,

[TABLE]

by virtue of (1.25), so $e(t)\leq\mathcal{E}(u^{0})$ for a. a. $t>0$ , yielding (1.27) with $\gamma=C_{U}^{-1}$ . ∎

Acknowledgment

The idea of this paper originated from conversations of the second author with Goro Akagi and Yann Brenier during a stay at ESI in Vienna. He would like to thank Goro Akagi and Yann Brenier for the inspiring discussions and correspondence, Ulisse Stefanelli for the invitation to the thematic program Nonlinear Flows at ESI, and ESI for hospitality. The research was partially supported by the Portuguese Government through FCT/MCTES and by the ERDF through PT2020 (projects UID/MAT/00324/2019, PTDC/MAT-PUR/28686/2017 and TUBITAK/0005/2014).

Conflict of interest: none

Bibliography43

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] H. W. Alt and S. Luckhaus. Quasilinear elliptic-parabolic differential equations. Math. Z. , 183(3):311–341, 1983.
2[2] L. Ambrosio, N. Gigli, and G. Savaré. Gradient Flows: in Metric Spaces and in the Space of Probability Measures . Basel: Birkhäuser Basel, 2008.
3[3] V. Barbu. Generalized solutions to nonlinear Fokker-Planck equations. J. Differential Equations , 261(4):2446–2471, 2016.
4[4] M. Bertsch and D. Hilhorst. A density dependent diffusion equation in population dynamics: stabilization to equilibrium. SIAM J. Math. Anal. , 17(4):863–883, 1986.
5[5] T. Bodineau, J. Lebowitz, C. Mouhot, and C. Villani. Lyapunov functionals for boundary-driven nonlinear drift-diffusion equations. Nonlinearity , 27(9):2111–2132, 2014.
6[6] R. S. Cantrell, C. Cosner, Y. Lou, and C. Xie. Random dispersal versus fitness-dependent dispersal. J. Differential Equations , 254(7):2905–2941, 2013.
7[7] J. A. Carrillo, A. Jüngel, P. A. Markowich, G. Toscani, and A. Unterreiter. Entropy dissipation methods for degenerate parabolic problems and generalized Sobolev inequalities. Monatsh. Math. , 133(1):1–82, 2001.
8[8] L. Chizat, G. Peyré, B. Schmitzer, and F.-X. Vialard. An interpolating distance between optimal transport and Fisher–Rao metrics. Foundations of Computational Mathematics , 18(1):1–44, 2018.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Nonlinear Fokker-Planck equations with

Abstract.

1. Introduction

1.1. Setting

Remark 1.1*.*

Remark 1.2*.*

Remark 1.3*.*

Remark 1.4*.*

Remark 1.5*.*

Remark 1.6*.*

1.2. Energy and entropy

Definition 1.7**.**

Theorem 1.8** (Entropy-entropy production inequality).**

Theorem 1.9** (Existence of weak solutions).**

Remark 1.10*.*

Remark 1.11*.*

Remark 1.12*.*

Theorem 1.13** (Convergence to equilibrium).**

1.3. Motivation and background

2. Generalized dissipation inequalities

2.1. Setting

Theorem 2.1**.**

Remark 2.2*.*

Remark 2.3*.*

Remark 2.4*.*

2.2. Strategy of the proof of Theorem 2.1

2.3. Proof of Theorem 2.1

Lemma 2.5**.**

Proof.

Lemma 2.6**.**

Proof.

Lemma 2.7**.**

Proof.

2.4. Generalisations and specialisations

Theorem 2.8**.**

Theorem 2.9**.**

Remark 2.10*.*

Proof.

3. Technicalities

3.1. Positive classical solutions

Lemma 3.1**.**

Proof.

Lemma 3.2** (L1L^{1}L1-contraction for positive classical solutions).**

Proof.

Remark 3.3*.*

Remark 3.4*.*

Remark 3.5*.*

Lemma 3.6** (Restricted L1L^{1}L1-contraction).**

Proof.

Lemma 3.7**.**

Proof.

3.2. Positive initial data

Lemma 3.8**.**

Proof.

Theorem 3.9** (Solvability for positive data).**

Proof.

Lemma 3.10**.**

3.3. Nonnegative initial data

Proof of Theorem 1.9.

Proof of Theorem 1.8.

Proof of Theorem 1.13.

Acknowledgment

Conflict of interest: none

*Remark 1.1**.*

*Remark 1.2**.*

*Remark 1.3**.*

*Remark 1.4**.*

*Remark 1.5**.*

*Remark 1.6**.*

Definition 1.7.

Theorem 1.8 (Entropy-entropy production inequality).

Theorem 1.9 (Existence of weak solutions).

*Remark 1.10**.*

*Remark 1.11**.*

*Remark 1.12**.*

Theorem 1.13 (Convergence to equilibrium).

Theorem 2.1.

*Remark 2.2**.*

*Remark 2.3**.*

*Remark 2.4**.*

Lemma 2.5.

Lemma 2.6.

Lemma 2.7.

Theorem 2.8.

Theorem 2.9.

*Remark 2.10**.*

Lemma 3.1.

Lemma 3.2 ( $L^{1}$ -contraction for positive classical solutions).

*Remark 3.3**.*

*Remark 3.4**.*

*Remark 3.5**.*

Lemma 3.6 (Restricted $L^{1}$ -contraction).

Lemma 3.7.

Lemma 3.8.

Theorem 3.9 (Solvability for positive data).

Lemma 3.10.