Convex Sobolev inequalities related to unbalanced optimal transport

Stanislav Kondratyev; Dmitry Vorotnikov

arXiv:1904.04112·math.FA·April 9, 2019

Convex Sobolev inequalities related to unbalanced optimal transport

Stanislav Kondratyev, Dmitry Vorotnikov

PDF

TL;DR

This paper investigates the decay of relative entropies in nonlinear drift-diffusion-reaction equations modeled as gradient flows in unbalanced optimal transport, establishing new inequalities without requiring convexity of the functionals.

Contribution

It introduces novel isoperimetric-type inequalities for controlling relative entropies in gradient flows over Radon measures, even without geodesic convexity.

Findings

01

Proves exponential decay of relative entropies in the studied equations.

02

Establishes new inequalities linking entropies and their productions.

03

Extends analysis to non-convex functionals in unbalanced optimal transport.

Abstract

We study the behaviour of various Lyapunov functionals (relative entropies) along the solutions of a family of nonlinear drift-diffusion-reaction equations coming from statistical mechanics and population dynamics. These equations can be viewed as gradient flows over the space of Radon measures equipped with the Hellinger-Kantorovich distance. The driving functionals of the gradient flows are not assumed to be geodesically convex or semi-convex. We prove new isoperimetric-type functional inequalities, allowing us to control the relative entropies by their productions, which yields the exponential decay of the relative entropies.

Equations215

\partial_{t} ρ

\partial_{t} ρ

ρ \frac{\partial f}{\partial ν}

ρ

P (A; Ω) \geq C_{Ω} min (∣ A ∣^{\frac{d - 1}{d}}, ∣Ω ∖ A ∣^{\frac{d - 1}{d}}) .

P (A; Ω) \geq C_{Ω} min (∣ A ∣^{\frac{d - 1}{d}}, ∣Ω ∖ A ∣^{\frac{d - 1}{d}}) .

g (1) = 0; g^{'} (s) > 0 (s > 0),

g (1) = 0; g^{'} (s) > 0 (s > 0),

ψ (1) = 0, ψ (s) > 0 (s \neq = 1),

ψ \in C^{2} (0, + \infty), ψ^{''} (s) > 0 (s > 0, s \neq = 1),

s \to \infty lim ψ^{'} (x) = \infty,

∣ g (s) ∣ + s ∣ g^{'} (s) ∣ \leq h (s) a. \leavevmode \nobreak a. s > 0; h \in L_{loc}^{1} [0, \infty),

s g (s) \in C ([0, + \infty)) .

\int_{Ω} ρ_{\infty} d x = 1.

\int_{Ω} ρ_{\infty} d x = 1.

f = f (x, ρ (x)) := - g (\frac{ρ ( x )}{ρ _{\infty} ( x )}) .

f = f (x, ρ (x)) := - g (\frac{ρ ( x )}{ρ _{\infty} ( x )}) .

0 \leq E_{ψ} (ρ) := \int_{Ω} ψ (\frac{ρ}{ρ _{\infty}}) ρ_{\infty} d x,

0 \leq E_{ψ} (ρ) := \int_{Ω} ψ (\frac{ρ}{ρ _{\infty}}) ρ_{\infty} d x,

\partial_{t} E_{ψ} (ρ_{t}) = - D E_{ψ} (ρ_{t}),

\partial_{t} E_{ψ} (ρ_{t}) = - D E_{ψ} (ρ_{t}),

D E_{ψ} (ρ) := \int_{Ω} g^{'} (\frac{ρ}{ρ _{\infty}}) ψ^{''} (\frac{ρ}{ρ _{\infty}}) \nabla (\frac{ρ}{ρ _{\infty}})^{2} ρ d x + \int_{Ω} g (\frac{ρ}{ρ _{\infty}}) ψ^{'} (\frac{ρ}{ρ _{\infty}}) ρ d x

D E_{ψ} (ρ) := \int_{Ω} g^{'} (\frac{ρ}{ρ _{\infty}}) ψ^{''} (\frac{ρ}{ρ _{\infty}}) \nabla (\frac{ρ}{ρ _{\infty}})^{2} ρ d x + \int_{Ω} g (\frac{ρ}{ρ _{\infty}}) ψ^{'} (\frac{ρ}{ρ _{\infty}}) ρ d x

r = \frac{ρ}{ρ _{\infty}},

r = \frac{ρ}{ρ _{\infty}},

E_{ψ} (ρ) = \int_{Ω} ψ (r) d ρ_{\infty}

E_{ψ} (ρ) = \int_{Ω} ψ (r) d ρ_{\infty}

D E_{ψ} (ρ) = \int_{Ω} r g (r) ψ^{'} (r) d ρ_{\infty} + \int_{Ω} r g^{'} (r) ψ^{''} (r) ∣\nabla r ∣^{2} d ρ_{\infty}

ψ_{g} (s) := \int_{1}^{s} g (ξ) d ξ,

ψ_{g} (s) := \int_{1}^{s} g (ξ) d ξ,

E_{ψ} (ρ) ≲ D E_{ψ} (ρ) .

E_{ψ} (ρ) ≲ D E_{ψ} (ρ) .

grad_{H K} E (ρ) = - div (ρ \nabla \frac{δ E}{δ ρ}) + u \frac{δ E}{δ ρ} .

grad_{H K} E (ρ) = - div (ρ \nabla \frac{δ E}{δ ρ}) + u \frac{δ E}{δ ρ} .

\partial_{t} ρ = - grad_{H K} D E_{ψ_{g}} (ρ), ρ (0) = ρ^{0} .

\partial_{t} ρ = - grad_{H K} D E_{ψ_{g}} (ρ), ρ (0) = ρ^{0} .

\partial_{t} ρ = - grad_{W} D E_{ψ_{g}} (ρ)

\partial_{t} ρ = - grad_{W} D E_{ψ_{g}} (ρ)

D E_{ψ}^{W} (ρ) := \int_{Ω} r g^{'} (r) ψ^{''} (r) ∣\nabla r ∣^{2} d ρ_{\infty} .

D E_{ψ}^{W} (ρ) := \int_{Ω} r g^{'} (r) ψ^{''} (r) ∣\nabla r ∣^{2} d ρ_{\infty} .

\partial_{t} ρ = - grad_{H} D E_{ψ_{g}} (ρ)

\partial_{t} ρ = - grad_{H} D E_{ψ_{g}} (ρ)

D E_{ψ}^{H} (ρ) := \int_{Ω} r g (r) ψ^{'} (r) d ρ_{\infty} .

D E_{ψ}^{H} (ρ) := \int_{Ω} r g (r) ψ^{'} (r) d ρ_{\infty} .

D E_{ψ}^{W} (ρ) + D E_{ψ}^{H} (ρ) = D E_{ψ} (ρ) .

D E_{ψ}^{W} (ρ) + D E_{ψ}^{H} (ρ) = D E_{ψ} (ρ) .

E_{ψ} (ρ) ≲ D E_{ψ}^{H} (ρ)

E_{ψ} (ρ) ≲ D E_{ψ}^{H} (ρ)

E_{ψ} (ρ) ≲ D E_{ψ}^{W} (ρ)

E_{ψ} (ρ) ≲ D E_{ψ}^{W} (ρ)

ρ_{n} = ρ_{\infty} \frac{n}{n - 1} 1_{(\frac{1}{n}, 1)}

ρ_{n} = ρ_{\infty} \frac{n}{n - 1} 1_{(\frac{1}{n}, 1)}

ψ (s) = {\frac{1}{p ( p - 1 )} (s^{p} - p s + p - 1), s lo g s - s + 1, if 1 < p \leq 2 if p = 1

ψ (s) = {\frac{1}{p ( p - 1 )} (s^{p} - p s + p - 1), s lo g s - s + 1, if 1 < p \leq 2 if p = 1

\int_{Ω} r^{p} d ρ_{\infty} - (\int_{Ω} r d ρ_{\infty})^{p} ≲ \int_{Ω} r^{p - 2} ∣\nabla r ∣^{2} d ρ_{\infty}, 1 < p \leq 2.

\int_{Ω} r^{p} d ρ_{\infty} - (\int_{Ω} r d ρ_{\infty})^{p} ≲ \int_{Ω} r^{p - 2} ∣\nabla r ∣^{2} d ρ_{\infty}, 1 < p \leq 2.

\int_{Ω} r^{p} d ρ_{\infty} - (\int_{Ω} r d ρ_{\infty})^{p} ≲ \int_{Ω} r^{p - 2} ∣\nabla r ∣^{2} d ρ_{\infty} + \int_{Ω} r lo g (\frac{r}{\int _{Ω} r d ρ _{\infty}}) (r^{p - 1} - (\int_{Ω} r d ρ_{\infty})^{p - 1}) d ρ_{\infty}, p > 2.

\int_{Ω} r^{p} d ρ_{\infty} - (\int_{Ω} r d ρ_{\infty})^{p} ≲ \int_{Ω} r^{p - 2} ∣\nabla r ∣^{2} d ρ_{\infty} + \int_{Ω} r lo g (\frac{r}{\int _{Ω} r d ρ _{\infty}}) (r^{p - 1} - (\int_{Ω} r d ρ_{\infty})^{p - 1}) d ρ_{\infty}, p > 2.

\int_{Ω} r^{p} - (\int_{Ω} r)^{p} ≲ (\int_{Ω} r)^{1 - α} \int_{Ω} r^{p + α - 3} ∣\nabla r ∣^{2} .

\int_{Ω} r^{p} - (\int_{Ω} r)^{p} ≲ (\int_{Ω} r)^{1 - α} \int_{Ω} r^{p + α - 3} ∣\nabla r ∣^{2} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Convex Sobolev inequalities related to unbalanced optimal transport

Stanislav Kondratyev

CMUC, Department of Mathematics, University of Coimbra, 3001-501 Coimbra, Portugal

[email protected]

and

Dmitry Vorotnikov

CMUC, Department of Mathematics, University of Coimbra, 3001-501 Coimbra, Portugal

[email protected]

Abstract.

We study the behaviour of various Lyapunov functionals (relative entropies) along the solutions of a family of nonlinear drift-diffusion-reaction equations coming from statistical mechanics and population dynamics. These equations can be viewed as gradient flows over the space of Radon measures equipped with the Hellinger-Kantorovich distance. The driving functionals of the gradient flows are not assumed to be geodesically convex or semi-convex. We prove new isoperimetric-type functional inequalities, allowing us to control the relative entropies by their productions, which yields the exponential decay of the relative entropies.

Keywords: functional inequalities, optimal transport, reaction-diffusion, fitness-driven dispersal, entropy, exponential decay

MSC [2010] 26D10, 35K57, 35B40, 49Q20, 58B20

1. Introduction

The unbalanced optimal transport [36, 30, 13, 35, 14, 43] interpolates between the classical Monge-Kantorovich transport [45, 46] and the optimal information transport [41]. It equips the space of finite Radon measures with a formal Riemannian structure so that certain classes of reaction-diffusion equations and systems can be interpreted as gradient flows. This paper continues our investigation [30, 29, 31, 33, 32] of such gradient flows and associated functional inequalities, see also [12, 24, 23] for related studies.

The class of PDEs that we consider in this paper is

[TABLE]

Here $f=f(x,\rho(x,t))$ is a nonlinear function of $x$ and $\rho$ which is required to have a certain structure specified below in (1.12), and $\Omega\subset\mathbb{R}^{d}$ is an open connected bounded domain admitting the relative isoperimetric inequality, cf. [40],

[TABLE]

All our results remain valid if $\Omega$ is a periodic box $\mathbb{T}^{d}$ ; in this case (1.2) is omitted.

The drift-diffusion-reaction equation (1.1) appears in statistical mechanics [19]. It also describes nonlinear fitness-driven models of population dynamics, cf. [38, 15, 16, 25, 33], where it is assumed that the dispersal strategy is determined by a local intrinsic characteristic of organisms called fitness. We refer to Section 2 and to [33] for more detailed discussions.

Let $g\colon(0,\infty)\to\mathbb{R}$ and $\psi\colon[0,\infty)\to\mathbb{R}$ be fixed $C^{1}$ -smooth functions, which satisfy the following assumptions:

[TABLE]

Let $\rho_{\infty}\colon\overline{\Omega}\to\mathbb{R}$ be a fixed smooth strictly positive function satisfying

[TABLE]

Define

[TABLE]

Thus, the functions $g$ and $\rho_{\infty}$ determine the problem (1.1)–(1.3), and the function $\psi$ is merely needed to define a Lyapunov functional for this problem,

[TABLE]

which will be referred to as the relative entropy. Obviously, $\mathcal{E}_{\psi}(\rho)=0$ if and only if $\rho\equiv\rho_{\infty}$ . Formally calculating $\partial_{t}\mathcal{E}_{\psi}(\rho_{t})$ along a solution of (1.1)–(1.3) we obtain

[TABLE]

where the entropy production $D\mathcal{E}_{\psi}$ is defined by

[TABLE]

Setting

[TABLE]

we can write

[TABLE]

Note that problem (1.1)–(1.3) can be viewed as a formal gradient flow (with respect to the unbalanced Hellinger-Kantorovich Riemannian structure) of the driving functional $D\mathcal{E}_{\psi_{g}}(\rho)$ , where

[TABLE]

see Section 2 for the details. We are interested in the exponential decay of the Lyapunov functional (1.14) along the trajectories of this gradient flow. This is related to the entropy-entropy production inequalities of the form

[TABLE]

They can be viewed as unbalanced generalizations of the convex Sobolev inequalities [2, 3, 27], see Section 2.

The main results of the paper are convex Sobolev inequalities akin to (1.17), see Theorems 3.5 and 4.1, and existence and asymptotics of weak solutions to (1.1)–(1.3), see Theorem 3.6.

2. Background and discussion

Assume for a while that $\Omega$ is a torus or is convex, although this is not required for our main results. The gradient of a scalar functional $\mathcal{E}$ on the space of finite Radon measures over $\overline{\Omega}$ with respect to the Hellinger-Kantorovich Riemannian structure (also known as the Wasserstein-Fisher-Rao one) was calculated in [30, 35]:

[TABLE]

The first term on the right-hand side is the Otto-Wasserstein gradient $\operatorname{grad}_{W}\mathcal{E}(\rho)$ , cf. [42, 45], and the second one is the Hellinger-Fisher-Rao gradient $\operatorname{grad}_{H}\mathcal{E}(\rho)$ , cf. [28]. It is easy to compute that $\frac{D\mathcal{E}_{\psi_{g}}(\rho)}{\delta\rho}=-f(x,\rho)$ , hence (1.1)–(1.3) may be interpreted as a gradient flow:

[TABLE]

The production of the relative entropy $\mathcal{E}_{\psi}(\rho)$ along the Otto-Wasserstein gradient flow

[TABLE]

is

[TABLE]

Similarly, the production of the same entropy along the Hellinger gradient flow

[TABLE]

is

[TABLE]

In the case of non-convex $\Omega$ we can abuse the terminology and still refer to (1.1)–(1.3) as to a gradient flow.

It is clear that

[TABLE]

Generally speaking, neither the Otto-Wasserstein nor the Fisher-Rao entropy production are able to control the relative entropy, so (1.17) is a result of an interplay between the reaction, diffusion and drift. A simple counterexample to

[TABLE]

is $\rho_{\infty}1_{A}$ with $A$ being a proper subset of $\Omega$ . Indeed, $D\mathcal{E}_{\psi}^{H}(\rho_{\infty}1_{A})=0$ due to (1.5), (1.9) and (1.10). It is easy to construct a smooth example by mollifying this one. A trivial counterexample to

[TABLE]

is $k\rho_{\infty}$ where $k\neq 1$ is a non-negative constant.

*Remark 2.1**.*

Note that the two counterexamples intersect at $\rho\equiv 0$ , which violates our target inequality (1.17). However, we will observe, cf. Theorems 3.5 and 4.1, that it suffices keep the total mass $\int_{\Omega}\rho$ bounded away from [math] to secure (1.17).

In view of (1.11), in order to obtain more interesting and instructive examples we should restrict ourselves to probability densities $\rho$ . The sequence

[TABLE]

of probability densities on $\Omega=(0,1)$ is a counterexample to (2.4). Indeed, the left-hand side of (2.4) is of order $n^{-1}$ and the right-hand side is $\lesssim n^{-2}$ .

Inequality (2.5) for $\int_{\Omega}\rho=1$ deserves a more detailed discussion.

Let us start with considering $g(s)=\log s$ . In this case, as first observed in the seminal paper [26], the gradient flow (2.2) is the linear Fokker-Planck equation, and the celebrated Bakry-Émery approach allows one to prove (2.5) for $\Omega=\mathbb{R}^{d}$ [2, 3, 27]. However, it is crucial to have concavity of $\frac{1}{\psi^{\prime\prime}(s)}$ , which we never assume in this work. These instances of (2.5) are referred to as convex Sobolev inequalities, which inspired the title of our paper. The particular case

[TABLE]

implies the log-Sobolev inequality for $p=1$ , the Poincaré inequality for $p=2$ and Beckner’s inequalities [4] for $1<p<2$ . Namely, (2.5) may be rewritten as

[TABLE]

In contrast, our assumptions on $\psi$ admit any $p>2$ in (2.6), which yields the following “Beckner-Hellinger inequality”:

[TABLE]

Consider now the case $g(s)=\frac{s^{\alpha-1}-1}{\alpha-1}$ , $\alpha>0$ , $\alpha\neq 1$ . Assume for simplicity that $|\Omega|=1$ and $\rho_{\infty}\equiv 1$ . Then (2.2) is the porous medium equation, cf. [42]. The alleged inequality (2.5) for the relative entropy (2.6), $p\in(1,\infty)$ , reads

[TABLE]

Setting $q:=\frac{2p}{p+\alpha-1}$ , $l:=\frac{p+\alpha-1}{2}$ , $u:=r^{l}$ , we rewrite (2.9) in the form

[TABLE]

The inequality

[TABLE]

similar to (2.10) appears in [11], see also [10, 18]. It holds for $0<q<2,$ $lq>1$ , that is, for $\alpha>1$ , $p>1$ . Assume for a moment that the the relative entropy, i.e., the left-hand side of (2.11), is a priori bounded. Since $ql\geq 1$ , the mass $\int_{\Omega}u^{1/l}$ is a priori bounded. Consequently, (2.11) is weaker than (2.10) since the exponent $q/2$ is less than $1$ , and it is plausible that (2.10) cannot be true. Inequality (2.11) for $q=2$ is equivalent to Beckner’s inequality (2.7). As explained in [18], inequality (2.11) is wrong for $q>2$ . In this connection, our results yield the following variant of (2.10):

[TABLE]

for any $q>0$ , $q\neq 2$ , $1<lq<1+2l$ , that is, any $\alpha>0$ , $\alpha\neq 1$ , $p>1$ .

The counterparts of the alleged inequalities (2.9) and (2.10) for $p=1$ are

[TABLE]

Here $q=\frac{2}{\alpha}$ . This resembles the inequality

[TABLE]

which was established in [10, 18]. Since $q/2<1$ , (2.15) is weaker than (2.14), so it seems that (2.14) cannot be true. Our results imply the following variant of (2.14):

[TABLE]

*Remark 2.2**.*

Inequalities (2.8), (2.12), (2.16) are obtained assuming $\int_{\Omega}r\,d\rho_{\infty}=1$ (so that (3.4) is automatically satisfied), but hold without this normalization due to their homogeneity.

Many authors studied (2.5) or related inequalities in the particular case $\psi=\psi_{g}$ , that is, when the driving entropy is compared to its production, cf., e.g., [42, 45, 46, 1, 9]. In this connection, the strict geodesic convexity of the driving entropy normally plays the pivotal role. In [33] (see also [30]) we studied (1.17) for $\psi=\psi_{g}$ without assuming neither Otto-Wasserstein nor Hellinger-Kantorovich geodesic convexity (we also never assume any similar condition in the present paper). The inequalities obtained there can be further refined [32] be means of studying gradient flows in the spherical Hellinger-Kantorovich space [34, 7], which is beyond the scope of the present paper (though it may seem strange, even non-negativity of the entropy production is uncertain for the spherical Hellinger-Kantorovich flows in the case $\psi\neq\psi_{g}$ ). The proofs in the present paper are more direct and simple than in [33] due to the “quasihomogeneous structure” (1.12).

Our last example concerns $g(s)=\frac{1}{2}\log\frac{2s^{2}}{1+s^{2}}$ , which corresponds to the arctangential heat equation [6]. The relative entropy $\mathcal{E}_{\psi_{g}}$ generated by this $g$ is geodesically convex neither in the Otto-Wasserstein nor in the Hellinger-Kantorovich sense, cf. [32]. Take $\psi(s)=s\log s-s+1$ . Then we infer the following inequality resembling the log-Sobolev one:

[TABLE]

provided $\int_{\Omega}r\,d\rho_{\infty}$ is bounded away from [math].

Nonlinear Fokker-Planck equations akin to (2.2) model behaviour of various stochastic systems, see [20, 44, 27, 5]. The related drift-diffusion-reaction equation (1.1) was suggested in [19]. On the other hand, equation (1.1) belongs to the class of nonlinear models (cf. [16, 25, 47, 33, 32, 38, 15]) for the spatial dynamics of populations which are tending to achieve the ideal free distribution [22, 21] (the distribution which happens if everybody is free to choose its location) in a heterogeneous environment. The dispersal strategy is determined by a local intrinsic characteristic of organisms called fitness. The fitness manifests itself as a growth rate, and simultaneously affects the dispersal as the species move along its gradient towards the most favorable environment. In (1.1), $\rho(x,t)$ is the density of organisms, and $f(x,\rho)$ is the fitness. The equilibrium $\rho\equiv\rho_{\infty}$ when the fitness is constantly zero corresponds to the ideal free distribution. The works [17, 8, 37, 47, 30, 29, 31, 33] perform mathematical analysis of some of such fitness-driven models. Our Theorem 3.6 indicates that the populations converge to the ideal free distribution with an exponential rate.

3. Main results

We start by introducing the weak solutions to (1.1)–(1.3), following the lines of [33, 32].

Define

[TABLE]

where the integral exists by (1.9). Observe that

[TABLE]

so that $G$ is a nonnegative continuous increasing function on $[0,\infty)$ .

Set

[TABLE]

As in [33], we can write (1.1) in the form

[TABLE]

where $\Phi$ stands for $\Phi(x,\rho(x,t))$ .

Definition 3.1.

Let $\rho^{0}\in L^{\infty}(\Omega)$ ; $Q_{T}:=\Omega\times(0,T)$ . A function $\rho\in L^{\infty}(Q_{T})$ is called a weak solution of (1.1)–(1.3) on $[0,T]$ if for $r=\rho/\rho_{\infty}$ we have $G(r(\cdot))\in L^{2}(0,T;H^{1}(\Omega))$ and

[TABLE]

for any function $\varphi\in C^{1}(\overline{\Omega}\times[0,T])$ such that $\varphi(x,T)=0$ . A function $\rho\in L^{\infty}_{\text{loc}}([0,\infty);L^{\infty}(\Omega))$ is called a weak solution of (1.1)–(1.3) on $[0,\infty)$ if for any $T>0$ it is a weak solution on $[0,T]$ .

*Remark 3.2**.*

For $\rho\in L^{\infty}(Q_{T})$ we automatically have $G(r)\in L^{\infty}(Q_{T})$ , so the condition $G(r(\cdot))\in L^{2}(0,T;H^{1}(\Omega))$ is equivalent to $rg^{\prime}(r)\nabla r\in L^{2}(Q_{T})$ . Here $r=\rho/\rho_{\infty}$ .

Formally, the integrand $rg^{\prime}(r)\psi^{\prime\prime}(r)|\nabla r|^{2}$ vanishes if $r=0$ . Otherwise it can be written as

[TABLE]

This motivates the following extension of the entropy production suitable for weak solutions.

Definition 3.3.

If $\rho\in L^{\infty}(\Omega)$ and $G(r)\in H^{1}(\Omega)$ , then the entropy production is defined by

[TABLE]

*Remark 3.4**.*

Observe that although the integrand with the gradient in (3.3) is a nonnegative measurable function on $\Omega$ , the integral, and hence the entropy production, may be infinite.

The following entropy-entropy production inequality applicable to weak solutions is based on an isoperimetric-type inequality established in Section 4.

Theorem 3.5 (Entropy-entropy production inequality).

Suppose that $g$ and $\psi$ satisfy (1.5)–(1.10). Let $U\subset L_{+}^{\infty}(\Omega)$ be a set of functions such that for any $\rho\in U$ and $r=\rho/\rho_{\infty}$ , we have $G(r)\in H^{1}(\Omega)$ and

[TABLE]

Then there exists $C_{U}$ such that

[TABLE]

Proof.

The idea is to use the isoperimetric-type inequality provided by Theorem 4.1 (see Section 4). Since we are dealing with a less regular setting at the moment, we argue by approximation.

Take $\rho\in U$ and as usual, put $r=\rho/\rho_{\infty}$ . Arguing as in [33, proof of Theorem 1.7], we see that there exists a sequence of functions $G_{n}\in C(\overline{\Omega})\cap C^{\infty}(\Omega)$ taking values in $(0,a)$ , where $a<G(\infty)$ , such that

[TABLE]

Set $r_{n}(x)=G^{-1}(G_{n}(x))$ and $\rho_{n}(x)=r_{n}(x)\rho_{\infty}(x)$ , so that $G_{n}(x)=G(r_{n}(x))$ . Clearly, $r_{n}$ and $\rho_{n}$ are positive and reasonably smooth, the sequences $\{r_{n}\}$ and $\{\rho_{n}\}$ are bounded in $L^{\infty}(Q_{T})$ (specifically, the former is bounded by $G^{-1}(a)$ ), and by the continuity of $G^{-1}$ we have

[TABLE]

In particular, this implies that $\rho_{n}$ converges to $\rho$ in $L^{1}(\Omega)$ . Further, by the Lebesgue Dominated Convergence we have

[TABLE]

Thus, if we denote the infimum in (3.4) by $d_{U}$ and the supremum in (3.5) by $E_{U}$ , there is no loss of generality in assuming that $\|\rho_{n}\|_{L^{1}(\Omega)}\geq d_{U}/2$ and $\mathcal{E}_{\psi}(\rho_{n})\leq 2E_{U}$ . It follows from Theorem 4.1 that there exist $C$ and $\sigma$ both depending on $d_{U}$ and $E_{U}$ (but not on the approximation nor on $\rho$ itself) such that

[TABLE]

By the Lebesgue Dominated Convergence we have

[TABLE]

Further, we have

[TABLE]

On one hand, $\nabla G_{n}\to\nabla G$ in $L^{2}(\Omega)$ . On the other hand, the functions

[TABLE]

are uniformly bounded in $L^{\infty}(\Omega)$ , and since we obviously have

[TABLE]

we also have

[TABLE]

Using Reverse Fatou’s Lemma for products (Lemma A.1 in the Appendix), we obtain

[TABLE]

Combining this with (3.7) and (3.9), we see that we can pass to the limit in (3.8) and obtain (3.6) with $C_{U}=C$ . ∎

Theorem 3.6 (Existence and asymptotics of weak solutions).

Assume (1.5)–(1.10). Then for any $\rho^{0}\in L^{\infty}_{+}(\Omega)$ there exists a nonnegative weak solution $\rho\in L^{\infty}(\Omega\times(0,\infty))$ of problem (1.1)–(1.3) which enjoys the following properties:

(1)

$\rho$ * satisfies the entropy dissipation inequality in the sense of measures: for any smooth nonnegative compactly supported function $\chi\colon(0,T)\to\mathbb{R}$ we have*

[TABLE] 2. (2)

the initial entropy satisfies

[TABLE] 3. (3)

$\rho$ * satisfies the lower $L^{1}$ -bound*

[TABLE] 4. (4)

$\rho$ * exponentially converges to $\rho_{\infty}$ in the sense of entropy:*

[TABLE]

where $\gamma_{\psi}>0$ can be chosen uniformly over initial data satisfying

[TABLE]

with some $c,C>0$ ; 5. (5)

for any $p\in[2,+\infty)$ ,

[TABLE]

where $\gamma_{p}>0$ can be chosen uniformly over initial data satisfying

[TABLE]

Proof.

For the proof of existence, the approximating procedure used in [33] is still applicable in the current setting. As a matter of fact, the existence result in [33] requires that $|f(x,\xi)|$ is either large or does not depend on $x$ when $\xi$ is near [math] or near $+\infty$ . A similar requirement was imposed for large $\xi$ . However, these assumptions are only needed in order to ensure that any $u\in L_{+}^{\infty}(\Omega)$ can be bounded from above by a function $u_{c}\colon\Omega\to\mathbb{R}$ satisfying $f(x,u_{c}(x))\equiv cst$ and that $u$ can be bounded from below by another such function provided that $u$ is uniformly bounded away from [math]. This is still the case in the current setting. Indeed, assume for simplicity that $u$ is continuous on $\overline{\Omega}$ . Set $c=\max_{\Omega}g(u/\rho_{\infty})$ and put $u_{c}=\rho_{\infty}g^{-1}(c)$ , then clearly $f(x,u_{c}(x))=-g(u_{c}(x)/\rho_{\infty})=-c$ ; moreover, it follows from the monotonicity of $g$ that $u\leq u_{c}$ , as required. The existence of a lower bound is proved in a similar way, cf. [33, Remark 3.4].

Inequality (3.11) is proved in the same way as the analogous inequality in [33].

We prove that the solution constructed as in [33] satisfies (3.10). To this end it suffices to check that this inequality is preserved under the passage to the limit. Specifically, assume that smooth enough approximate solutions $\{\rho_{n}\}$ are uniformly bounded in $L^{\infty}(Q_{T})$ and converge to $\rho$ a. e. in $Q_{T}$ , while

[TABLE]

By the Lebesgue Dominated Convergence we have

[TABLE]

Arguing as in [33, proof of Theorem 3.9] and, in particular, taking into account that $\nabla G=0$ a. e. on the set $\{(x,t)\in Q_{T}\colon r=0\}$ and $\nabla G_{n}=0$ a. e. on the set $\{(x,t)\in Q_{T}\colon r_{n}=0\}$ , we conclude that for any $\delta>0$ we have

[TABLE]

so sending $\delta\to\infty$ and applying Beppo Levy’s theorem, we obtain

[TABLE]

or, equivalently,

[TABLE]

Combining this with (3.17) and (3.18), we obtain (3.10).

We now prove the exponential convergence of the solution to the steady state. Let $\rho$ be a weak solution of (1.1)–(1.3) with the initial data satisfying (3.14). Let $U\subset L^{\infty}_{+}$ be the set of functions such that for any $u\in U$ , we have $G(u/\rho_{\infty})\in H^{1}(\Omega)$ and $\|u\|_{L^{1}(\Omega)}\geq c$ , $\mathcal{E}_{\psi}(u)\leq C$ with the same $c$ and $C$ as in (3.14). By Theorem 3.5 we have the entropy-entropy production inequality (3.6) for $U$ . It follows from the bounds (3.11) and (3.12) that $\rho(t)\in U$ for a. a. $t>0$ . Combining the entropy dissipation and entropy-entropy production inequalities, we get

[TABLE]

in the sense of measures. Set $\gamma_{\psi}=C_{U}^{-1}$ and $\phi(t)=\mathcal{E}_{\psi}(\rho(t))e^{\gamma_{\psi}t}$ . It is easy to check that that $\partial_{t}\phi(t)\leq 0$ in the sense of measures, whence $\phi$ a. e. coincides with a nonincreasing function. Moreover,

[TABLE]

by virtue of (3.11), so $\phi(t)\leq\mathcal{E}_{\psi}(\rho^{0})$ for a. a. $t>0$ , which implies (3.13).

We will now use (3.13) with $\psi(s)=|s-1|^{p}$ , which is a $C^{2}$ -function for $p\geq 2$ , and satisfies the assumptions (1.6)–(1.8). We immediately get

[TABLE]

where $\gamma_{p}=\gamma_{\psi}/p$ . Uniform boundedness of $\|\rho^{0}\|_{L^{p}}^{p}$ implies a bound on $\mathcal{E}_{\psi}(\rho^{0})$ . ∎

4. Inequality

In this section we prove a refined version of our unbalanced convex Sobolev inequality in the smooth case.

Theorem 4.1.

Assume (1.5)–(1.10). Let $U\in C^{\infty}_{+}(\Omega)$ be such that

[TABLE]

Then there exist constants (independent of $\rho$ ) $C>0$ , $0<\alpha<\beta<\infty$ , such that

[TABLE]

The proof of Theorem 4.1 is based on the next two lemmas.

Lemma 4.2.

Fix $0<\alpha<\beta<1$ . Then

[TABLE]

Proof.

If the minimum on the right-hand side vanishes, there is nothing to prove. Otherwise the set $[\alpha<r<\beta]$ has nonzero measure. In what follows, we use some facts from geometric measure theory, which can be found in [39]. The relative perimeter of a Lebesgue measurable set $A$ of locally finite perimeter with respect to $\Omega$ is $P(A;\Omega)=|\mu_{A}|(\Omega),$ where $\mu_{A}:=\nabla 1_{A}$ is the Gauss-Green measure associated with $A$ . The support of $\mu_{A}$ is contained in the topological boundary of $A$ .

We have:

[TABLE]

The last integral is the variation of $r$ over $[\alpha<r<\beta]$ , which can be computed using the coarea formula:

[TABLE]

where we first use the observation that the support of the Gauss–Green measure associated with $[r<t]$ is disjoint with $[\alpha<r<\beta]$ whenever $t\leq\alpha$ or $t\geq\beta$ , and then we notice that if $\alpha<t<\beta$ , then the part of the support of the Gauss–Green measure of $[r<t]$ lying in $\Omega$ is contained in $[\alpha<r<\beta]$ .

Invoking the relative isoperimetric inequality (1.4), we estimate

[TABLE]

and since for $t\in(\alpha,\beta)$ we have

[TABLE]

we see that

[TABLE]

Combining this estimate with (4.3) and (4.4), we obtain (4.2). ∎

Lemma 4.3.

Given $\varepsilon>0$ , there exists $C_{\varepsilon}>0$ such that

[TABLE]

Proof.

Applying L’Hôpital’s rule for $\liminf$ , and remembering that $g$ is an increasing function, we obtain

[TABLE]

In (4.6) and (4.7) we have used the fact that for $s\neq 1$ , the signs of $g(s)$ and $\psi^{\prime}(s)$ coincide, while $\psi^{\prime\prime}(s)>0$ . Obviously, (4.6) and (4.7) imply (4.5). ∎

Proof of Theorem 4.1.

We claim that there exists $\beta>0$ such that

[TABLE]

Indeed, it follows from (1.8) (L’Hôpital’s rule) that

[TABLE]

As the entropy $\mathcal{E}_{\psi}$ is bounded on $U$ , by de la Vallée Poussin’s theorem the set $U$ is uniformly integrable. Put

[TABLE]

for any $\rho\in U$ we have

[TABLE]

where $\omega_{U}$ is the modulus of integrability of $U$ . Hence

[TABLE]

which clearly implies a lower bound on $\big{|}[\rho\geq m]\big{|}$ and a fortiori on $\big{|}[r\geq\beta]\big{|}$ with $\beta=\frac{m}{\sup\rho_{\infty}}$ .

Clearly, there is no loss in generality in assuming $\beta<1$ in (4.8).

In what follows we fix $\alpha$ and $\beta$ such that $0<\alpha<\beta<1$ and $\beta$ satisfies (4.8). Denote

[TABLE]

and also

[TABLE]

Assume for now that $\sigma>0$ . Using Lemma 4.2, we have

[TABLE]

Taking into account (4.8), we can write

[TABLE]

with $c$ independent of $\rho$ . Estimating

[TABLE]

we obtain

[TABLE]

If $\sigma=0$ , this estimate trivially holds with any $c$ . Since $\sigma$ is a priori bounded from above by $|\Omega|$ , (4.9) implies that

[TABLE]

Evoking Lemma 4.3, we obtain

[TABLE]

Using (4.10) to estimate $\sigma$ by $D_{\alpha\beta}\mathcal{E}_{\psi}$ , we obtain (4.1) ∎

Appendix A Reverse Fatou’s Lemma for products

Lemma A.1.

Let $(S,\Sigma,\mu)$ be a measure space. Suppose that $\{f_{n}\}$ is bounded in $L^{\infty}(S,\mu)$ and $\{g_{n}\}$ converges to a nonnegative limit $g$ in $L^{1}(S,\mu)$ . Then

[TABLE]

Proof.

As we have $|f_{n}g|\leq(\sup_{n}\|f_{n}\|)g$ , we can use Reverse Fatou’s Lemma obtaining

[TABLE]

Further, it is clear that

[TABLE]

Using (A.2) and (A.3) we obtain

[TABLE]

as claimed. ∎

Acknowledgment

The research was partially supported by the Portuguese Government through FCT/MCTES and by the ERDF through PT2020 (projects UID/MAT/00324/2019, PTDC/MAT-PUR/28686/2017 and TUBITAK/0005/2014).

Conflict of interest statement

We have no conflict of interest to declare.

Bibliography47

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] L. Ambrosio, N. Gigli, and G. Savaré. Gradient Flows: in Metric Spaces and in the Space of Probability Measures . Basel: Birkhäuser Basel, 2008.
2[2] A. Arnold, P. Markowich, G. Toscani, and A. Unterreiter. On convex Sobolev inequalities and the rate of convergence to equilibrium for Fokker-Planck type equations. Comm. Partial Differential Equations , 26(1-2):43–100, 2001.
3[3] D. Bakry and M. Émery. Diffusions hypercontractives. In Séminaire de Probabilités XIX 1983/84 , pages 177–206. Springer, 1985.
4[4] W. Beckner. A generalized Poincaré inequality for Gaussian measures. Proc. Amer. Math. Soc. , 105(2):397–400, 1989.
5[5] T. Bodineau, J. Lebowitz, C. Mouhot, and C. Villani. Lyapunov functionals for boundary-driven nonlinear drift-diffusion equations. Nonlinearity , 27(9):2111–2132, 2014.
6[6] Y. Brenier. Geometric origin and some properties of the arctangential heat equation. Tunis. J. Math. , 1(4):561–584, 2019.
7[7] Y. Brenier and D. Vorotnikov. On optimal transport of matrix-valued measures. Ar Xiv e-prints , Aug. 2018.
8[8] R. S. Cantrell, C. Cosner, Y. Lou, and C. Xie. Random dispersal versus fitness-dependent dispersal. J. Differential Equations , 254(7):2905–2941, 2013.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Convex Sobolev inequalities related to unbalanced optimal transport

Abstract.

1. Introduction

2. Background and discussion

Remark 2.1*.*

Remark 2.2*.*

3. Main results

Definition 3.1**.**

Remark 3.2*.*

Definition 3.3**.**

Remark 3.4*.*

Theorem 3.5** (Entropy-entropy production inequality).**

Proof.

Theorem 3.6** (Existence and asymptotics of weak solutions).**

Proof.

4. Inequality

Theorem 4.1**.**

Lemma 4.2**.**

Proof.

Lemma 4.3**.**

Proof.

Proof of Theorem 4.1.

Appendix A Reverse Fatou’s Lemma for products

Lemma A.1**.**

Proof.

Acknowledgment

Conflict of interest statement

*Remark 2.1**.*

*Remark 2.2**.*

Definition 3.1.

*Remark 3.2**.*

Definition 3.3.

*Remark 3.4**.*

Theorem 3.5 (Entropy-entropy production inequality).

Theorem 3.6 (Existence and asymptotics of weak solutions).

Theorem 4.1.

Lemma 4.2.

Lemma 4.3.

Lemma A.1.