Contraction and regularizing properties of heat flows in metric measure spaces
Giulia Luise, Giuseppe Savar\'e

TL;DR
This paper explores novel contraction and regularizing effects of heat flows in metric measure spaces, highlighting their relationships with various distances and curvature bounds, without requiring flow linearity.
Contribution
It demonstrates contraction properties of Hellinger-Kakutani distances and Csiszár divergences in general metric spaces, and links regularization effects to lower Ricci curvature bounds in $RCD(K, abla)$ spaces.
Findings
Contraction of Hellinger-Kakutani distances holds in arbitrary metric-measure spaces.
Regularizing effects depend on dual formulations of transport distances.
Results relate heat flow properties to lower Ricci curvature bounds in $RCD(K, abla)$ spaces.
Abstract
We illustrate some novel contraction and regularizing properties of the Heat flow in metric-measure spaces that emphasize an interplay between Hellinger-Kakutani, Kantorovich-Wasserstein and Hellinger-Kantorvich distances. Contraction properties of Hellinger-Kakutani distances and general Csisz\'ar divergences hold in arbitrary metric-measure spaces and do not require assumptions on the linearity of the flow. When weaker transport distances are involved, we will show that contraction and regularizing effects rely on the dual formulations of the distances and are strictly related to lower Ricci curvature bounds in the setting of metric measure spaces.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Contraction and regularizing properties
of heat flows in metric measure spaces
Abstract.
We illustrate some novel contraction and regularizing properties of the Heat flow in metric-measure spaces that emphasize an interplay between Hellinger-Kakutani, Kantorovich-Wasserstein and Hellinger-Kantorvich distances. Contraction properties of Hellinger-Kakutani distances and general Csiszár divergences hold in arbitrary metric-measure spaces and do not require assumptions on the linearity of the flow.
When weaker transport distances are involved, we will show that contraction and regularizing effects rely on the dual formulations of the distances and are strictly related to lower Ricci curvature bounds in the setting of metric measure spaces.
Key words and phrases:
Heat flows, contraction, Hellinger distance, Wasserstein distance, curvature bounds, spaces.
1991 Mathematics Subject Classification:
Primary: 49Q20, 47D07; Secondary: 30L99.
The second author is partially supported by PRIN2015 grant from MIUR for the project Calculus of Variations and by IMATI-CNR
Giulia Luise
Department of Computer Science, University College London
Gower Street, London WC1E 6BT, UK
Giuseppe Savaré ∗
Dipartimento di Matematica ‘Felice Casorati’, University of Pavia
Via Ferrata 1, 27100 Pavia, Italy
Dedicated to Alexander Mielke on the occasion of his 60th birthday
Contents
-
2.3 Kantorovich-Wasserstein and Hellinger-Kantorovich distances
-
3.2 Gradient flow of the Cheeger energy in metric-measure spaces
-
4 Contraction properties for the Heat flow in metric measure spaces
-
5 Regularizing properties of the Heat flow in metric measure spaces
1. Introduction
The study of contraction properties of norms and more general convex entropy functionals with respect to the action of Markov semigroups is a very classic subject (see e.g. [9]). More recently, the role of the Kantorovich-Rubinstein-Wasserstein metric for second order diffusion equations in the space of probability measures has been deeply investigated, starting from the pioneering contribution by F. Otto [35]. Many investigations have clarified the relations between analytic estimates depending on the structure of the generating differential operator and geometric properties of the underlying spaces, with an increasing level of generality. An incomplete list of contributions includes the contraction of a general class of evolution equations combining diffusion, interaction and drift [13], the gradient-flow structure and the geodesic convexity in Euclidean spaces [25, 35, 2], the Heat flow in Riemannian manifolds and the Ricci curvature [36, 37, 41, 17, 19, 42], Hilbert geometry [7], the duality with gradient estimates and the Alexandrov spaces [30, 21], the RCD metric measure spaces and the Bakry-Émery condition [3, 4, 5, 10, 20, 6].
In one of the most general formulations, we will deal with a metric-measure space given by a complete and separable metric space endowed with a Borel positive measure with full support satisfying the growth condition
[TABLE]
We introduce the Cheeger energy functional
[TABLE]
where
[TABLE]
is a convex, -homogeneous and lower semicontinuous functional whose proper domain provides one of the equivalent characterization of the metric Sobolev space (see also [22, 28, 39, 11, 23]). A local weak gradient can be associated to each function so that the Cheeger energy admits the integral representation
[TABLE]
The subdifferential of (whose minimal selection will be denoted by ) generates a continuous semigroup of order preserving contractions in , which is canonically attached to the metric-measure structure .
Even if in general the operators are not linear, one can prove [3] that the semigroup is contractive with respect to all the norms, ,
[TABLE]
and all the integral functionals with convex integrand
[TABLE]
A first important result we will prove in Section 4 is the extension of (4)-(5) to arbitrary convex integral functionals on evolving pairs:
[TABLE]
whenever is a lower semicontinuous convex integrand with . As a byproduct, we obtain that the action of on nonnegative functions is a contraction with respect to arbitrary Csiszár divergences (see [16, 33] and Section 2), such as the Kullback-Leibler entropy functional [29] associated to if , yielding (since is mass preserving)
[TABLE]
or the Hellinger-Kakutani distances [24, 26]
[TABLE]
associated to ,
The most relevant connections with optimal transport metrics occur when is also a quadratic form, i.e. it satisfies the parallelogram rule
[TABLE]
In this case is a linear positive selfadjoint operator in and is a linear Markov semigroup associated to a strongly local symmetric Dirichlet form on , admitting Carré du Champ which provides a bilinear extension of the weak gradient, since
[TABLE]
If every bounded function with -a.e. admits a -continuous representative (still denoted by ) which satisfies the -Lipschitz condition
[TABLE]
then satisfies (a suitable weak formulation of) the Bakry-Émery condition ,
[TABLE]
if and only if admits a (unique) extension to the space of finite Borel measures and satisfies the contraction property (see [5])
[TABLE]
here denotes the -Kantorovich-Wasserstein distance between probability measures of with finite quadratic moments
[TABLE]
In fact, this property is deeply related with the synthetic theory of metric-measure spaces with Ricci curvature bounded from below developed by Lott-Villani [34] and Sturm [40]. The combination of the Lott-Sturm-Villani condition with the quadratic property of the Cheeger energy (7) provides one of the equivalent characterizations of the so-called metric-measure space [4], which turned out to be equivalent with the Bakry-Émery functional-analytic approach we have adopted here [5].
The link between (8) and (9) becomes more apparent if we consider that (8) is in fact equivalent to the Bakry-Émery commutation estimate
[TABLE]
combined with the duality formula expressing the distance in terms of regular subsolutions to the Hamilton-Jacobi equation [36, 3, 1]
[TABLE]
thanks to the dual representation formula for :
[TABLE]
(10) shows in fact that preserves (up to an exponential factor) subsolutions to the Hamilton-Jacobi equation (11).
In Section 5 we improve (9) in two directions. First of all, we will show that after a strictly positive time exhibits a regularizing effect, providing a control of the stronger -Hellinger distance
[TABLE]
in terms of the weaker Wasserstein distance between the initial measures:
[TABLE]
where
[TABLE]
Notice that when and we obtain the asymptotic estimate
[TABLE]
proving in particular Hellinger convergence of to as , with exponential rate if .
A second and more refined estimate involves the recently introduced family of Hellinger-Kantorovich distances , , [15, 14, 27, 31, 32], which can be defined in terms of an Optimal Entropy–Transport problem [31, 32]
[TABLE]
where are the marginals of , is the Kullback-Leibler divergence
[TABLE]
and is the cost function
[TABLE]
It turns out that (corresponding to in the more general notation of [31, 32]) admits a dual dynamic representation formula [32]
[TABLE]
so that when the Bakry-Émery condition holds one has [32]
[TABLE]
Actually, the stronger Hellinger distance at time can be estimated in terms of the weaker Hellinger-Kantorovich one: for every
[TABLE]
Differently from other well known properties, the estimates (13) and (16) cannot be deduced by a regularization effect on a single initial datum, since , and are not translation invariant. In this respect, the dual dynamic approach plays a crucial role.
Plan of the paper. The paper is organized as follows: in Section 2 we will collect a few preliminary results on Csiszár divergences, Hellinger-Kakutani, Kantorovich-Wasserstein and Hellinger-Kantorovich metrics.
Section 3 is devoted to a short review of the main tools of calculus in metric-measure spaces, which are used throughout the work. A brief description of the main properties of metric measures spaces is also presented.
The last two sections contain novel results. Section 4 is dedicated to the proof of (6) in general metric measure spaces. Section 5 discusses the regularization estimates (13) and (16).
2. Distances and entropies on the space of finite measures
2.1. Csiszár divergences/Relative entropies
We first recall a few basic facts on convex and -homogeneous functionals of positive measures.
Let be a measurable space. We will denote the space of finite nonnegative measures on by . If , we say that is a common dominating measure if , . Such a always exists, for instance we may take . We will also often consider the Lebesgue decomposition of w.r.t. given by
[TABLE]
We consider the class of Csiszàr density functions
[TABLE]
Definition 2.1**.**
Let be as in (18a,b) and let with Lebesgue decomposition as in (17). The Csizár divergence associated with is defined as
[TABLE]
The -perspective functional is defined as
[TABLE]
where , , and is any common dominating measure.
If and are related by (18c) then
[TABLE]
Notice that (20) does not depend on the choice of the dominating measure , since the function is positively -homogeneous.
(21) can be easily checked by observing that is a dominating measure for the couple ; if are measurable subsets of such that
[TABLE]
we can easily calculate the densities by
[TABLE]
so that
[TABLE]
An important class of entropy functions is provided by the power like functions which have the following explicit formulas
[TABLE]
For , the entropy function generates the well known Kullback-Leibler divergence, often referred to as relative logarithmic entropy. Notice that is superlinear, so that and its corresponding perspective function is
[TABLE]
Definition 2.2** (Kullback-Leibler divergence (relative logarithmic entropy)).**
Let and be two finite nonnegative measures. The logarithmic entropy of with respect to is given by the Csiszàr functional associated to :
[TABLE]
The functionals admit a useful dual representation. Let us denote by the set of bounded Borel functions on and by the Legendre conjugate function of , given by
[TABLE]
We introduce the closed convex subsets of given by
[TABLE]
Since is lower semicontinuous, it can be recovered from and by the Fenchel-Moreau formula [32]
[TABLE]
Similarly, we have
[TABLE]
and if (18c) holds.
Theorem 2.3**.**
For every we have
[TABLE]
Proof.
[32, Th. 2.7] ∎
2.2. Hellinger distances
We consider a specific example of perspective functionals , which gives raise to the Hellinger distances.
Definition 2.4**.**
For and the -Hellinger distance is defined by
[TABLE]
where , , and is an arbitrary dominating measure.
Notice that the above definition corresponds to (20), (19) for the choices
[TABLE]
An immediate consequence of the above definition, choosing is the uniform bound
[TABLE]
For the definition above gives the usual total variation distance, which we will still denote by . The total variation distance and the -Hellinger distance induce the same topology on the space and the following relation holds.
Theorem 2.5**.**
Let be the conjugate exponent of . For every and arbitrary nonnegative finite measures and in ,
[TABLE]
where .
Proof.
The first part of (26) follows immediately by the representation (24) and the elementary inequality
[TABLE]
The second inequality of (26) is a consequence of
[TABLE]
which can be easily obtained by integration (without loss of generality we can assume )
[TABLE]
where
[TABLE]
(27) with the choices and , combined with Hölder inequality, yields
[TABLE]
∎
An interesting characterization of in terms of is provided by the following property [32]:
Proposition 2.6**.**
For any two measures and in
[TABLE]
In particular
[TABLE]
Proof.
Recalling (22) and (23b), (28) follows by the simple calculation
[TABLE]
attained at . ∎
We now look at the Hellinger distance in its dual formulation. We focus on a ‘static-dual’ formulation first and then we proceed to the dynamic dual formulation in terms of subsolution of the equation . This expression will play a crucial role in the contraction result of Proposition 5.1 and the regularizing estimates of Theorems 5.2 and 5.4. In the next computation we adopt the convention to write
[TABLE]
Corollary 2.7**.**
Let and be the conjugate of . The Hellinger distance admits the following dual formulation:
[TABLE]
Proof.
The result is a consequence of Theorem 2.3 and the computation of the convex set associated to the perspective function of (25); it is sufficient to prove that
[TABLE]
In order to show (31) we first compute the Legendre transform of , obtaining
[TABLE]
Recalling that , the inequality for is equivalent to
[TABLE]
We then obtain
[TABLE]
which yields (31). ∎
The dynamic counterpart of the dual formulation is outlined in the proposition below.
Proposition 2.8**.**
Let and let be the conjugate of . For every , in
[TABLE]
Proof.
First of all we manipulate the formulation (32) so that we can maximize with respect to one function only. We first observe that replacing, e.g. by , , the couple is still admissible and
[TABLE]
so that it is not restrictive to assume in (30).
that for every choice of satisfying the best selection of in order to maximize is given by
[TABLE]
Setting we obtain the formula
[TABLE]
On the other hand we observe that the function \zeta_{1}:=\frac{\zeta_{0}}{\big{(}1+\zeta_{0}^{q-1}\big{)}^{p-1}} corresponds to the solution at time of
[TABLE]
and by the comparison theorem for ordinary differential equation, any subsolution to (33) will satisfy . ∎
2.3. Kantorovich-Wasserstein and Hellinger-Kantorovich distances
Kantorovich-Wasserstein distance.
The standard definition of the Kantorovich-Wasserstein distance arises in a natural way in the frame of optimal transport. Here we recall the definition only and we refer to [2, 42] for further details.
We will deal with a complete and separable metric space ; we denote by its Borel -algebra and by the space of Borel probability measures on . For we set
[TABLE]
where is an arbitrary point of (the definition is independent of the choice of ).
If is a Borel map between two metric spaces, we denote by the corresponding push-forward operation, defined by
[TABLE]
In particular, when we consider the canonical cartesian projections defined by , , and a general measure (also called transport plan) , the measures are the marginals of .
Definition 2.9**.**
Let . For any the -Kantorovich-Wasserstein distance is defined by
[TABLE]
As we will see, a key ingredient we will extensively use in our arguments is given by the dynamic dual formulation of the Wasserstein distance, in terms of the subsolutions of the Hamilton-Jacobi equation. Such a result, which has been formulated in different form by [36, 3, 6, 1], holds if is a length space, i.e. if for every and every there exists an approximate mid-point such that
[TABLE]
We denote by the Banach space of bounded Lipschitz functions endowed with the norm
[TABLE]
Proposition 2.10**.**
If is a length space then for every
[TABLE]
where is the conjugate of .
Proof.
Let ; since is a length space, also is a length space, so that for every we can find a Lipschitz curve such that
[TABLE]
It follows that for every curve the map is Lipschitz continuous and by [6, Lemma 6.4, Theorem 6.6]
[TABLE]
if is also a subsolution to the Hamilton-Jacobi equation
[TABLE]
then the previous inequality, the bound (35) on the metric velocity and the arbitrariness of yield
[TABLE]
On the other hand, for every we can use the Hopf-Lax semigroup
[TABLE]
and Kantorovich duality for the Wasserstein distance to find such that
[TABLE]
Using the refined estimate on the Hopf-Lax semigroup of [3] we can show that is uniformly bounded in , is Lipschitz continuous with values in and satisfies
[TABLE]
By using a rescaling argument of [1] and the smoothing technique of the proof of [32, Theorem 8.12] we conclude. ∎
Hellinger-Kantorovich distance.
After Hellinger-Kakutani and Kantorovich-Wasserstein distances, we recall the definition of a third distance between probability measures, that plays a role in the main contributions of this work.
Let be a separable complete metric space. The Hellinger-Kantorovich distances are defined on the space of finite nonnegative Borel measures and they do not require measures to have the same mass. As in the previous cases of or , the Hellinger-Kantorovich distances admit different formulations that we summarize below. Here we focus on the family of distances depending on a tuning parameter ; they correspond to the case of [31] with the choice . In the even more specific case , coincides with the distance which has been extensively studied in [32]. The general case can be reduced to the case by rescaling the distance by a factor .
The first formulation comes from the Logarithmic-Entropy-Transport problem, where the constraints on the marginals typical of optimal transport problems (2.9) are relaxed by the introduction of two penalizing functionals. The primal formulation of the Hellinger-Kantorovich distance is the following:
Definition 2.11**.**
For any ,
[TABLE]
where is the cost function defined by (15).
A direct comparison with (28) by restricting to plans of the form where is an arbitrary measure dominating and is the diagonal identity map , immediatily yields
[TABLE]
[32, Theorem 7.22] also shows that
[TABLE]
[32, Proposition 7.23, Theorem 7.24] provide two further useful bounds of in terms of , when :
[TABLE]
The distance admits an equivalent dual formulation in terms of subsolutions to a suitable version of the Hamilton-Jacobi equation, which can be compared with (32) and (34): in fact, it is possible to show [32, Section 8.4] that
[TABLE]
3. Metric measure spaces with curvature bounds
This section is dedicated to a brief review of a few notions related to calculus and Sobolev spaces in metric measure spaces. We refer to [3] and [4] for a complete review of the topic.
3.1. Calculus in metric measure spaces: basic notions
Let be a complete and separable metric space, endowed with a Borel positive measure satisfying the growth condition (1) and . As we already mentioned in the Introduction, on this class of metric measure space it is possible to introduce an effective metric counterpart of the classic Dirichlet energy form in Euclidean spaces and of the corresponding Sobolev spaces. In the following, we will recall the basic notions only, which are strictly necessary to understand the main results of the work, by adopting the Cheeger point of view.
Definition 3.1**.**
A function is a relaxed gradient of if there exist Borel -Lipschitz functions such that:
- a)
in and weakly converge to in ;
- b)
-a.e. in .
We say that is the minimal relaxed gradient of if its norm is minimal among relaxed gradients. We shall denote by the minimal relaxed gradient.
The minimal relaxed gradient is used to give an integral formulation of the Cheeger energy (2), which can be represented as
[TABLE]
and set equal to if has no relaxed gradients. The Cheeger energy is a convex, -homogeneous lower semicontinuous functional on with dense domain [3, Th. 4.5]. From the lower semicontinuity of it follows that the domain endowed with the norm
[TABLE]
is a Banach space, which is called . In general it is not a Hilbert space and this causes the potential non linearity of the heat flow. The following proposition summaries some useful properties of the minimal relaxed gradient, which will be helpful for our purposes.
Proposition 3.2**.**
Let . Then the following properties hold:
- a)
* -a.e. on for all constants and with ;*
- b)
* and for any Lipschitz function on an interval containing the image of ; the inequality refines to the equality if in addition is nondecreasing;*
- c)
if and is a nondecreasing contraction, then
[TABLE]
Proof.
[3, Prop. 4.8] ∎
3.2. Gradient flow of the Cheeger energy in metric-measure spaces
The metric-measure counterpart of the Laplacian operator can be defined in terms of the element of minimal -norm in the subdifferential of . is the multivalued operator in defined for all by the following relation:
[TABLE]
Definition 3.3** (Metric-measure Laplacian).**
*The metric-measure Laplacian of is defined for any such that . For those , is the element of minimal norm in . *
The domain of is denoted by and is a dense subset of . The metric-measure heat flow can be obtained by applying the classic theory of gradient flows in Hilbert spaces [12] and it enjoys further properties which have been studied in [3]. More refined contraction properties will be proved in Section 4.
Theorem 3.4** (Gradient flow of in ).**
For any there exists a unique locally absolutely continuous curve such that in as and
[TABLE]
The following properties hold:
- (1)
The curve is locally Lipschitz, for any and it holds
[TABLE] 2. (2)
The curve is locally Lipschitz in , infinitesimal at and continuous in [math] if . Its right derivative is given by for every . 3. (3)
The family of maps is a strongly continuous semigroup of contractions in which can be extended in a unique way to a strongly continuous semigroup of contractions in every , (still denoted by ) thus satisfying
[TABLE]
3.3. metric measure spaces
In this subsection we briefly recall the definition and some properties of a class of metric measure spaces which generalize the notion of Riemannian manifolds with Ricci curvature bounded from below. This will be the general setting of the regularization result that we propose in Section 5, where, indeed, the bound on the curvature plays a direct role.
On a general metric measure space, the Cheeger energy is not a quadratic form and this translates into a potential lack of linearity of its -gradient flow . If we require the Cheeger energy to be quadratic, and hence the heat flow to be linear, we restrict the choice of the underlying metric domain to class of metric measure spaces which can be considered a nonsmooth generalization of Riemannian manifolds: among them, the so called Bakry-Émery curvature condition can be used to select the class of metric measure spaces (we refer to [3, 4] for a complete discussion and the other important equivalent characterization we mentioned in the Introduction). As in the previous section, will denote a complete and separable metric measure space satisfying the volume growth condition (1).
Definition 3.5** (The -condition).**
is a metric measure space if the Cheeger energy is quadratic (7), every function with admits a -Lipschitz representative (still denoted by ) and
[TABLE]
Equation (41) is one of the equivalent formulation of the celebrated Bakry-Émery condition [8], [5]. Notice that the condition implies in particular that every bounded function with has a Lipschitz continuous representative (identified with ) satisfying
[TABLE]
On spaces, an even stronger version of (41) holds true, together with crucial regularization properties which we collect in the next statement.
Theorem 3.6**.**
Let be a space.
- (1)
*For every and the function has a unique continuous representative (in the following, with a slight abuse of notation, we will identify with , whenever ). * 2. (2)
For every with and
[TABLE] 3. (3)
For every and
[TABLE]
where has been defined in (14). In particular
[TABLE]
Proof.
Property (1) is a consequence of [5, Corollary 4.18]. The first identity of (42) is stated in [5, Theorem 3.17]; the second one is stated in [38, Corollary 4.3]. (23b) is a consequence of the above properties and the estimate of [5, Corollary 2.3(iv)]. ∎
4. Contraction properties for the Heat flow in metric measure spaces
This section is devoted to some fairly general contraction properties of the heat flow in the metric-measure setting. Our main result concerns the behaviour of the functional
[TABLE]
Theorem 4.1**.**
Let be a metric measure space with the Heat semigroup generated by the Cheeger energy in , and let be defined as in (45a,b,c). Then, for
[TABLE]
We prove some useful lemmas first. The first one shows a generalization of part in Proposition 3.2 and is the core of the proof of the main theorem.
Lemma 4.2**.**
Let be a metric space, let be a convex function with -Lipschitz (w.r.t the Euclidean norm) gradient , and let be the map . For every bounded Lipschitz map , the function satisfies
[TABLE]
Proof.
Since is positive definite and is -Lipschitz, we observe that satisfies
[TABLE]
For every , , and we set
[TABLE]
Let us now fix ; it is possible to find two sequences of points , , such that
[TABLE]
Taking a linear combination of the difference quotients with the positive coefficients it holds
[TABLE]
Now, and hence
[TABLE]
Since is , a first order expansion at with and the Lipschitz character of yield
[TABLE]
Estimating the first component of along the sequence and the second component of along we get for
[TABLE]
Recalling (48), since the coefficients are nonnegative, we get
[TABLE]
where for every is the symmetric matrix defined by
[TABLE]
(47) and the next elementary Lemma yield
[TABLE]
thus obtaining (46). ∎
Lemma 4.3**.**
Let be a symmetric matrix and let be defined by , . If
[TABLE]
then also satisfies
[TABLE]
Proof.
It is easy to check that a symmetric matrix satisfies for every (49) if and only if
[TABLE]
and it is clear that (51) is preserved if we replace the coefficients by . The second inequality of (50) follows immediately by the first one and the Cauchy-Schwartz inequality, since
[TABLE]
∎
Lemma 4.4**.**
Let be a convex function as in (45b) and (45c) with -Lipschitz (w.r.t the Euclidean norm) gradient , and let be the map . For every couple bounded Lipschitz map with , the components of belong to and satisfy
[TABLE]
Proof.
Let us consider the case when (the case of a finite measure is simper, and it follows by obvious modifications of the arguments below): notice that (45c) yields .
Let us first notice that so that for every the functions belong to as well.
We first prove that
[TABLE]
whenever are bounded and Lipschitz and is of class . To this aim, it is sufficient to regularize e.g. by convolution with a family of smooth kernels , satisfying
[TABLE]
We then set
[TABLE]
Applying Lemma 4.2 we get
[TABLE]
Since
[TABLE]
passing to the limit as we have in ; up to the extraction of a suitable subsequence (not relabelled) we can also assume that
[TABLE]
We claim that
[TABLE]
In fact, for an arbitrary measurable set we have
[TABLE]
so that for every measurable set
[TABLE]
(52) then follows by (53) by a similar argument: we select optimal sequences of bounded Lipschitz functions converging to in such that
[TABLE]
and we consider the corresponding sequences , converging to in . We then pass to the limit in the inequality
[TABLE]
∎
Next lemma focuses on a useful property of the metric Laplacian which relies on the estimate that we have just proved.
Lemma 4.5**.**
If and is a convex function satisfying (45c), then
[TABLE]
Proof.
It is not restrictive to assume that is -Lipschitz. As we observed in the proof of Lemma 4.4, we also note that belongs to , since when (45c) yields ; therefore the integral in (56) is well defined. Recall that
[TABLE]
and that . Hence taking in our case and we get
[TABLE]
and similarly
[TABLE]
By definition of the Cheeger functional and Lemma 4.4 we obtain (56). ∎
With the previously developed tools we can conclude the proof of Theorem 4.1.
Proof.
Let us set and . Assume first that is with Lipschitz gradient , so that has at most quadratic growth. Recalling that are differentiable as -valued maps, we get
[TABLE]
thanks to (56). We thus obtain
[TABLE]
In the general case, we apply (57) to the functional associated to the Yosida approximation of ,
[TABLE]
It is well known [12] that is convex of class with Lipschitz gradient ; moreover, if (45c) holds, then also is nonnegative and it is immediate to check from (58) that . Since , (57) then yields
[TABLE]
We can eventually pass to the limit as and applying Beppo Levi monotone convergence theorem, since as for every . ∎
A few particular cases follow as corollaries of the main result. The first one states the contraction in the Hellinger metric for measures which are absolutely continuous w.r.t. : with a slight abuse of notation, for every , , we will set
[TABLE]
Corollary 4.6**.**
For every nonnegative we have
[TABLE]
Proof.
It is sufficient to prove (59) for every couple of nonnegative functions and then argue by approximation using (40) for . We can then apply Theorem 4.1 with the function given by
[TABLE]
∎
More generally, the same contraction result holds true for any Csiszàr divergence; recalling the discussion of Section 2.1 and keeping the same notation of (18a), (18b), (18c) and Definition 2.1, we first set
[TABLE]
Corollary 4.7**.**
Let be a Csiszàr divergence as in Definition 2.1. Then, for every nonnegative ,
[TABLE]
Proof.
Recalling (21), it is sufficient to apply Theorem 4.1 to the integral functional associated to the function of (18b), extended to if or .
∎
5. Regularizing properties of the Heat flow in metric
measure spaces
In the previous section we have shown contraction results involving convex functionals and metric heat flows in metric measure spaces, thus covering the case of nonlinear flows in Finsler-like geometries.
In the linear case, the Hellinger contraction (59) can also be proved by a different approach, based on the dual dynamic formulation of the Hellinger distance that we have discussed in 2.8. We first explain this technique in the simple case of a submarkovian operator on the set of bounded measurable functions and we will then show how to extend this approach to prove new regularization results for the Heat semigroup in metric measure spaces.
5.1. Hellinger contraction for submarkovian operators
Let be a measurable space and let be a linear submarkovian operator [18, Chap. IX, Sect. 1]: this means that for every bounded measurable maps
[TABLE]
where convergence in (60b) has to be intented pointwise everywhere. Notice that for every
[TABLE]
so that choosing we get the Jensen’s inequality
[TABLE]
We can define the adjoint operator acting on by the formula
[TABLE]
The next result could also be derived by a more refined Jensen inequality for submarkovian operator. Here we want to highlight the role of the dual dynamic point of view.
Proposition 5.1**.**
Let be a measure space and let be a linear submarkovian operator in . Then, for any
[TABLE]
Proof.
Let us consider a solution of
[TABLE]
We apply the map to this solution; since the linear map is continuous with respect to the supremum norm in , . Moreover, from (61) applied to it follows that is also a subsolution to (63):
[TABLE]
since is positivity preserving. Then, recalling the formulation (32) of the Hellinger distance, we have
[TABLE]
Taking the supremum of the left hand side with respect to all the subsolutions of (63) and applying (32) once more, we eventually get (62).∎
Remark 1**.**
The same argument combined with the -Jensen inequality for yields
[TABLE]
for every . The proof can also be extended to submarkovian operators in with respect to a given reference measure , obtaining in this case an Hellinger estimate for measures absolutely continuous w.r.t. .
5.2. Regularization - for
Let us now focus on the regularization estimates for the particular class of Markovian operators provided by the heat semigroup in a metric measure space satisfying the condition. Since maps into , we can use (12) to define the adjoint heat semigroup on arbitrary positive and finite measure of (see [5, Section 3.2] for more details).
Theorem 5.2**.**
Let be a metric measure space and . Then, for every
[TABLE]
where has been defined in (14).
Proof.
Let us set and to shorten the notation. We will consider the case ; the case follows directly from (44) by using the dual characterization of the Kantorovich distance , or by approximating by measures with bounded support (thus in for every ) and then passing to the limit in (64) as .
The dual dynamic formulations (32) and (34) (recall that a space is a length space) we know
[TABLE]
and
[TABLE]
A simple rescaling argument, replacing by , shows that for
[TABLE]
Now, take such that . We apply the order preserving semigroup to and we get
[TABLE]
The Lipschitz regularization property stated in Theorem 3.6 ensures that and that it satisfies the refined Bakry-Emery condition (43), where we neglect the last negative term:
[TABLE]
Since , the conjugate is in and hence . Taking the power in (67) and using Jensen’s inequality we obtain
[TABLE]
The combination of this inequality and (66) yields
[TABLE]
which shows that is a subsolution of the Hamilton-Jacobi equation as in (65) with the time-and-curvature dependent weight
[TABLE]
All these facts lead to
[TABLE]
Thus, taking the supremum over all the subsolutions to we conclude
[TABLE]
where as in (68), which yields (64). ∎
As a byproduct, when , we obtain an precise decay rate for the asymptotic behaviour of .
Corollary 5.3**.**
Let be a metric measure space with and let , . For every we have
[TABLE]
In the case and it is interesting to compare (69) with the well known exponential decay rates of the logarithmic entropy and of the Wasserstein distance
[TABLE]
which follow by the -geodesic convexity of the functional in spaces. In particular, recalling (29), the first estimate of (70) provides
[TABLE]
which exhibits the same exponential behaviour of (69); however, (69) only requires .
5.3. Regularization -
With a similar argument we prove that the Hellinger distance at time can be estimated from above by the weighted Hellinger-Kantorovich distance , in which the parameter acts on the transport part of the distance with a time-dependent factor and does not affect the reaction part. Note that this embodies a natural combination of the Hellinger-Kantorovich estimate above and the Hellinger contraction that we proved in Proposition 5.1.
Theorem 5.4**.**
Let be a metric measure space. For every
[TABLE]
Proof.
As in the previous proof, we set and and we recall that
[TABLE]
and that (39)
[TABLE]
We consider a solution of and we apply the linear operator , , obtaining
[TABLE]
Theorem 3.6 ensures that is Lipschitz and satisfies
[TABLE]
so that
[TABLE]
this inequality corresponds to the subsolutions of Hamilton-Jacobi equation in (72) weighted with . Therefore
[TABLE]
and taking the supremum with respect to the subsolutions to we get (71).∎
It is worth noticing that (71) yields the pure Hellinger contraction estimate (59) thanks to (37). Similarly, choosing and applying (38) one recovers (64) in the case .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] L. Ambrosio, M. Erbar, and G. Savaré. Optimal transport, Cheeger energies and contractivity of dynamic transport distances in extended spaces. Nonlinear Anal. , 137:77–134, 2016.
- 2[2] L. Ambrosio, N. Gigli, and G. Savaré. Gradient flows in metric spaces and in the space of probability measures . Lectures in Mathematics ETH Zürich. Birkhäuser Verlag, Basel, second edition, 2008.
- 3[3] L. Ambrosio, N. Gigli, and G. Savaré. Calculus and heat flow in metric measure spaces and applications to spaces with Ricci bounds from below. Invent. Math. , 195(2):289–391, 2014.
- 4[4] L. Ambrosio, N. Gigli, and G. Savaré. Metric measure spaces with Riemannian Ricci curvature bounded from below. Duke Math. J. , 163(7):1405–1490, 2014.
- 5[5] L. Ambrosio, N. Gigli, and G. Savaré. Bakry-Émery curvature-dimension condition and Riemannian Ricci curvature bounds. Ann. Probab. , 43(1):339–404, 2015.
- 6[6] L. Ambrosio, A. Mondino, and G. Savaré. Nonlinear diffusion equations and curvature conditions in metric measure spaces. Ar Xiv e-prints , 1509.07273, Sept. 2015.
- 7[7] L. Ambrosio, G. Savaré, and L. Zambotti. Existence and stability for Fokker-Planck equations with log-concave reference measure. Probab. Theory Relat. Fields , 145(3-4):517–564, 2009.
- 8[8] D. Bakry and M. Émery. Diffusions hypercontractives. Séminaire de probabilités de Strasbourg , 19:177–206, 1985.
