Optimal transport, gradient estimates, and pathwise Brownian coupling on spaces with variable Ricci bounds
Mathias Braun, Karen Habermann, Karl-Theodor Sturm

TL;DR
This paper establishes the equivalence of synthetic Ricci curvature bounds, gradient estimates, and pathwise Brownian coupling on metric measure spaces with variable Ricci bounds, linking curvature conditions to transport costs and stochastic couplings.
Contribution
It proves the equivalence between curvature-dimension conditions, gradient estimates, and a new characterization via coupled Brownian motions with variable Ricci bounds.
Findings
Equivalence of Ricci curvature bounds and gradient estimates in variable curvature spaces.
Characterization of curvature bounds through nonincreasing perturbed p-transport costs.
Existence of coupled Brownian motions satisfying exponential contraction properties.
Abstract
Given a metric measure space and a lower semicontinuous, lower bounded function , we prove the equivalence of the synthetic approaches to Ricci curvature at being bounded from below by in terms of the Bakry-\'Emery estimate in an appropriate weak formulation, and the curvature-dimension condition in the sense Lott-Sturm-Villani with variable . Moreover, for all , these properties hold if and only if the perturbed -transport cost \begin{equation*} W_p^{\underline{k}}(\mu_1,\mu_2,t):=\inf_{(\mathsf{b}^1,\mathsf{b}^2)} \mathbb{E}\Big[\mathrm{e}^{\int_0^{2t} p \underline{k}\left(\mathsf{b}^1_{r}, \mathsf{b}^2_{r}\right)/2\,\mathrm{d} r} \mathsf{d}^p\!\left(\mathsf{b}^1_{2t},\mathsf{b}^2_{2t}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
33footnotetext: Key words and phrases. Ricci curvature, Bakry–Émery estimate, gradient estimate, optimal transport, coupling.33footnotetext: University of Bonn, Institute for Applied Mathematics, Endenicher Allee 60, 53115 Bonn, Germany.
Optimal transport, gradient estimates, and pathwise Brownian coupling on spaces with variable Ricci bounds
Mathias [email protected]. Funded by the European Union through the ERC-AdG “RicciBounds”.
Karen [email protected]. Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy - GZ 2047/1, Projekt-ID 390685813.
Karl-Theodor [email protected]. Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy - GZ 2047/1, Projekt-ID 390685813 as well as through the Collaborative Research Center 1060, and funded by the European Union through the ERC-AdG “RicciBounds”.
Abstract
Given a metric measure space and a lower semicontinuous, lower bounded function , we prove the equivalence of the synthetic approaches to Ricci curvature at being bounded from below by in terms of
- •
the Bakry–Émery estimate in an appropriate weak formulation, and
- •
the curvature-dimension condition in the sense Lott–Sturm–Villani with variable .
Moreover, for all , these properties hold if and only if the perturbed -transport cost
[TABLE]
is nonincreasing in . The infimum here is taken over pairs of coupled Brownian motions and on with given initial distributions and , respectively, and denotes the “average” of along geodesics connecting and .
Furthermore, for any pair of initial distributions and on , we prove the existence of a pair of coupled Brownian motions and such that a.s. for every with , we have
[TABLE]
Contents
-
1.1 Lagrangian formulation of synthetic variable Ricci bounds
-
3 Gradient estimates, Bochner’s inequality, and their self-improvements
-
4 From -gradient estimates to CD and differential -transport estimates
-
5.2 From weak differential -transport inequalities to -transport estimates
-
5.4 Gradient estimates out of pathwise and transport estimates
-
6.2 Extension to arbitrary initial distributions and time interval
1 Introduction
Throughout this paper, the triple is a metric measure space, that is, a complete and separable metric space equipped with a locally finite measure defined on the Borel -field , and is a lower semicontinuous function which is bounded from below. We always assume that is an space for some .
Denote by the space of Borel probability measures on . For , is the set of with for some . As usual, denotes the -Kantorovich–Wasserstein distance defined through
[TABLE]
where the infimum is taken over all with marginals and . If it exists, the limit is called metric speed of the curve at , and we write if for every . Moreover, denotes the space of geodesics on , i.e. the set of with for all . Similarly, we define as the space of -geodesics in the space of probability measures. We say that represents the -geodesic if for all , where is the evaluation map defined by . By [Lis07], every -geodesic can be represented by some .
We present various synthetic approaches to the definition of Ricci curvature at bounded from below by and prove their equivalence. These characterizations are suitable extensions of the curvature-dimension condition, the evolution variational inequality, Bochner’s inequality, gradient estimates and transport estimates to nonconstant curvature bounds. To this list, we add a description in terms of pathwise coupling of Brownian motions. In total, our main result is the following.
Theorem 1.1**.**
Let be an space for some , and let be a lower semicontinuous, lower bounded function. For all exponents and , the following properties are equivalent:
- (i)
the curvature-dimension condition , 2. (ii)
the evolution variational inequality , 3. (iii)
the -Bochner inequality , 4. (iv)
the -gradient estimate , 5. (v)
the -transport estimate , and 6. (vi)
the pathwise coupling property .
Moreover, any of these properties yields (iii), (iv) and (v) for all exponents .
Let us now introduce each of these extensions and give an overview of the organization of our reasoning. Throughout, we assume the reader to be familiar with the theory of spaces and basic properties of these. An account on this will be collected in Section 2 which can be read independently of the rest of this paper.
1.1 Lagrangian formulation of synthetic variable Ricci bounds
Here and in the sequel, denotes the Green’s function of the unit interval . Define the Boltzmann entropy as
[TABLE]
We put .
Definition 1.2 [Stu15, Definition 3.2].
A metric measure space is said to satisfy the curvature-dimension condition with variable curvature bound , briefly , if for every there exists a measure representing some -geodesic connecting and such that, for all ,
[TABLE]
Definition 1.3 [Stu15, Definition 3.3].
A metric measure space is said to satisfy the evolution variational inequality with variable curvature bound , briefly , if for every there exists a locally absolutely continuous curve in with as , and for every and there exists a measure representing some -geodesic connecting and such that
[TABLE]
From [Stu15, Theorem 3.4], it is already known that is equivalent to , which establishes the equivalence of (i) and (ii) in Theorem 1.1.
1.2 Eulerian formulation of synthetic variable Ricci bounds
Let us now switch to the Eulerian picture which, to shorten the presentation, is directly presented for arbitrary exponents. Define the Cheeger energy as
[TABLE]
where denotes the local Lipschitz slope at . We put \mathrm{Dom}(\mathscr{E}):=\smash{\big{\{}f\in L^{2}(X,\mathfrak{m}):\mathscr{E}(f)<\infty\big{\}}}.
Definition 1.4**.**
Given , we say that satisfies the -Bochner inequality or -Bakry–Émery estimate with variable curvature bound , briefly , if
[TABLE]
holds for all with as well as and for every nonnegative with .
The equivalence of (i) and (iii) for in our major Theorem 1.1 above states that the variable Eulerian and Lagrangian approaches to synthetic lower Ricci bounds coincide, i.e. is equivalent to . If is constant, this has been proved by Ambrosio, Gigli and Savaré in their groundbreaking work [AGS15]. In the nonconstant case, this remained open in previous contributions [Ket15, Ket17, Stu15].
The implication from to follows from Theorem 3.4 and Theorem 4.5. The proof of the converse is a consequence of Proposition 4.6, Theorem 5.6, Theorem 5.19 and eventually Theorem 3.4. This requires a detailed heat flow analysis, both at the level of functions and measures, and in particular an extension of Kuwada’s duality [Kuw10, Theorem 2.2] between -gradient estimates and -transport estimates for dual and . This is quite demanding – indeed, until now not even a formulation of an appropriate -transport estimate with nonconstant curvature bound existed.
The “self-improvement property” of the -Bochner inequality will be another key result. Indeed, the condition is independent of , see Theorem 3.5, which provides the equivalence of (i) and (iii) in Theorem 1.1 for general .
1.3 Improved gradient estimates
Following [Stu15], let be the Schrödinger semigroup on associated to the generator for . It extends to a strongly continuous semigroup on for each . In terms of the Brownian motion on starting in , it can be expressed through the Feynman–Kac formula
[TABLE]
Definition 1.5**.**
We say that a -gradient estimate with variable curvature bound , briefly , holds whenever
[TABLE]
is satisfied for every and every .
Adapting the well-known arguments for constant Ricci curvature bounds from [BÉ85, Sav14], we establish, as stated in Theorem 3.4, that holds if and only if is satisfied. This yields the equivalence of (iii) and (iv) in Theorem 1.1 for general .
1.4 Variable transport estimates
In order to formulate a dual -transport estimate for , we consider evolutions on the product space . Denoting by the set of with and , we introduce the function defined by
[TABLE]
Its basic properties are summarized in Section 2. As we will see in Remark 5.12, Theorem 6.1 and Theorem 5.17, it turns out that can indeed equivalently be replaced in all relevant quantities by the larger function defined by
[TABLE]
Given , we define the perturbed -transport cost at time by
[TABLE]
where the infimum is taken over all pairs of coupled Brownian motions \smash{\big{(}\mathbb{P},\mathsf{b}^{1}\big{)}} and \smash{\big{(}\mathbb{P},\mathsf{b}^{2}\big{)}} on , restricted to and modeled on a common probability space, with initial distributions and , respectively. Note that and that for general , if is constant, say , the perturbed -transport cost can be expressed in terms of the usual -transport cost via
[TABLE]
Definition 1.6**.**
Given any , we say that a -transport estimate with variable curvature bound , briefly , holds if the map is nonincreasing on for every pair .
Having at our disposal appropriate replacements for the expressions \smash{\mathrm{e}^{-qKt}\,\mathsf{P}_{t}\big{(}\Gamma(f)^{q/2}\big{)}} and in terms of Feynman–Kac formulas with potentials for the Brownian motion on and for pairs of coupled Brownian motions on , respectively, we are in a position to formulate and prove a generalization of the fundamental Kuwada duality in the case of nonconstant . This addresses the equivalence of (iv) and (v) in Theorem 1.1.
Theorem 1.7**.**
For every with , the following are equivalent:
- (iv)
the -gradient estimate , and 2. (v)
the -transport estimate .
This result is a consequence of Theorem 5.16 and Theorem 5.19. For both results, it is crucial to use a localization argument in regions where or are “approximately constant” and then use tail estimates for Brownian paths to control the remainder terms.
Suitable extensions to the case and will be discussed, and eventually shown to be equivalent, in Theorem 5.10, Theorem 5.17 and Theorem 6.1. Therefore, making sense of an appropriate condition for is the content of the subsequent Section 1.5.
Remark 1.8**.**
It is often convenient to use the characterization of , which is zeroth-order in nature, through a first-order condition via the differential -transport inequality
[TABLE]
very much in the spirit of the connection between and . The equivalence of and the foregoing estimate, which for constant is essentially Gronwall’s lemma and a standard coupling technique, is treated in Corollary 5.7.
A posteriori, for every , any of the conditions (i) to (vi) from Theorem 1.1 will indeed give the much stronger estimate
[TABLE]
where have finite -distance to each other, and is an arbitrary measure representing a -geodesic from to , see Corollary 5.11. ∎
1.5 Pathwise coupling of Brownian motions
Finally, we reinforce the -transport estimate by passing to the limit and by replacing the mean value estimates by a pathwise one.
Definition 1.9**.**
We say that the pathwise coupling property with variable curvature bound , briefly , holds if for every pair there exists a pair \smash{\big{(}\mathbb{P},\mathsf{b}^{1}\big{)}} and \smash{\big{(}\mathbb{P},\mathsf{b}^{2}\big{)}} of coupled Brownian motions on with initial distributions and , respectively, such that -a.s., we have
[TABLE]
It is proved in [ACT08, Theorem 4.1] that complete Riemannian manifolds with Ricci curvature bounded from below by satisfy with constant . The work [Stu15, Theorem 2.9] extended this to general spaces. A first result into the nonconstant direction is due to [Vey11, Theorem 6]. Again on Riemannian manifolds with a uniform lower bound on the Ricci curvature, it deduces the existence of a pair \smash{\big{(}\mathsf{b}^{1},\mathsf{b}^{2}\big{)}} of coupled Brownian motions starting in obeying for every , on the event that \smash{\big{(}\mathsf{b}_{r}^{1},\mathsf{b}_{r}^{2}\big{)}} does not belong to the cut-locus of for all , the estimate
[TABLE]
where \smash{\kappa(x,y):=-\frac{\mathrm{d}^{+}}{\mathrm{d}t}\big{|}_{t=0}\log W_{1}(\mathsf{H}_{t}\delta_{x},\mathsf{H}_{t}\delta_{y})} denotes the coarse curvature at , . For close to each other, say with , , we have
[TABLE]
see [Vey11, Theorem 19 and Remark 20]. The construction of this process deeply relies on smooth calculus tools, which are unavailable in our setting and thus cannot be adopted.
Our main theorem extends these results in terms of and circumvents regularity issues involving the variable curvature bound. The existence of a process satisfying the condition is even equivalent to . Indeed, given for every large enough , we deduce by means of Theorem 6.1, the content of which is the implication from (v) to (vi) in Theorem 1.1. Note that according to the previous Theorem 1.7 and nestedness of -gradient estimates, see Lemma 3.3, the -gradient estimate implies for all and thus . The converse of this, i.e. the implication from to , is addressed in Theorem 5.17.
Acknowledgments
The authors warmly thank Matthias Erbar for a number of fruitful and enlightening discussions.
2 Preliminaries
Notations
We write and for the spaces of continuous and Lipschitz functions , respectively. We set for . The space of bounded continuous functions on is denoted by , and the space of functions in with bounded support is called , and similarly for and .
The Riemannian curvature-dimension condition
We say that the metric measure space is infinitesimally Hilbertian if the Cheeger energy is a quadratic form (in other words, if it satisfies the parallelogram identity). Furthermore, we say that satisfies the Riemannian curvature-dimension condition if it is infinitesimally Hilbertian and satisfies the curvature-dimension condition according to Definition 1.2. As said, we always assume that is an space for some constant . The value of does not enter any of our results. Without restriction on . Indeed, one should think of as being much larger than everywhere on .
The assumption carries numerous important consequences for . Further details on the subsequent results can be found in [AGS14a, AGS14b, RS14, Sav14].
- a.
Volume growth. For each there exists a nonnegative constant such that for every . 2. b.
Nondegeneracy of entropy. is well-defined and does not attain the value . 3. c.
Uniqueness of -geodesics. For each pair of -absolutely continuous measures , there exists a unique -geodesic connecting them. 4. d.
Dirichlet form. By polarization, defines a quasi-regular, strongly local, conservative Dirichlet form, unambiguously denoted by , on with dense domain . The latter is a Hilbert space w.r.t. \smash{\big{[}\|f\|_{L^{2}(X,\mathfrak{m})}^{2}+\mathscr{E}(f)\big{]}^{1/2}}. The generator of , i.e. the self-adjoint operator on defined by putting and if and only if
[TABLE]
is called Laplacian. 5. e.
Heat flow. The Dirichlet form defines the heat semigroup as its gradient flow in , or alternatively via spectral calculus as , . This semigroup is -symmetric and extends to a strongly continuous contraction semigroup on for any . It can be chosen to be strong Feller, more precisely, maps to for with if , while if , then
[TABLE]
The semigroup is in duality with the semigroup defined as the gradient flow of in and extended to by continuity, i.e.
[TABLE]
In particular, for every . 6. f.
Uniqueness of EVI curves. Every curve in satisfying the obstructions from Definition 1.3 with arbitrary choice of necessarily coincides with the heat flow starting at . 7. g.
Brownian motion. For each , there exists a conservative Markov process on , or for short, unique in law, with continuous sample paths and transition semigroup given by
[TABLE]
and with . This process is called the Brownian motion on with initial distribution . If we want to stress the dependence on the initial distribution, we write instead of , where we abbreviate by for . 8. h.
Carré du champ. The set is a core for . A quadratic functional can be defined by requiring
[TABLE]
Indeed, coincides -a.e. with the minimal weak upper gradient . 9. i.
Test functions. The set
[TABLE]
is a core for and an algebra w.r.t. pointwise multiplication. 10. j.
Twice differentiability. We have for all and
[TABLE] 11. k.
Sobolev-to-Lipschitz property. Every with has a Lipschitz representative with .
Hopf–Lax evolution
For later use, we summarize the main properties of the general -Hopf–Lax (or Hamilton–Jacobi) semigroup , . A detailed account on this topic in general metric spaces can be found in [AGS13, AGS14a, AGS14b].
Fix a Lipschitz function on . Its -Hopf–Lax evolution is defined by
[TABLE]
The map belongs to , where is endowed with the usual supremum metric. We also have with for all . Denoting by the dual exponent to , for every , we have
[TABLE]
for all but at most countably many , and equality holds e.g. if is geodesic.
Using the -Hopf–Lax semigroup gives a nice duality formula for the -Kantorovich–Wasserstein distance, see [Kuw10, Vil09] for details: for all , one has
[TABLE]
The function and Lipschitz approximation
Recall that is lower semicontinuous and bounded from below by , and so is by construction. If is also bounded from above, say by , then so is . By reparameterization of geodesics, we get for every . Note that can be reconstructed from , since . Lastly, the function defined in (1.2) is the pointwise monotone limit from below of bounded Lipschitz functions , and so is the function by considering on the diagonal. We intend Lipschitz continuity on w.r.t. the product metric given by \smash{\operatorname{\mathsf{d}}_{X\times X}\!\big{(}(x,y),(x^{\prime},y^{\prime})\big{)}:=\big{[}\!\operatorname{\mathsf{d}}^{2}(x,x^{\prime})+\operatorname{\mathsf{d}}^{2}(y,y^{\prime})\big{]}^{1/2}}. The former fact will be used frequently. Following [AGS08], we can, for instance, define for by
[TABLE]
Lemma 2.1**.**
The above functions , , have the following properties:
- (i)
for every , the function is Lipschitz on with , 2. (ii)
for all and each , we have , and 3. (iii)
the sequence converges pointwise from below to .
3 Gradient estimates, Bochner’s inequality, and their self-improvements
In this section, we adapt the well-known arguments of [BÉ85, Sav14] for constant curvature lower bounds to derive the equivalence of the -Bochner inequality with the -gradient estimate with exponent . Moreover, we prove that these properties are independent of .
Up to replacing by , , we may assume throughout this chapter that is bounded. In the general case, each of the subsequent results still holds for since and trivially imply and for every , respectively, and conversely, if and hold for each , the monotone convergence theorem implies and , respectively.
3.1 Equivalence of Bochner and gradient estimate
First, we review the measure-valued Laplacian and the measure-valued -operator as introduced and analyzed in [Gig18, Sav14], defined by means of
[TABLE]
for suitable . We write if the signed measure exists, which is then uniquely determined by (3.1). We denote the density of the -absolutely continuous part of by . The singular part of w.r.t. is a nonnegative measure. Both and are well-defined for . Lastly, a well-known consequence of the generic calculus rules of proved in [Sav14] is the following chain rule for .
Lemma 3.1**.**
Fix , an interval with containing the image of , and a function such that . Then and
[TABLE]
Once holds, one can argue exactly as for [Sav14, Lemma 3.2] to get
[TABLE]
for every . Taking these estimates into account, one can argue exactly as in the proof of [Sav14, Theorem 3.4] to obtain that, for every ,
[TABLE]
Using this, we deduce the whole range of -Bochner inequalities from .
Proposition 3.2**.**
The condition implies for every .
Proof.
Fix and a nonnegative with . Given , consider the smooth function defined for . Since , we obtain the -a.e. inequalities
[TABLE]
by means of (3.3). Multiplying this by and integrating, one gets
[TABLE]
Invoking Lemma 3.1, this amounts to
[TABLE]
Note that the left integrand vanishes -a.e. on the set for every . Therefore, letting in the preceding inequality gives the inequality for .
To extend this to general with and , we approximate it in by means of its heat flow regularizations as . Since and in as , is uniformly bounded in for small enough , and is uniformly bounded in for small enough , we easily get
[TABLE]
in . This yields the claim. ∎
By the Feynman–Kac representation (1.1) of and Jensen’s inequality, the following hierarchy between gradient estimates is immediate. This and the above self-improvement property of will be used in the proof of Theorem 3.4 below.
Lemma 3.3**.**
If holds for some , then is satisfied for all .
Theorem 3.4**.**
For every , the properties and are equivalent to each other.
Proof.
By density of in and an argument as in the proof of Proposition 3.2, the function under consideration may be assumed to belong to .
Suppose that is satisfied. Fix any , as above and a nonnegative with . Given any , consider the function as defined in the proof of Proposition 3.2 above. Define by
[TABLE]
This function belongs to since the functions and s\mapsto\Phi_{\varepsilon}\big{(}\Gamma(\mathsf{P}_{t-s}f)\big{)} as well as their derivatives in are bounded on , see also [AGS15, Lemma 2.1] for a similar argument. Thus
[TABLE]
which is nonnegative by . Fatou’s lemma gives
[TABLE]
which establishes for by the arbitrariness of .
Conversely, assume for . As for such , we deduce , which is a reformulation of the inequality with in place of . Letting gives the desired conclusion. If, on the other hand, we have , we cannot rely on the above regularity of . However, Lemma 3.3 ensures , which implies by the previous discussion. Therefore, holds by Proposition 3.2. ∎
3.2 Independence of the -Bochner inequality on
Theorem 3.5**.**
If the -Bakry–Émery estimate holds for some , then it holds for every .
Lemma 3.3 and Proposition 3.2 give the assertion of this theorem when starting with for . To cover the range , we adapt the arguments of [Han18] to prove that a -Bakry–Émery inequality for some implies . A crucial point in this argument is that our a priori assumption guarantees for all and every .
Arguing exactly as in the constant situation in [Han18, Lemma 3.2] (see also [Sav14, Theorem 3.4]), one can show that holds if and only if the inequalities
[TABLE]
are valid for every . Here, denotes the density of the -absolutely continuous part of w.r.t. , stands for the corresponding -singular part, and is the quasi-continuous representative of .
Proposition 3.6**.**
Let be satisfied for some . Then holds.
Proof.
As discussed above, it suffices to show the claimed implication starting from with . Due to our standing assumption , the set is dense in , thus it is enough to check the inequality for . Moreover, note that already yields by (3.4) which is independent of .
The crucial point is to show that
[TABLE]
for every . Given the observation (3.4), this will imply for each , and eventually letting and applying the monotone convergence theorem, we get the claimed condition.
Given for arbitrary , it is straightforward to follow the proof of [Han18, Theorem 3.6], which relies on generic calculus rules for and closely follows the strategy presented in [Sav14], to prove (3.5) with replaced by . Now, according to [Han18, Lemma 3.3], given any there exist and so that , where and is the -fold composition of . Since yields , iterating the foregoing reasoning allows us to finally reach the inequality (3.5). ∎
As for [Han18, Proposition 3.7], it is possible to obtain an equivalent characterization of in terms of a lower bound on the measure-valued Ricci tensor
[TABLE]
introduced in [Gig18]. As for the measure-valued Laplacian , we denote by the density of the -absolutely continuous part and by the -singular part of , respectively.
Corollary 3.7**.**
The metric measure space satisfies if and only if for every , we have
[TABLE]
3.3 Localization of Bochner’s inequality
To study a suitable local-to-global behavior of the -Bochner inequality, we present a reformulation of it where we enlarge the class of functions . Recall that our standing assumption implies for every and .
Lemma 3.8**.**
Given , the property holds if and only if for all and all nonnegative ,
[TABLE]
Proof.
Obtaining from (3.6) through integration by parts and the density of in is easy, thus we focus on the converse. Trivially, the inequality (3.6) holds for all with . Recall now, e.g. from [Gig18, Sav14], that any function can be approximated in by means of a mollified heat flow
[TABLE]
as . Since and for every , this allows us to extend the class of admissible . ∎
Definition 3.9**.**
We say that the local -Bakry–Émery condition with variable curvature bound , briefly , with holds if for every there exists such that
[TABLE]
for all and every nonnegative with .
It is elementary to pass from the global condition to . The converse is more involved.
Theorem 3.10**.**
For , the property implies the condition.
Proof.
Let be a countable dense subset of and consider the collection of metric balls with chosen in such a way that the local -Bakry–Émery inequality is satisfied around . For , define functions on by
[TABLE]
Then with support in and on . Thus, for arbitrary nonnegative , the assumption allows us to deduce
[TABLE]
We conclude the assertion using Lemma 3.8 above. ∎
4 From -gradient estimates to CD and differential -transport estimates
Our goal now is to derive the evolution variational inequality with variable curvature bound from the -gradient estimate . In [Stu15] there is a first part of the proof for this implication. With some extra arguments, we complete it.
The key point is a localization argument. Indeed, it suffices to prove the “locally”, that is, for measures in a given small neighborhood. The heat flow will neither stay within this neighborhood nor in any other bounded region. We thus modify it by truncating its tails. Due to the Gaussian behavior of the heat flow, the difference is of arbitrary polynomial order for small times. This will imply the inequality locally. However, the latter is already known to give the inequality globally, and this in turn yields the global version of the .
4.1 Tail estimates for the heat flow
Given any ball with and , and , we put
[TABLE]
Lemma 4.1**.**
Assume that is -absolutely continuous with density and . Then for every there exists such that for all and all bounded Borel functions , we have
[TABLE]
Proof.
To see the first assertion for , the case being trivial, observe that
[TABLE]
where the last inequality comes from the integrated Gaussian heat kernel estimate of [Stu95, Lemma 1.7]. Therefore, by the volume growth property in spaces and finally assuming that is small enough, we obtain
[TABLE]
The second assertion follows from the first one, since
[TABLE]
In Chapter 5, we need the following result, which is a consequence of Lemma 4.1.
Lemma 4.2**.**
For each , and there exists such that
[TABLE]
where \smash{\big{(}\mathbb{P}_{x},\mathsf{b}^{x}\big{)}} denotes Brownian motion on starting in .
Proof.
Let be the uniform distribution of . Choose a pair \big{(}\mathbb{P},\mathsf{b}^{x}) and of coupled Brownian motions with initial distributions and , respectively, such that \smash{\operatorname{\mathsf{d}}\mathopen{}\mathclose{{}\left(\mathsf{b}_{t}^{x},\mathsf{b}_{t}}\right)}\leq\smash{\mathrm{e}^{-Kt}\,\operatorname{\mathsf{d}}(x,\mathsf{b}_{0})} -a.s. for every , see [Stu15, Theorem 2.9] for the construction. Thus in particular, -a.s. we have
[TABLE]
for every and a suitable . According to the previous Lemma 4.1,
[TABLE]
for all and some depending only on and . Combining both estimates yields that
[TABLE]
uniformly in for small enough times. ∎
4.2 From -gradient estimates to
In this section, we assume that is Lipschitz and bounded. The general case follows using the approximation scheme via the sequence with for derived from Lemma 2.1. Indeed, trivially implies for every , which will imply both and . Since -geodesics between -absolutely continuous measures and -curves are unique, we may then pass to the limit by monotone convergence.
We present a modification of [Stu15, Lemma 3.5] which is proved in exactly the same way as the previous version subject to the choice of parameterization from [AGS15, Theorem 4.16] involving the additional parameter . Throughout this section, we denote by the -Hopf–Lax semigroup.
Lemma 4.3**.**
Assume the -gradient estimate with variable curvature bound , and let be an arbitrary constant. Let with be a regular curve in the sense of [AGS15, Definition 4.10], and for , define if and as well as if and . Then
[TABLE]
is satisfied for every and all . The term \big{|}\dot{\rho}_{\vartheta_{t}(s)}\big{|} has to be understood as the metric speed of the original curve evaluated at .
The same estimate is satisfied for every -geodesic with -absolutely continuous measures, in which case , independently of and .
Lemma 4.4**.**
Assume the -gradient estimate with variable curvature bound . Suppose that in for some , and . Then for all with support in and bounded densities w.r.t. , we have
[TABLE]
Proof.
The proof follows the reasoning for [Stu15, Lemma 3.6] and [AGS15, Theorem 4.16], but with a subtle modification. Fix . While the curve connects and , the potentials , , are Hopf–Lax interpolations of optimal Kantorovich potentials for the transport from to . Thus, we have to match these two different situations and then use the nice behavior of the remainder terms.
We know by [AGS14a, Proposition 3.9] that for any -optimal coupling of and , and any Kantorovich potential relative to , we have for -a.e. . Taking (2.3) and the bounded support of into account,
[TABLE]
The latter supremum is attained, see [AGS14a, Proposition 2.12], at some . Possibly adding constants and invoking a cutoff argument, we may assume that everywhere on for some independent of . Thus, is bounded on and , uniformly in .
Let be the -geodesic joining and . Note that the measures , , are supported in . The condition furthermore ensures that the are bounded uniformly in . Applying Lemma 4.3 with we get
[TABLE]
where we have put . Note that the as of the last term is nonnegative since -a.e. on for every and
[TABLE]
w.r.t. convergence in . Indeed, as for every and therefore pointwise -a.e. As all considered functions are nonnegative and for all , we have in as . We conclude by strong continuity of the heat and the Schrödinger semigroup with potential in .
Lower semicontinuity of yields , and clearly as . Lastly, observe that \smash{\big{(}W_{2}^{2}(\mathsf{H}_{t}\rho_{1},\rho_{0})-W_{2}^{2}(\mathsf{H}_{t}^{*}\rho_{1},\rho_{0})\big{)}}/2t\to 0 according to Lemma 4.1 applied with . Thus, we finally deduce
[TABLE]
Theorem 4.5**.**
The -gradient estimate implies .
Proof.
Given , Proposition 4.4 translates into a “local” property at time [math]: for every , choosing and such that in , we obtain that for all with support in and bounded densities w.r.t. , for representing the -geodesic from to , we have
[TABLE]
With the same argument used in the proof of [Stu15, Theorem 3.4] for the equivalence of and , we conclude that this local implies a “local” condition in the following sense: for all there exists such that for all with support in and bounded densities w.r.t. , if represents the -geodesic from to , for every , we have
[TABLE]
Using the local-to-global property from [Stu15, Theorem 3.7] and taking the limit , noticing again that the choice of -geodesics does not depend on , allows us to pass from this local property to and finally to . ∎
4.3 From to a differential 2-transport estimate
It has already been observed in [Ket15] that yields contraction estimates for the -Wasserstein distance along two heat flows starting at regular measures. For irregular initial data, we now aim in deducing a weak version of it, see also Remark 1.8.
Proposition 4.6**.**
The implies the following differential -transport estimates:
- (i)
for every , one has
[TABLE]
where represents the -geodesic from to , and 2. (ii)
for all ,
[TABLE]
Proof.
Concerning (i), up to truncating and using monotone convergence afterwards, we may assume that is bounded. Then the claim follows by adding up the , integrated from to , , for the flow with observation point and for the flow with observation point . The entropy terms cancel out, and we obtain the desired estimate by dividing by and letting . Some care, however, is requested to deal with the double -dependence of the nonsmooth function t\mapsto\smash{W_{2}^{2}\big{(}\mathsf{H}_{t}\rho_{0},\mathsf{H}_{t}\rho_{1}\big{)}}. This has been addressed in [Ket15, Theorem 6.1].
Next, we show (ii). Denote by a sequence converging pointwise from below in a monotone way to , see Lemma 2.1, and put for . Given and , select small enough so that, for every ,
[TABLE]
The local absolute continuity of the curves and on w.r.t. and property (i) with in place of , since on , yield
[TABLE]
where represents the -geodesic from to . As , by monotone convergence, the above inequality still holds with in place of . Thus, the definition of and the inequality on for every give, setting ,
[TABLE]
Since and w.r.t. as and since is bounded uniformly in for small , stability of optimal couplings, see [AGS08, Proposition 7.1.3], and uniqueness of the -optimal coupling imply that weakly as . Thus, the map is continuous at [math] by [Vil09, Lemma 4.3]. The claim follows by taking successively , and in the above inequality. ∎
A posteriori, knowing from Theorem 1.1 that implies , we will be able to improve the bound (ii) from Proposition 4.6 even for exponents different from , see Remark 5.12 below.
5 Duality of -transport estimates and -gradient estimates
Throughout the rest of this article, given , we use the short-hand notation . Moreover, at several instances we consider a function which, unless stated otherwise, is assumed lower semicontinuous and lower bounded. However, it should practically rather be thought of as a bounded Lipschitz function “approximating” from below without being of the particular form (1.2). This often allows us to assume that , while is not continuous in general, even if is Lipschitz.
5.1 Perturbed costs and coupled Brownian motions
Given any and , let us define the perturbed -transport cost with potential at by
[TABLE]
where the infimum is taken over all pairs of coupled Brownian motions \smash{\big{(}\mathbb{P},\mathsf{b}^{1}\big{)}} and \big{(}\mathbb{P},\mathsf{b}^{2}\big{)} on , restricted to and modeled on a common probability space, with initial distributions and , respectively. In more analytic words,
[TABLE]
the infimum being taken over all whose marginals are the laws of Brownian motions on , restricted to , with initial distribution and , respectively. If , this is the usual perturbed -transport cost from Section 1.4.
A natural, albeit nontrivial identity relates the perturbed -transport cost in the case of constant with the usual -transport cost.
Lemma 5.1**.**
If is constantly equal to then, for ,
[TABLE]
Proof.
Since \smash{W_{p}(\mathsf{H}_{t}\mu_{1},\mathsf{H}_{t}\mu_{2})^{1/p}=\inf_{(\mathsf{x},\mathsf{y})}\mathbb{E}\big{[}\!\operatorname{\mathsf{d}}^{p}(\mathsf{x},\mathsf{y})\big{]}^{1/p}}, the infimum ranging over pairs of random variables and defined on a common probability space , and as for every Brownian motion with initial distribution , we get
[TABLE]
For the converse inequality, let be a -optimal coupling of and . Consider Brownian motions \smash{\big{(}\mathbb{P}_{1},\mathsf{b}^{1}\big{)}} and \smash{\big{(}\mathbb{P}_{2},\mathsf{b}^{2}\big{)}}, restricted to , starting at and , defined on probability spaces and , respectively. Define the “bridge measures” for by disintegrating w.r.t. or, in other words, by conditioning on the event . Similarly, let for be the disintegration of w.r.t. . Consider the “glued measure” defined by
[TABLE]
on . Then \smash{\big{(}\widetilde{\mathbb{P}},\mathsf{b}^{1}\big{)}} and \smash{\big{(}\widetilde{\mathbb{P}},\mathsf{b}^{2}\big{)}} is a pair of coupled Brownian motions with joint distribution at time . The desired inequality then follows directly, since
[TABLE]
Lemma 5.2**.**
For every , and as above, the infima in (5.1) and in (5.2) are attained.
Moreover, for every sequence of lower semicontinuous functions converging pointwise to from below in an increasing way, we have
[TABLE]
Proof.
The lower semicontinuity of implies the one of
[TABLE]
w.r.t. the uniform topology on which in turn implies weak lower semicontinuity of
[TABLE]
in . This gives the existence of a minimizer for (5.2) by a standard argument since, according to [Vil09, Lemma 4.4], the family of with given marginals is tight as the sets of marginals are both singletons.
The second assertion is a standard argument via -convergence of the functionals whose infima give and , respectively, in . ∎
Let us denote by the completion of the Borel -field on w.r.t. a given , and then
[TABLE]
is the -field of all universally measurable subsets of .
Lemma 5.3**.**
For every and , there exists a universally measurable map
[TABLE]
such that for every , the marginals of are laws of Brownian motions, restricted to , starting in and , respectively, and is a minimizer in the definition (5.2) of .
Proof.
According to Lemma 5.2, for each pair there exists an admissible measure on which attains the infimum in (5.2). The class of all probability measures with this property is closed. Then a measurable selection argument, see [Bog07, Stu15], allows us to produce a family of measures still satisfying the minimality property so that is universally measurable in . ∎
An important consequence of these observations is a type of Markov property which will be crucial in the proof of Theorem 5.6. For this and also for later use, fix , a measure and a universally measurable map such that for all . Define their composition by
[TABLE]
where
[TABLE]
denotes the concatenation map “gluing” together the curves and .
Proposition 5.4**.**
For every , every and all , there exists a pair \smash{\big{(}\mathbb{P},\mathsf{b}^{1}\big{)}} and \smash{\big{(}\mathbb{P},\mathsf{b}^{2}\big{)}} of coupled Brownian motions on with initial distributions and , respectively, which minimizes (5.1) for the given time and such that
[TABLE]
Proof.
Denote the map from Lemma 5.3 with in place of by , denote a minimizer of (5.2) for time by , and define . This defines a coupling of the laws of two Brownian motions with initial distributions and , respectively, restricted to such that
[TABLE]
This proves the claim. ∎
Less formally, the previous construction can be described as follows. To estimate the perturbed -transport cost at time , we construct the required process by first choosing a pair process \smash{\big{(}\mathsf{b}^{1},\mathsf{b}^{2}\big{)}} of Brownian motions with given initial distributions and which realizes the minimum for . Then we switch to a pair of Brownian motions starting in and , respectively, which minimizes the cost at time .
5.2 From differential -transport inequalities to -transport estimates
To deduce a -transport estimate , we have to control the upper derivatives of the function or, more generally, of for .
Lemma 5.5**.**
Assume that . Then for all and , we have
[TABLE]
Proof.
Choose any exponent with dual exponent . For all , denote by \smash{\big{(}\mathbb{P},\mathsf{b}^{1}\big{)}} and \smash{\big{(}\mathbb{P},\mathsf{b}^{2}\big{)}} a pair of coupled Brownian motions starting in and such that the law of \smash{\big{(}\mathsf{b}_{2t}^{1},\mathsf{b}_{2t}^{2}\big{)}} constitutes a -optimal coupling of and . Albeit this process still depends on , we suppress this dependence in the sequel to simplify the notation. For a precise construction of such process, we refer to the proof of Lemma 5.1.
Observe that
[TABLE]
Each of the last two limits will be estimated separately. The last term will converge to the upper derivative of at [math] as by monotone convergence. Moreover, since is bounded, the former term can be estimated through
[TABLE]
Now we split the expectation into a term where \big{(}\mathsf{b}^{1},\mathsf{b}^{2}\big{)} behaves well and a remainder term. Let and choose such that
[TABLE]
and define the exceptional set for by
[TABLE]
By these definitions and Fubini’s theorem, since is bounded,
[TABLE]
According to Lemma 4.2, we have as and , therefore the latter is equal to . On the other hand, if denotes an upper bound for , using Hölder’s inequality the second term can be bounded through
[TABLE]
By the choice of the pair process \big{(}\mathsf{b}^{1},\mathsf{b}^{2}\big{)}, the first is equal to while the second one is [math], as already observed above. Since was arbitrary, we obtain the claim. ∎
Theorem 5.6**.**
Fix and assume the differential -transport estimate
[TABLE]
Then the -transport estimate is satisfied.
Proof.
We first show that for all , the function is nonincreasing on whenever with on .
To get started, we demonstrate that its -th power is upper Lipschitz continuous on . To see this, fix and , and consider the pair process \smash{\big{(}\mathsf{b}^{1},\mathsf{b}^{2})} as provided by Proposition 5.4. By the estimate (5.3) of this proposition, Lemma 5.1 and contractivity of the Wasserstein heat flow, we have
[TABLE]
for suitable nonnegative constants and . This proves upper Lipschitz continuity of the -th power of the perturbed -transport cost with potential , which in turn implies
[TABLE]
for every with . Letting , the estimate (5.5) and the observation
[TABLE]
which justifies to apply Fatou’s lemma, give
[TABLE]
Finally, the inequality (5.6) for the upper derivative inside the expectation, Lemma 5.5 and then the assumed estimate (5.4), noting that on , yield the initial claim.
The nonincreasingness of on is then immediate due to an easy approximation argument using Lemma 2.1 and Lemma 5.2. ∎
Corollary 5.7**.**
For every , implies
[TABLE]
In particular, and the differential -transport estimate (5.4) are equivalent.
Proof.
Fix . For every and , we denote by \smash{\big{(}\mathbb{P},\mathsf{b}^{1}\big{)}} and \smash{\big{(}\mathbb{P},\mathsf{b}^{2}\big{)}} a pair of coupled Brownian motions which realizes the minimum in the definition of . This process does depend on , but we leave out this dependency from the notation. Arguing as in the proof of Lemma 5.5, we get
[TABLE]
for all with on . Letting , the last upper derivative becomes nonpositive due to , and approximating from below using Lemma 2.1 gives the conclusion. ∎
Using this equivalence, Hölder’s inequality and the chain rule, the subsequent nestedness of , which is the Lagrangian analogue of Lemma 3.3, is easily shown.
Corollary 5.8**.**
If holds for some , then is satisfied for all .
5.3 Transport estimates via vertical Brownian perturbations
We prove the variable Kuwada duality from Theorem 1.7. We start by first showing the implication from to , where are dual to each other. Since the behavior of Brownian trajectories can only be controlled for small times, we show the equivalent infinitesimal first-order description of in terms of a differential -transport estimate. This is done by a localization argument.
Additionally, in the extremal case , the argument mentioned above can actually be circumvented and we are able to derive the contraction estimate
[TABLE]
for all of finite -distance to each other, for every . The measure induces an arbitrary -optimal coupling of and . This is discussed now, see Theorem 5.10 and Corollary 5.11, where, possibly replacing by for , we assume that is bounded. This is not restrictive as, given these results for every , they easily pass to the limit by monotone convergence.
Given and , we define the function by
[TABLE]
Here \smash{\big{(}\mathbb{P}_{\gamma_{s}},\mathsf{b}\big{)}} denotes Brownian motion starting in for every . We will not explicitly mention the dependence of the process on . The function can be turned into a metric on by defining
[TABLE]
It is equivalent to by boundedness of since is a length metric. Let us denote by and the transport “distances” w.r.t. and , respectively. Then is a metric on , which is equivalent to the usual -Wasserstein metric . Compared to the perturbed -transport cost which measures Brownian evolutions “horizontally” by following their trajectories with fixed starting points, the distance varies the initial points along a geodesic and may thus be seen as a “vertical” counterpart of .
Let be the -Hopf–Lax semigroup and such that . Similarly to [Kuw10, Proposition 3.7], the key point will be the following Lipschitz regularity along geodesics.
Lemma 5.9**.**
Let . Then for every and all , the map belongs to , and
[TABLE]
Proof.
Let and . Notice that
[TABLE]
The latter is bounded uniformly in and since the first integral can be controlled using the Lipschitz regularization estimate (2.1) of the heat flow while the second one exploits the fact that the map is Lipschitz from to .
It follows that can be written as
[TABLE]
The Kantorovich–Rubinstein formula (2.3) for , the -contractivity of the heat flow and the duality of and give us the following upper bound for the second in (5.7)
[TABLE]
Here we used as a shorthand for the Lipschitz constant of the map from to . These estimates conclude the proof. ∎
Theorem 5.10**.**
Assume the -gradient estimate . Then for every , and ,
[TABLE]
Proof.
Without loss of generality, we consider and for , and as the general case (or, to be more precise, the first inequality, since only is continuous in general) is covered by a standard coupling argument, see e.g. [Sav14, Theorem 4.4] or [Kuw10, Lemma 3.3]. It suffices to prove since the first claimed inequality then easily follows by definition of , and by construction .
By the duality (2.3), we have to estimate from above for every . Pick a geodesic . By the upper gradient property of and the inequality, we deduce for -a.e. that
[TABLE]
denoting by \big{(}\mathbb{P}_{\gamma_{s}},\mathsf{b}\big{)} Brownian motion on starting in . Invoking Lemma 5.9 and Young’s inequality, we infer that
[TABLE]
Taking the supremum over and then infimizing over all geodesics connecting to , we conclude the desired inequality. ∎
With this in hand, we can proceed to what we have indicated in Remark 1.8, i.e. that actually, a much stronger assertion than just a control on the upper derivative of the function at [math] is possible.
Corollary 5.11**.**
Assume that is satisfied. Let so that , let , and let represent an arbitrary -optimal coupling between and , i.e. is a -optimal coupling of and . Then
[TABLE]
Proof.
Given any optimal geodesic plan as above, using Theorem 5.10 gives
[TABLE]
where \big{(}\mathbb{P}_{\gamma_{s}},\mathsf{b}\big{)} denotes Brownian motion on starting in . In the very last step, we used the assumed boundedness of together with the dominated convergence theorem. ∎
Remark 5.12**.**
The previous corollary applied to and for at , choosing as the Dirac mass on an arbitrary geodesic , yields the estimate
[TABLE]
where, as in (1.3), the function is defined by
[TABLE]
Note that is lower semicontinuous and bounded from below.
This improves the differential -transport estimate (5.4), since on , see also Proposition 4.6. In Chapter 6, we shall construct a coupling of Brownian motions obeying pathwise bounds involving the larger function in place of . In particular, using Theorem 5.17, all equivalences from Theorem 1.1 and Theorem 1.7 are still valid when replacing the function by in all relevant quantities. ∎
The proof of the property starting from with dual is slightly more involved as a control of the error terms is only possible “locally” for small times. A crucial ingredient is the subsequent result.
Lemma 5.13**.**
Let and be bounded Borel functions on such that on a ball , and . Then for every and , there exists such that for every , every nonnegative Borel function on , and every Brownian motion on starting in , we have
[TABLE]
Proof.
The condition on and guarantees that for fixed and every ,
[TABLE]
Here, is a constant depending only on , and . Therefore,
[TABLE]
where denotes the dual exponent to . By Lemma 4.2, we know that for every and small enough . Thus, , which directly proves the claim. ∎
Remark 5.14**.**
With the very same strategy, also estimates for Feynman–Kac-type expressions in terms of pairs of Brownian motions can be derived, each component being required to start within . Moreover, the integrands and are then supposed to be functions on with on . ∎
Proposition 5.15**.**
Let such that and assume the -gradient estimate . Assume that with on , and put for . Then for every , and , there exist and such that for every , every and every , we have
[TABLE]
and thus in particular,
[TABLE]
Proof.
We adapt the proof of Theorem 5.10 by adding a localization argument. Given and , choose and such that on . Let and , and note that .
Denote by the -Hopf–Lax semigroup with dual exponent . Since is a weak upper gradient and using , which clearly implies , we directly obtain, for -a.e. ,
[TABLE]
Applying Lemma 5.13 with and in place of and , respectively, we get, for small enough ,
[TABLE]
and thus
[TABLE]
for -a.e. by Young’s inequality. Therefore, Lemma 5.9 with in place of yields
[TABLE]
Taking the supremum over , we conclude by (2.3). ∎
Theorem 5.16**.**
Given with , the -gradient estimate implies the -transport estimate .
Proof.
Fix , an arbitrary geodesic and as in Proposition 5.15. Given , choose a finite covering of by metric balls , and , such that each of the enlarged balls satisfies the assumption of the previous Proposition 5.15. Without restriction, we may assume and . Applying this proposition to pairs of intermediate points and and the reparameterized geodesics defined by , , yields
[TABLE]
Since is arbitrary, this bound holds with in place of by Lemma 2.1. Furthermore, by definition of and the arbitrariness of , we deduce the differential transport estimate (5.4) with replaced by . Since this true for every , this finally yields by Theorem 5.6 and monotone convergence. ∎
5.4 Gradient estimates out of pathwise and transport estimates
A modification of the arguments given in [Kuw10, Proposition 3.1] allows us to prove the converse direction of Theorem 1.7, i.e. that the -transport estimate implies the -gradient estimate , where . As in the previous section, a control of the error terms can only be achieved for small times. Therefore, instead of deriving directly, it is more convenient to establish a local version of the -Bochner inequality .
As in the preceding Section 5.3, the extremal version is much easier to treat: in this case, the condition “” is to be interpreted as “ holds for any ”, which translates into the requirement of as discussed in Chapter 6.
Theorem 5.17**.**
The property implies the 1-gradient estimate , that is, for every and , we have
[TABLE]
Proof.
Fix and . Recall that , where denotes Brownian motion on starting in . Pick a function with on , and set for . By , given any and , we may choose a pair \smash{\big{(}\mathbb{P}_{x,y},\mathsf{b}^{1}\big{)}} and \smash{\big{(}\mathbb{P}_{x,y},\mathsf{b}^{2}\big{)}} of coupled Brownian motions in such a way that -a.s., we have
[TABLE]
for every . With this in hand, we can estimate
[TABLE]
where \smash{V_{\varrho,t}:=\big{\{}\!\operatorname{\mathsf{d}}\!\big{(}\mathsf{b}_{t}^{1},\mathsf{b}_{t}^{2}\big{)}\geq\varrho^{1/2}\big{\}}}, \smash{W_{\varrho,t}:=\big{\{}\!\int_{0}^{t}\operatorname{\mathsf{d}}\!\big{(}\mathsf{b}_{r}^{1},\mathsf{b}_{r}^{2}\big{)}\mathop{}\!\mathrm{d}r/t\geq\varrho^{1/2}\big{\}}} and .
Let us consider this upper bound for the weak upper gradient term by term, starting with the contribution coming from . We have the inequality \smash{\int_{0}^{t}{\underline{\ell}}\big{(}\mathsf{b}_{r}^{1},\mathsf{b}_{r}^{2}\big{)}\mathop{}\!\mathrm{d}r}\geq\smash{\int_{0}^{t}\ell\big{(}\mathsf{b}_{r}^{1}\big{)}\mathop{}\!\mathrm{d}r}-\smash{\mathrm{Lip}\big{(}{\underline{\ell}}\big{)}t\varrho^{1/2}} on , which gives
[TABLE]
We point out the intermediate change from the process , which in general also depends on , to a Brownian motion \smash{\big{(}\mathbb{P}_{x},\mathsf{b}^{x}\big{)}} on starting in , chosen independently of .
Next we consider the term involving . Denoting by a suitable upper bound on , we obtain by (5.8) that
[TABLE]
Similarly, the last expression which involves can be bounded through
[TABLE]
Finally, we have to extend the class of admissible functions and pass to . Every can be approximated strongly in by a sequence of Lipschitz functions with bounded support. Since is quadratic, we have in and thus, possibly passing to a subsequence, we get, for some suitable , that
[TABLE]
Moreover, in as and thus, up to a subsequence, this convergence holds -a.e., which then proves for arbitrary . By the arbitrariness of , Lemma 2.1 and the identity for every , we deduce by the monotone convergence theorem. ∎
Proposition 5.18**.**
Let , and . Assume the transport estimate , where . Suppose that with on . Then for every , there exist and such that
[TABLE]
for every and all bounded Lipschitz functions on .
Proof.
Fix . Given , choose and such that for every . Given , and with , select a pair \smash{\big{(}\mathbb{P}_{x,y},\mathsf{b}^{1}\big{)}} and \smash{\big{(}\mathbb{P}_{x,y},\mathsf{b}^{2}\big{)}} of coupled Brownian motions starting in which attains the minimum in the definition of . The choice of this pair does depend on , and , but these dependencies are suppressed in the notation. Similarly to the proof of Theorem 5.17, for every , we have
[TABLE]
where \smash{V_{\varrho,t}:=\big{\{}\!\operatorname{\mathsf{d}}\!\big{(}\mathsf{b}_{t}^{1},\mathsf{b}_{t}^{2}\big{)}\geq\varrho^{1/2q}\big{\}}}. The contribution of vanishes as due to
[TABLE]
for a suitable , where we used the assumption that in the first inequality and the condition in the last inequality.
Next we study the influence coming from . Choosing some exponents and dual to each other, using Hölder’s inequality, Lemma 5.13 with and in place of and , respectively, and eventually assumption , we obtain for sufficiently small that
[TABLE]
Here is a Brownian motion on starting in which is chosen independently of . Once again using Lemma 5.13 as above to estimate the last expression, we finally obtain
[TABLE]
Theorem 5.19**.**
Given with , the -transport estimate implies the -gradient estimate .
Proof.
Let be as in Proposition 5.18 and put for . First, we assume that . Given , , , and the associated time from in Proposition 5.18, arguing as in the proof of Theorem 3.4, the function defined by
[TABLE]
belongs to for every and all nonnegative functions supported in . The function itself and its derivative at [math] are nonnegative by Proposition 5.18. The latter translates into
[TABLE]
Approximating from below by the sequence of functions for , or in other words, replacing by for every , where tends to from below as provided by Lemma 2.1, and letting and , we obtain precisely the local -Bakry–Émery inequality according to Definition 3.9. Since the latter implies by Theorem 3.10, the equivalence with finishes the proof in the case .
If , choosing in Proposition 5.18 and arguing as above, we obtain , which in turn implies . ∎
6 A pathwise coupling estimate
This section is dedicated to the proof of the existence of a pair \smash{\big{(}\mathbb{P},\mathsf{b}^{1}\big{)}} and \smash{\big{(}\mathbb{P},\mathsf{b}^{2}\big{)}} of coupled Brownian motions with arbitrary initial distributions, under a slightly stronger assumption than for large enough , such that -a.s.,
[TABLE]
where is defined as in (1.3). It is necessary to adapt the arguments from [Stu15, Section 2] in a nontrivial way, since this pathwise estimate requires control of the entire path of \smash{\big{(}\mathsf{b}^{1},\mathsf{b}^{2}\big{)}} on the interval and not just at the endpoints.
Theorem 6.1**.**
Suppose that, for all large enough , the map is nonincreasing on for every . Then for every there exists a pair \smash{\big{(}\mathbb{P},\mathsf{b}^{1})} and \smash{\big{(}\mathbb{P},\mathsf{b}^{2}\big{)}} of coupled Brownian motions on with initial distributions and , respectively, such that -a.s., we have
[TABLE]
In particular, the pathwise coupling property holds.
The assumption of Theorem 6.1 above is satisfied if holds by Remark 5.12, and it implies for all large enough by the discussion from Theorem 5.6 and Corollary 5.7. By nestedness of -transport estimates from Corollary 5.8, we may suppose without restriction that the assumption of Theorem 6.1 holds for every .
The proof of this theorem will be subdivided into multiple steps. Firstly, we construct a coupled process starting in , , satisfying the desired pathwise contraction estimate on the interval . Secondly, a gluing procedure will let us extend the process to . Finally, we use a coupling technique to allow for arbitrary initial distributions.
6.1 Deterministic initial distributions and time interval
Proposition 6.2**.**
For every , there exists a universally measurable map
[TABLE]
such that for every , the marginals of are laws of Brownian motions, restricted to , starting in and , respectively, and
[TABLE]
Proof.
Given and an increasing sequence tending to , denote by the measure obtained by Lemma 5.3 for the exponent , replaced by , and time in place of . As for Lemma 5.2, we see that the sequence is tight. Hence it converges weakly to some along a subsequence which we do not relabel.
Let arbitrary, and fix with on . Then by Hölder’s inequality and the nonincreasingness of for large enough , we obtain
[TABLE]
Sending and then approximating from below by means of Lemma 2.1 gives
[TABLE]
A measurable selection argument as in the proof of Lemma 5.3 establishes the claim. ∎
The next goal is to obtain a measure which obeys such pathwise bound at every initial and terminal time instance in, say, . Indeed, this is the point where the main work has to be done.
Theorem 6.3**.**
There exists a universally measurable map
[TABLE]
such that for every , we have that the marginals of are laws of Brownian motions, restricted to , starting in and , respectively, and that there exists a -negligible Borel set such that
[TABLE]
for all .
Proof.
The strategy relies on patching the laws obtained in the previous proposition together on small dyadic partitions of . Denote by the map from Proposition 6.2 and define
[TABLE]
that is, at every dyadic partition point of at scale , we attach a new random curve evolving according to the law obtained in Proposition 6.2 to the random endpoint of the previous curve. The marginals of are the laws of Brownian motions on , restricted to , starting in and , respectively. As in the proof of Lemma 5.2, we may exhibit a subsequence, not relabeled in the sequel, weakly converging to some .
The key point lies in proving that for every with , there exists a -negligible Borel set such that, for every ,
[TABLE]
By continuity of curves, the desired requirements are then satisfied by the -null set
[TABLE]
Let as above, i.e. on . Pick and as above and notice that the sequences and tend to and , respectively, as . Fix and an arbitrary . Given any , for every path one gets
[TABLE]
Observing that the dyadic partition of of step size contains the one at scale and then integrating the resulting -a.e. valid estimate, truncated at large enough , against an arbitrary nonnegative function , we obtain
[TABLE]
where . Since is bounded, for all and every , this yields
[TABLE]
for all large enough . Letting , and then in the previous estimate as well as extending the class of to nonnegative, bounded Borel functions by a routine approximation argument, we get
[TABLE]
Let us now put
[TABLE]
which clearly satisfies \smash{\boldsymbol{\mu}_{x,y}\big{[}\widetilde{E}_{s,t}\big{]}}=0, and (6.1) holds on with in place of by the convergences and as . Finally, denoting by a sequence approximating from below as provided by Lemma 2.1, the above reasoning gives Borel subsets of such that \smash{\boldsymbol{\mu}_{x,y}\big{[}\widetilde{E}_{s,t}^{n}\big{]}=0} and
[TABLE]
for every . Putting
[TABLE]
we see that \boldsymbol{\mu}_{x,y}\big{[}E_{s,t}\big{]}=0 and that (6.1) holds for all by monotone convergence.
A similar argument and arguing as for Lemma 5.3 shows that we can then select the obtained measures in a universally measurable way. ∎
6.2 Extension to arbitrary initial distributions and time interval
The cases of arbitrary initial distributions and an infinite time horizon are immediate given the construction in the proof of Theorem 6.3. By iteratively composing copies of with , we obtain a measure such that . The pathwise coupling properties on each interval , , which are inherited by carry over to the entire space. As a result, we get the following.
Theorem 6.4**.**
For all with marginals , the measure constructed above satisfies the following properties: both its marginals coincide with the law of Brownian motions on starting in and , respectively, and for -a.e. , we have
[TABLE]
By considering the canonical process \big{(}\mathsf{b}^{1},\mathsf{b}^{2}\big{)} defined by and under the measure , we immediately obtain the assertion of Theorem 6.1, which is just a stochastic rephrasing of the previous result.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AG + 15] L. Ambrosio, N. Gigli, A. Mondino and T. Rajala. Riemannian Ricci curvature lower bounds in metric measure spaces with σ 𝜎 \sigma -finite measure. Trans. Amer. Math. Soc. 367 (2015), no. 7, 4661–4701.
- 2[AGS 08] L. Ambrosio, N. Gigli and G. Savaré. Gradient flows in metric spaces and in the space of probability measures. Second edition. Lectures in Mathematics ETH Zürich. Birkhäuser, Basel, 2008.
- 3[AGS 13] L. Ambrosio, N. Gigli and G. Savaré. Density of Lipschitz functions and equivalence of weak gradients in metric measure spaces. Rev. Mat. Iberoam. 29 (2013), no. 3, 969–996.
- 4[AGS 14a] L. Ambrosio, N. Gigli and G. Savaré. Calculus and heat flow in metric measure spaces and applications to spaces with Ricci bounds from below. Invent. Math. 195 (2014), no. 2, 289–391.
- 5[AGS 14b] L. Ambrosio, N. Gigli and G. Savaré. Metric measure spaces with Riemannian Ricci curvature bounded from below. Duke Math. J. 163 (2014), no. 7, 1405–1490.
- 6[AGS 15] L. Ambrosio, N. Gigli and G. Savaré. Bakry–Émery curvature-dimension condition and Riemannian Ricci curvature bounds. Ann. Probab. 43 (2015), no. 1, 339–404.
- 7[ACT 08] M. Arnaudon, K. A. Coulibaly and A. Thalmaier. Horizontal diffusion in C 1 superscript C 1 \mathrm{C}^{1} path space. Séminaire de Probabilités XLIII, 73–94, Lecture Notes in Math., 2006, Springer, Berlin, 2011.
- 8[BS 10] K. Bacher and K.-T. Sturm. Localization and tensorization properties of the curvature-dimension condition for metric measure spaces. J. Funct. Anal. 259 (2010), no. 1, 28–56.
