An intrinsic parallel transport in Wasserstein space
John Lott

TL;DR
This paper introduces a geometric method for parallel transporting tangent cones along geodesics in Wasserstein space over a Riemannian manifold, aligning with previous formal calculations in smooth cases.
Contribution
It provides a rigorous geometric construction of parallel transport in Wasserstein space, extending formal calculations to a precise mathematical framework.
Findings
Constructs a geometric parallel transport in Wasserstein space
Shows agreement with formal calculations in smooth cases
Extends understanding of geometric structures in optimal transport
Abstract
If M is a smooth compact connected Riemannian manifold, let P(M) denote the Wasserstein space of probability measures on M. We describe a geometric construction of parallel transport of some tangent cones along geodesics in P(M). We show that when everything is smooth, the geometric parallel transport agrees with earlier formal calculations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometric Analysis and Curvature Flows · Geometry and complex manifolds · Advanced Differential Geometry Research
An intrinsic parallel transport in Wasserstein space
John Lott
Department of Mathematics
University of California - Berkeley
Berkeley, CA 94720-3840
USA
(Date: January 6, 2017)
Abstract.
If is a smooth compact connected Riemannian manifold, let denote the Wasserstein space of probability measures on . We describe a geometric construction of parallel transport of some tangent cones along geodesics in . We show that when everything is smooth, the geometric parallel transport agrees with earlier formal calculations.
Research partially supported by NSF grant DMS-1207654 and a Simons Fellowship
1. Introduction
Let be a smooth compact connected Riemannian manifold without boundary. The space of probability measures of carries a natural metric, the Wasserstein metric, and acquires the structure of a length space. There is a close relation between minimizing geodesics in and optimal transport between measures. For more information on this relation, we refer to Villani’s book [13].
Otto discovered a formal Riemannian structure on , underlying the Wasserstein metric [10]. One can do formal geometric calculations for this Riemannian structure [6]. It is an interesting problem to make these formal considerations into rigorous results in metric geometry.
If has nonnegative sectional curvature then is a compact length space with nonnegative curvature in the sense of Alexandrov [8, Theorem A.8], [12, Proposition 2.10]. Hence one can define the tangent cone of at a measure . If is absolutely continuous with respect to the volume form then is a Hilbert space [8, Proposition A.33]. More generally, one can define tangent cones of without any curvature assumption on , using Ohta’s -uniform structure on [9]. Gigli showed that is a Hilbert space if and only if is a “regular” measure, meaning that it gives zero measure to any hypersurface which, locally, is the graph of the difference of two convex functions [3, Corollary 6.6]. For examples of tangent cones at nonregular measures, if is an embedded submanifold of , and is an absolutely continuous measure on , then was computed in [7, Theorem 1.1].
If is a smooth curve in a Riemannian manifold then one can define the (reverse) parallel transport along as a linear isometry from to . If is a finite-dimensional Alexandrov space then the replacement of a tangent space is a tangent cone. If one wants to define a parallel transport along a curve , as a map from to , then there is the problem that the tangent cones along may not look much alike. For example, the curve may pass through various strata of . One can deal with this problem by assuming that is in the interior of a minimizing geodesic. In this case, Petrunin proved the tangent cones along are mutually isometric, by constructing a parallel transport map [11]. His construction of the parallel transport map was based on passing to a subsequential limit in an iterative construction along . It is not known whether the ensuing parallel transport is uniquely defined, although this is irrelevant for Petrunin’s result.
In the case of a smooth curve in the space of smooth probability measures, one can do formal Riemannian geometry calculations on to write down an equation for parallel transport along [6, Proposition 3]. It is a partial differential equation in terms of a family of functions . Ambrosio and Gigli noted that there is a weak version of this partial differential equation [1, (5.9)]. By a slight extension, we will define weak solutions to the formal parallel transport equation; see Definition 2.13.
Petrunin’s construction of parallel transport cannot work in full generality on , since Juillet showed that there is a minimizing Wasserstein geodesic with the property that the tangent cones at measures on the interior of are not all mutually isometric [5]. However one can consider applying the construction on certain convex subsets of . We illustrate this in two cases. The first and easier case is when is a Wasserstein geodesic of -measures (Proposition 3.1). The second case is when is a Wasserstein geodesic of absolutely continuous measures, lying in the interior of a minimizing Wasserstein geodesic, and satisfying a regularity condition. Suppose that is an element of the tangent cone at the endpoint. Here is a square-integrable gradient vector field on and is in the Sobolev space . For each sufficiently large integer , we construct a triple
[TABLE]
with , which represents an approximate parallel transport along .
Theorem 1.2**.**
Suppose that has nonnegative sectional curvature. A subsequence of converges weakly to a weak solution of the parallel transport equation with . If is a smooth geodesic in , is smooth, and there is a smooth solution to the parallel transport equation (2.6) with , then in norm.
Remark 1.3**.**
In the setting of Theorem 1.2, we can say that is the parallel transport of along to .
Remark 1.4**.**
We are assuming that has nonnegative sectional curvature in order to apply some geometric results from [11]. It is likely that this assumption could be removed.
Remark 1.5**.**
A result related to Theorem 1.2 was proven by Ambrosio and Gigli when [1, Theorem 5.14], and extended to general by Gigli [4, Theorem 4.9]. As explained in [1, 4], the construction of parallel transport there can be considered to be extrinsic, in that it is based on embedding the (linear) tangent cones into a Hilbert space and applying projection operators to form the approximate parallel transports. Although we instead use Petrunin’s intrinsic construction, there are some similarities between the two constructions; see Remark 3.32. We use some techniques from [1], especially the idea of a weak solution to the parallel transport equation.
Remark 1.6**.**
Besides its inherent naturality, the intrinsic construction of parallel transport given here is likely to allow for extensions. For example, using the results of [7], it seems likely that Petrunin’s construction could be extended to define parallel transport along Wasserstein geodesics of absolutely continuous measures on submanifolds of . In the present paper we have done this when the submanifolds have dimension zero or codimension zero.
The structure of this paper is as follows. In Section 2 we discuss weak solutions to the parallel transport equation. In Section 3 we prove Theorem 1.2.
I thank Takumi Yokota and Nicola Gigli for references to the literature.
2. Weak solutions to the parallel transport equation
Let be a compact connected Riemannian manifold without boundary. Put
[TABLE]
Given , define a vector field on by saying that for ,
[TABLE]
The map passes to an isomorphism . Otto’s Riemannian metric on is given [10] by
[TABLE]
In view of (2.2), we write . Then
[TABLE]
To write the equation for parallel transport, let be a smooth curve. We write and define , up to a constant, by . This is the same as saying
[TABLE]
Let be a vector field along , with . The equation for to be parallel along [6, Proposition 3] is
[TABLE]
Lemma 2.7**.**
[6, Lemma 5]** If are solutions of (2.6) then is constant in .
Lemma 2.8**.**
Given , there is at most one solution of (2.6) with , up to time-dependent additive constants.
Proof.
By linearity, it suffices to consider the case when . From Lemma 2.7, and so is spatially constant. ∎
For consistency with later notation, we will write for .
Lemma 2.9**.**
(c.f. [1, (5.8)]) Given , if satisfies (2.6) then
[TABLE]
Proof.
We have
[TABLE]
Then
[TABLE]
This proves the lemma. ∎
We now weaken the regularity assumptions. Let denote the absolutely continuous probability measures on with full support. Suppose that is a Lipschitz curve whose derivative exists for almost all . We can write with . By the Lipschitz assumption, the essential supremum over of is finite. As before, we write .
Definition 2.13**.**
Let be a Lipschitz curve whose derivative exists for almost all . Given , and , we say that is a weak solution of the parallel transport equation if
[TABLE]
for all .
Remark 2.15**.**
In what follows, there would be analogous results if we replaced everywhere by . We will stick with for concreteness.
From Lemma 2.9, if is a smooth curve in and is a solution of (2.6) then is a weak solution of the parallel transport equation. We now prove the converse.
Lemma 2.16**.**
Suppose that is a smooth curve in . Given and , if is a weak solution of the parallel transport equation then satisfies (2.6), and (modulo constants).
Proof.
In this case, equation (2.14) is equivalent to
[TABLE]
Taking with , it follows that (2.6) must hold. Then taking all , it follows that and . Hence and (modulo constants). ∎
Lemma 2.18**.**
Suppose that is a smooth curve in . Given , , and , suppose that
- (1)
* is a weak solution to the parallel transport equation,* 2. (2)
* satisfies (2.6),* 3. (3)
, 4. (4)
[TABLE]
and 5. (5)
[TABLE]
Then , and for almost all .
Proof.
From (2.6) (applied to ) and (2.14), we have
[TABLE]
From Lemma 2.7,
[TABLE]
Then
[TABLE]
Thus in .
Next, replacing by in (2.14) gives
[TABLE]
Then
[TABLE]
Thus in , for almost all . ∎
3. Parallel transport along Wasserstein geodesics
3.1. Parallel transport in a finite-dimensional Alexandrov space
We recall the construction of parallel transport in a finite-dimensional Alexandrov space .
Let be a geodesic segment that lies in the interior of a minimizing geodesic. Then is an isometric product of with the normal cone . We want to construct a parallel transport map from to .
Given and , define by . We define an approximate parallel transport as follows. Given , let be a minimizing geodesic segment with and . For each , let be a minimizing geodesic with and . Let be the normal projection of . After passing to a sequence , we can assume that . Then . If has nonnegative Alexandrov curvature then .
In [11], the approximate parallel transport from an appropriate dense subset to was defined to be . It was shown that by taking and applying a diagonal argument, in the limit one obtains an isometry from a dense subset of to . This extends by continuity to an isometry from to .
If is a smooth Riemannian manifold then is independent of the choices and can be described as follows. Given , let be the Jacobi field along with and . (It is unique since is in the interior of a minimizing geodesic.) Then .
3.2. Construction of parallel transport along a
Wasserstein geodesic of delta measures
Let be a compact connected Riemannian manifold without boundary. Let be a geodesic segment that lies in the interior of a minimizing geodesic. Let be (reverse) parallel transport along . Put . Then is a Wasserstein geodesic that lies in the interior of a minimizing geodesic. We apply Petrunin’s construction to define parallel transport directly from the tangent cone to the tangent cone (instead of the normal cones). From [7, Theorem 1.1], we know that .
Proposition 3.1**.**
The parallel transport map from to is the map .
Proof.
Given and , define by and by . We define an approximate parallel transport as follows.
Given and a real vector space , let be multiplication by . Let be a compactly-supported element of . For small , there is a Wasserstein geodesic , with and corresponding to , given by . Given , let be a minimizing geodesic with and . There is a compactly-supported measure so that for , we have . If is large and is small then all of the constructions take place well inside a totally convex ball, so is unique and can be written as . Then exists and equals . Thus .
Now
[TABLE]
Taking , this approaches . ∎
3.3. Construction of parallel transport along a
Wasserstein geodesic of absolutely continuous measures
Let be a compact connected boundaryless Riemannian manifold with nonnegative sectional curvature. Then has nonnegative Alexandrov curvature.
Let be a geodesic segment that lies in the interior of a minimizing geodesic. Write . Since is defined up to a constant, it will be convenient to normalize it by . We assume that
[TABLE]
In particular, this is satisfied if lies in .
Let denote the normal cone to at . We want to construct a parallel transport map from to .
Given and , define by . Correspondingly, write . We define an approximate parallel transport , using Jacobi fields, as follows.
Let us write , i.e. . The curve is given by , where . That is, for any ,
[TABLE]
If is a variation of , i.e. , then taking the variation of (3.4) gives
[TABLE]
Here
[TABLE]
with . The corresponding tangent vector at is represented by , where is orthogonal projection on . We can think of as a Jacobi field along . If then its approximate parallel transport along is represented by .
Next, using (3.6), for we have
[TABLE]
Here is the vector at given by
[TABLE]
If instead then
[TABLE]
We will need to estimate .
Lemma 3.10**.**
For large , there is an estimate
[TABLE]
Here, and hereafter, denotes a constant that can depend on the fixed Riemannian manifold .
Proof.
Since is projection onto , and , we have
[TABLE]
(Compare with [1, Proposition 4.3].) Defining by
[TABLE]
we obtain
[TABLE]
Since , if is large then is much smaller than the injectivity radius of . In particular, the curve lies well within a normal ball around . Now can be estimated in terms of . In general, if a function on a complete Riemannian manifold satisfies then the manifold isometrically splits off an -factor and the optimal transport path generated by is translation along the -factor. In such a case, the analog of is the identity map. If then the divergence of a short optimal transport path from being a translation can be estimated in terms of . Putting in the estimates gives (3.11). ∎
Using Lemma 3.10, we have
[TABLE]
Next, given , consider the geodesic
[TABLE]
Put
[TABLE]
Then is a Jacobi field along , with and . Jacobi field estimates give
[TABLE]
again for large.
Lemma 3.19**.**
Define by
[TABLE]
Then for large , the map is invertible for all .
Proof.
Define by
[TABLE]
Then whenever , we have
[TABLE]
so whenever , for large we have
[TABLE]
Hence , so for large the map is invertible and a right inverse for is given by . This implies that is surjective.
Now suppose that is nonzero, with . After normalizing, we may assume that has unit length. Then
[TABLE]
for large . If is sufficiently large then this is a contradiction, so is injective. ∎
Fix . If then after normalizing, we may assume that it has unit length. For large and , define as follows. First, using Lemma 3.19, find so that . For , put
[TABLE]
Doing backward recursion, starting with , using Lemma 3.19 we find so that . For , put
[TABLE]
Decrease by one and repeat. The last step is when .
From the argument in [11, Lemma 1.8],
[TABLE]
We note that the proof of [11, Lemma 1.8] only uses results about geodesics in Alexandrov spaces, it so applies to our infinite-dimensional setting. It also uses the assumption that lies in the interior of a minimizing geodesic. After passing to a subsequence, we can assume that
[TABLE]
in the weak topology on . Note that .
From (3.9), (3.15) and (3.18), for a fixed , on each interval we have
[TABLE]
It follows that is a weak solution of the parallel transport equation. As the limiting vector fields are gradient vector fields, we can write for some .
Suppose that is a smooth geodesic in , that (and hence ) is smooth and that there is a smooth solution to the parallel transport equation (2.6) with . By Lemma 2.7, is independent of . By Lemma 2.18, . We claim that
[TABLE]
in the norm topology on . This is because of the general fact that if is a sequence in a Hilbert space with , and there is some unit vector so that every weakly convergent subsequence of has weak limit , then in the norm topology.
In particular,
[TABLE]
in the norm topology on .
This proves Theorem 1.2.
Remark 3.32**.**
The construction of parallel transport in [1, Section 5] and [4, Section 4] is also by taking the limit of an iterative procedure. The underlying logic in [1, 4] is different than what we use, which results in a different algorithm. The iterative construction in [1, 4] amounts to going forward along the curve applying certain maps , instead of going backward along using the inverses of the ’s as we do. In the case of , the map is the same as , but this is not the case in general. The map is nonexpanding, which helps the construction in [1, 4]. In contrast, is not nonexpanding. In order to control its products, we use the result (3.27) from [11].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] L. Ambrosio and N. Gigli, “Construction of the parallel transport in the Wasserstein space”, Meth. Appl. Anal. 18, p. 1-30 (2008)
- 2[2] D. Burago, Y. Burago and S. Ivanov, A course on metric geometry , Graduate Studies in Mathematics 33, Amer. Math. Soc., Providence (2001)
- 3[3] N. Gigli, “On the inverse implication of Brenier-Mc Cann theorems and the structure of ( 𝒫 2 ( M ) , W 2 ) subscript 𝒫 2 𝑀 subscript 𝑊 2 ({\mathcal{P}}_{2}(M),W_{2}) ”, Methods Appl. Math. 18, p. 127-158 (2011)
- 4[4] N. Gigli, “Second order analysis on ( P 2 ( M ) , W 2 ) subscript 𝑃 2 𝑀 subscript 𝑊 2 (P_{2}(M),W_{2}) ”, Mem. Amer. Math. Soc. 216 (2012)
- 5[5] N. Juillet, “On displacement interpolation of measures involved in Brenier’s theorem”, Proc. Amer. Math. Soc. 139, p. 3623-3632 (2011)
- 6[6] J. Lott, “Some geometric calculations on Wasserstein space”, Comm. Math. Phys. 277, p. 423-437 (2008)
- 7[7] J. Lott, “On tangent cones in Wasserstein space”, to appear, Proc. of the Amer. Math. Soc.
- 8[8] J. Lott and C. Villani, “Ricci curvature for metric-measure spaces via optimal transport”, Ann. Math. 169, p. 903-991 (2009)
