$L^1$-Monge problem in metric spaces possibly with branching geodesics
Shinichiro Kobayashi

TL;DR
This paper proves the existence of optimal transport maps in certain metric spaces with branching geodesics, extending classical results to more complex geometries like normed spaces and Hilbert geometries.
Contribution
It establishes the existence of optimal transport maps in metric spaces with branching geodesics, broadening the scope of optimal transport theory.
Findings
Optimal transport maps exist in some metric spaces with branching geodesics.
Results apply to normed spaces and Hilbert geometries.
The existence holds when the first marginal is absolutely continuous.
Abstract
In this paper, we consider the Monge optimal transport problem with distance cost. We prove that in some metric spaces, possibly with many branching geodesics, an optimal transport map exists if the first marginal is absolutely continuous. The result is applicable to normed spaces and Hilbert geometries.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometry and complex manifolds · Geometric Analysis and Curvature Flows · Advanced Differential Geometry Research
-Monge problem in metric spaces possibly with branching geodesics
Shinichiro Kobayashi
Mathematical Institute, Tohoku University, Sendai 980-8578, Japan
(Date: August 2, 2019.)
Abstract.
In this paper, we consider the Monge optimal transport problem with distance cost. We prove that in some metric spaces, possibly with many branching geodesics, an optimal transport map exists if the first marginal is absolutely continuous. The result is applicable to normed spaces and Hilbert geometries.
Key words and phrases:
optimal transport; Monge problem; Hilbert geometry; projective metric.
2010 Mathematics Subject Classification:
Primary 49J45; Secondary 49K30, 58B20
1. Introduction
Mass transportation problem has its origin in Monge-Kantorovich [Monge1781, Kantorovich] and is stated as follows. Let be a topological space and let and be two Borel probability measures on . Let be a function and call it a cost function. The Monge mass transportation problem, the Monge problem for short, with the cost function asks whether the infimum
[TABLE]
is achieved, where runs over all Borel measurable maps from to satisfying . We call such a map a transport map from to . The measure is the push-forward measure of under . We call the objective functional in (M) the total cost functional. A minimizer in (M) is called a -optimal transport map from to . The lack of linearity and coercivity of the total cost functional makes the Monge problem difficult. Kantorovich formulated a relaxation of the problem and overcame these difficulties. His idea is to use so-called transport plans, also known as joint distributions, instead of maps. The set of transport plans, , is defined by
[TABLE]
where the map is the canonical projection for and the set denotes the set of all Borel probability measures on . Kantorovich proposed to study the attainability of the infimum
[TABLE]
A minimizer in (K) is called a -optimal transport plan from to . If is a Polish space and if the cost function is lower semi-continuous, then an optimal transport plan exists. In order to obtain an optimal transport map, it is sufficient to see that a certain optimal transport plan is induced by a map.
Let be a complete separable metric space. We focus on the case that the cost function is the distance function . Under a certain lower bound condition of curvature, the existence of an optimal transport map is proved by many researchers. For metric spaces without non-branching geodesics, see [BiCa11, BiCa13, Cava17]. For normed spaces, see e.g., [AKP04, AP03, CP11].
In this paper, inspired by the work of Champion-Pascale [CP11], we consider the Monge problem on a metric space , where is a bounded convex -set in an Euclidean space with non-empty interior and is a metric on satisfying some projectivity conditions.
Assumption A**.**
We assume the following three conditions on a metric on .
- (i)
The topology induced by coincides with the Euclidean one. 2. (ii)
Any line segment is a geodesic in . 3. (iii)
The -dimensional Lebesgue measure on is locally doubling.
Note that we do not assume that is non-branching. Under these settings, we have the following main theorem.
Theorem 1.1**.**
Let be a bounded convex -set in the -dimensional Euclidean space with non-empty interior and be a metric on satisfying Assumption A. Let and be two Borel probability measures on with compact support. If is absolutely continuous with respect to the -dimensional Lebesgue measure , then there exists a -optimal transport map from to .
The idea of the proof of Theorem 1.1 is based on [CP11]. We modify a variational approximation of transport plans, introduced in [CP11], and use it. To see that our approximation scheme works well, we need an appropriate grid for a projective metric. Unlike the case of normed spaces, it is not clear that there exists such a grid. If we assume (iii), then this difficulty is resolved. In this case, the doubling dimension may be different from . Note that our approximation scheme depends on the doubling dimension.
The Hilbert metric on an bounded convex open set in an Euclidean space satisfies Assumption A. The Hilbert metric is a generalization of the Cayley-Klein model of the hyperbolic geometry. For a bounded convex open set in an Euclidean space endowed with the Hilbert metric is called the Hilbert geometry for . Under some regularity assumptions on the boundary of a domain, the Hilbert geometry is regarded as a Finsler manifold of constant flag curvature . We obtain the existence result of an optimal transport map for a Hilbert geometry as a corollary.
Corollary 1.2**.**
Let be a bounded convex open set and the Hilbert metric on . Let and be two Borel probability measures on with compact support. If is absolutely continuous with respect to the -dimensional Lebesgue measure , then there exists an -optimal transport map from to .
Acknowledgment**.**
The author would like to thank Professor Takashi Shioya for helpful comments.
2. Preliminaries
2.1. General facts from optimal transport theory
In this section, we recall some basics on optimal transport theory. We refer to [AG13, Vil09] for more details. For a topological space , denote by the set of all Borel probability measures on .
Definition 2.1**.**
For two topological spaces and , a Borel measurable map and a Borel probability measure , we define the push-forward measure of under by
[TABLE]
for any Borel set in .
Let be two Borel probability measures. For a Borel measurable map , we say that is a transport map from to if holds. The set of all transport maps from to is denoted by . For a Borel probability measure , we say that is a transport plan from to if for holds, where the map is the canonical projection for . In this case, we say that is the first marginal of and that is the second marginal of . The set of all transport plans from to is denoted by . For a transport map , we denote by , where is the map that assigns each to .
Definition 2.2**.**
Let be a function, which we call a cost function. A transport map is a -optimal transport map if it minimizes
[TABLE]
A transport plan is a -optimal transport plan if it minimizes
[TABLE]
The minimization problem for and is called the Monge problem and the Kantorovich problem for the cost function , respectively. By a direct argument, the existence of a -optimal transport plan is ensured if is a Polish space and if is lower semi-continuous. Kantorovich problem admits a dual formulation. We recall the notion of -transform and -concavity.
Definition 2.3**.**
Let be a function.
- (1)
The -transform of is defined by
[TABLE] 2. (2)
is -concave if there exists a function on such that holds.
Example 2.4**.**
Let be a metric space and a function. Then, is -concave if and only if is -Lipschitz continuous.
Definition 2.5**.**
A subset is said to be c-cyclically monotone if for any finitely many points , we have
[TABLE]
where . A Borel probability measure on is said to be -cyclically monotone if is concentrated on a -cyclically monotone set.
In the case that is the -dimensional Euclidean space and is the square of the Euclidean metric, the -cyclical monotonicity of yields the monotonicity of , i.e., for any , we have
[TABLE]
where the dot is the Euclidean inner product of . Next we recall the notion of Kantorovich potential. Denote by the set of all pairs of functions such that
[TABLE]
holds for -almost all and -almost all . Denote by the set of all pairs of functions such that
[TABLE]
holds for all . For , we define by
[TABLE]
We call every maximizing a Kantorovich potential. The following theorem is known as the Kantorovich duality.
Theorem 2.6** (Kantorovich duality).**
Let be a lower semi-continuous function. Then we have the equalities
[TABLE]
Theorem 2.7**.**
Let be a lower semi-continuous function. If is optimal and if is finite, then is concentrated on a -cyclically monotone set. Moreover, if is real-valued, then there exists a -cyclically monotone set such that for any , the following are equivalent to each other.
- (1)
* is a -optimal transport plan from to .* 2. (2)
* is -cyclically monotone.* 3. (3)
There exists a -concave Borel measurable function such that for -almost every point , we have
[TABLE] 4. (4)
* is concentrated on .*
Theorem 2.8**.**
Let be a lower semi-continuous function. Assume that ,
[TABLE]
and
[TABLE]
Then, there exists a -concave function such that the pair is a Kantorovich potential.
Let be a complete and separable metric space and let . We say that a measure has finite -th moment with respect to if there exists a point such that
[TABLE]
Denote by the set of all probability measures with finite -th moment with respect to . Note that for , the assumptions in Theorem 2.8 are fulfilled for . The -Wasserstein metric on is defined by
[TABLE]
i.e., is the -th root of the -optimal transport cost from to . The function is indeed a metric on .
2.2. Some facts on metric spaces
In this section, we enumerate some facts and prove some claims on metric spaces.
Let be a metric space. We call a discrete subset of a net. For a net and , we say that is an -net if the -neighborhood of coincides with the whole . We say that is -discrete if for any two distinct points . The set of all -discrete nets equipped with the inclusion relation forms a poset and we see that any maximal element of the set is an -net.
Definition 2.9**.**
Let be a non-negative Borel measure on . We say that is locally doubling if for any , there exists a constant such that for any and .
The locally doubling condition is equivalent to
[TABLE]
for any and , where is the base logarithm function and is the ceil function.
Lemma 2.10**.**
Let be a metric space, let be a bounded Borel set with diameter and let be an -discrete net of . Assume that admits a locally doubling Borel measure . Then the number of is bounded above by .
Proof.
The -discreteness of yields that a family of -balls
is pairwise disjoint. Combining this fact with the doubling condition (2.1) for and , we obtain
[TABLE]
which yields the lemma. ∎
Definition 2.11**.**
Let be any subset of . A map is called a nearest point projection to if for any , where .
Lemma 2.12**.**
If is a finite subset, then there exists a Borel measurable nearest point projection to .
Proof.
We put
[TABLE]
and . The assertion follows from
[TABLE]
where is the distance function from . ∎
3. Proof of Main Theorem
In order to prove the existence of an optimal transport map, it suffices to show that a certain optimal transport plan is induced by a transport map. The following lemma gives a criterion according to which a measure in a product space is induced by a map.
Lemma 3.1**.**
Let and be two Polish spaces and let . Let be the first marginal of . Then, is induced by a map if and only if there exists a Borel measurable set of full -measure such that for -almost every there exists a unique with . In this case, the map is -measurable and .
Throughout this section, let be a bounded and convex -set with non-empty interior and let be a metric on satisfying Assumption A (see Section 1). The idea that we treat -sets is based on Mazurkiewictz’s theorem (for the proof, see e.g.[Wil70]).
Theorem 3.2** (Mazurkiewictz).**
Let be a completely metrizable topological space and a subspace of . Then is completely metrizable if and only if is a -set.
Let be a Borel probability measure on that is absolutely continuous with respect to the -dimensional Lebesgue measure . We call the function defined by
[TABLE]
the density of and denote it by .
For two probability measures , let be the set of all -optimal transport plans from to . We construct two families and of optimal transport plans and show that each of elements in is induced by a map. As a result, the set consists of a single element. The construction of is similar to that of [San09] and [CP11]. We first define the set . We consider the secondary variational problem:
[TABLE]
We denote by the set of all solutions of (3). Since the topology of is Euclidean, it is easy to see that is non-empty. On the other hand, we cannot deduce the monotonicity of a measure in due to the lack of elements in the feasible region in (3). Nevertheless, we will see that a weak version of monotonicity holds for any measure in . We say that a subset of is restrictedly monotone if for any with , we have
[TABLE]
where the set is the line segment between and . A Borel measure on is said to be restrictedly monotone if is concentrated on a restrictedly monotone subset of . We will see that any measure in is restrictedly monotone. We replace the problem (3) with an equivalent one whose feasible region is the whole . Accordingly, we must change the objective functional. We fix a Kantorovich potential for and put
[TABLE]
The ordinary optimal transport problem with cost function
[TABLE]
is equivalent to the secondary variational problem (3). More precisely, we have the following lemma.
Lemma 3.3**.**
Let and be as above. Then, for any , is a solution of (SVP1) if and only if it is a solution of (SVP2).
By virtue of Lemma 3.3, we see that for any , there exists a -cyclically monotone set on which concentrates. We also observe that for any , takes finite values on . Furthermore, the -cyclical monotonicity is a weak version of the monotonicity. More generally, we have the following. In the proof, we use the condition on geodesics for .
Proposition 3.4**.**
Let be a -Lipschitz continuous function. Define a extended-real valued function by
[TABLE]
Let be a -cyclically monotone set on which is finite. Then is restrictedly monotone.
Proof.
By the -cyclical monotonicity of and the finiteness of , we have
[TABLE]
Due to the -Lipschitz continuity of , it suffices to prove that
[TABLE]
and
[TABLE]
Note that the argument is not symmetric. We first prove (3.1). Since and are aligned, we have
[TABLE]
Combining (3.2) with the -Lipschitz continuity of yields
[TABLE]
In particular, we have
[TABLE]
Combining (3.3) with the triangle inequality, we arrive at
[TABLE]
which proves (3.1). We next prove (3.0). By using the triangle inequality for and , we obtain
[TABLE]
The proof is completed. ∎
Next, we construct a subset of , which plays a crucial role in the proof of the main theorem. We assume that the support of and is compact, respectively. We denote by the set of all compactly supported Borel probability measures on . Let be a positive real number. We define a functional by
[TABLE]
where is the the base 2 logarithm of the doubling constant of and , . The functional is lower semi-continuous with respect to the weak convergence of measures. We denote by the set of all minimizers of whose first marginal is :
[TABLE]
We observe that the set is not empty. is defined to be the set of all cluster points of any sequence . By the compactness of the support of and , we see that is not empty. Furthermore, we see that any measure in solves (3).
Lemma 3.5**.**
For any , let . Then the second marginal of converges to weakly as . Moreover, every limit point of is an element of .
Proof.
For every natural number , we take a maximal -net of . Put . By Lemma 2.10, we have for any . Since is finite, there exists a Borel measurable nearest point projection . Pick and fix a transport plan . Set , where the map is defined by . The measure is a transport plan from to . Since , we have
[TABLE]
Since the map is a nearest point projection to , we have
[TABLE]
and
[TABLE]
so we obtain
[TABLE]
In particular, this yields that
[TABLE]
By multiplying and letting , we have
[TABLE]
Since is arbitrary, converges to weakly as . By letting , (3.4) yields
[TABLE]
Letting yields
[TABLE]
Since , we have . Moreover, by using
[TABLE]
[TABLE]
and (3.4), we get
[TABLE]
In particular, by letting and , we obtain
[TABLE]
which leads us to
[TABLE]
Since , we have . ∎
We consider restricted measures and interpolated measures. For a Borel measure on and a Borel set of positive measure, we denote by the restriction of to . We interpolate measures along line segments in . For , we put
[TABLE]
Lemma 3.6**.**
Let be a Borel set, let and . Denote by and the first and the second marginal of respectively. Then, is absolutely continuous with respect to and we have the following (1), (2) and (3).
- (1)
The measure is a -optimal transport plan from to . 2. (2)
For any , the interpolated measure is absolutely continuous with respect to . 3. (3)
If the density of is essentially bounded, then so is the density of . In this case, we have
[TABLE]
Proof.
The absolute continuity of is clear. The assertion (i) follows from the stability of optimality under the restriction of measures. Let us prove (ii). If , then the assertion is trivial. We may assume . Let be the support of and put and . By a simple calculation, for any Borel set ,
[TABLE]
where we have used the translation invariance of in the last inequality. We now claim that the family is pairwise disjoint for any . Assume the contrary. Then, there exist two points such that
[TABLE]
From this, we have
[TABLE]
and
[TABLE]
by the strict convexity of . Exchanging and , we also have
[TABLE]
On the other hand, the -cyclical monotonicity of yields
[TABLE]
This is a contradiction and the assertion (iii) follows. ∎
Lemma 3.7**.**
Let , , be Borel probability measures with a common first marginal. If converges to weakly, then for any Borel set , the sequence converges to weakly.
Definition 3.8**.**
Let be a subset. We put
[TABLE]
and call it the transport set associated to .
Note that if is -compact, then so is . We also observe that if a measure is concentrated on , then the measure is concentrated on for any . Now we recall the notion of density-regular points and regular points, introduced in [CP11].
Definition 3.9**.**
Let be a probability measure concentrated on a -compact set . Assume that the first marginal of is absolutely continuous with respect to and let be the density. A point is said to be density-regular if for any , there exists a point and a positive number such that
- •
,
- •
The point is a Lebesgue point and .
- •
, ,
where is the density of . We denote by the set of all density-regular points of .
Definition 3.10**.**
Let be a -compact set. A point is said to be regular if for any , is a Lebesgue point of , where the set is defined by
[TABLE]
Remark 3.11**.**
Any density-regular point is regular.
Lemma 3.12**.**
Under the same settings as in Definition 3.10, is concentrated on .
Proposition 3.13**.**
Let and let . Assume that the measure is absolutely continuous with respect to and let be the density of . Then, for any point with and positive real number ,
[TABLE]
Proof.
Define a Borel set by
[TABLE]
Since is a Lebesgue point of and , we have
[TABLE]
Fix a positive real number so that and
[TABLE]
hold for any . By putting , we have
[TABLE]
Put and let be the density of . By the definition of and , we have
[TABLE]
Lebesgue’s differentiation theorem tells
[TABLE]
for -almost all . Integrating this on , we obtain
[TABLE]
We also observe that for any , and , which yields
[TABLE]
Combining (3.7), (3.8) and (3.9), we reach
[TABLE]
Since , there exist a sequence of positive real numbers converging to [math] and a sequence of Borel probability measures converging to weakly. From Lemma 3.6 (iii), for any natural number , we have
[TABLE]
By Lemma 3.7, we know that the sequence converges to weakly. Since the -bound is stable under the weak convergence, we obtain
[TABLE]
The fact that is concentrated on implies that
[TABLE]
Combining this with (3.10), we arrive at
[TABLE]
which is equivalent to
[TABLE]
This completes the proof. ∎
The following theorem is the main result of this section and Theorem 1.1 follows from this.
Theorem 3.14**.**
Let . If the measure is absolutely continuous with respect to , then consists of a single element and the transport plan is induced by a map.
Proof.
Let . By virtue of Lemma 3.1 and Lemma 3.12, it suffices to prove that the set is concentrated on the graph of a map. Assume that there exist two points with . Then we have either or . By symmetry, we may assume . By continuity, for sufficiently small , any points , and satisfy
[TABLE]
On the other hand, by the -regularity of , we have
[TABLE]
and by the density-regularity of , we have
[TABLE]
Thus for sufficiently small , there exists a point such that
[TABLE]
This implies the existence of points , and such that and . From the restricted monotonicity, we have
[TABLE]
Since lies in the line segment , we have
[TABLE]
for some . Thus we obtain
[TABLE]
which is a contradiction. ∎
4. Hilbert geometries
In this section, we recall the notion of Hilbert geometries, introduced by Hilbert [Hilbert1895] himself. We refer to [Papa14] for details.
Let be a bounded convex open domain. For any distinct two points , the Hilbert metric from to is given by
[TABLE]
where the points and are the intersection of the affine straight line passing through and and the boundary (see Figure 1). The function defines a complete metric on and the metric space is called the Hilbert geometry for . The Hilbert geometry for a unit open disk coincides with the Cayley-Klein model of the hyperbolic geometry. If the boundary is -regular and strictly convex, then the Hilbert metric on is induced by the smooth Finsler structure , given by
[TABLE]
where the points and , are the intersection of the affine straight line passing through with the direction (see Figure 1). In this case, the Finsler manifold has the constant flag curvature (cf. [Okada83]).
For any bounded convex open set , the Hilbert metric satisfies (i) and (ii) of Assumption A in Section 1. To verify (iii), we use the following result, proved by Ohta [Ohta13].
Theorem 4.1** ([Ohta13]).**
Let be a bounded convex open domain and the Hilbert metric. Then the metric-measure space satisfies the curvature-dimension condition for
[TABLE]
The curvature-dimension condition , which we do not explain in this paper, is a generalization of the condition that the weighted Ricci curvature is bounded below by and the dimension is bounded above by . If a metric-measure space satisfies the curvature-dimension condition for and finite , then the base measure satisfies the local doubling property. In particular, the Hilbert metrics satisfy (iii) of Assumption A, so we obtain Corollary 1.2.
References
