Approximate Monge solutions continuously depending on the parameter
S.N. Popova 111Moscow Institute of Physics and Technology; National Research University Higher School of Economics.
Abstract.
We consider Kantorovich optimal transportation problem in the case where the cost function and marginal distributions continuously depend on a parameter with
values in a metric space.
We prove the existence of approximate optimal Monge mappings continuous with respect to the parameter.
Keywords: optimal transportation problem, Kantorovich problem, Monge problem, continuity with respect to a parameter.
1. Introduction
We recall that, given two Borel probability measures μ and ν on topological spaces X and Y respectively
and a nonnegative Borel function h on X×Y, the Kantorovich optimal transportation problem concerns minimization of the integral
[TABLE]
over all measures σ in the set Π(μ,ν) consisting of Borel probability measures on X×Y
with projections μ and ν on the factors, that is, σ(A×Y)=μ(A) and σ(X×B)=ν(B)
for all Borel sets A⊂X and B⊂Y. The measures μ and ν are called marginal distributions
or marginals, and h is called a cost function.
In general, there is only infimum Kh(μ,ν), which may be infinite. If the cost function h is continuous (or at least lower semicontinuous)
and bounded and the measures μ and ν are Radon, then the minimum is attained and measures on which it is attained are called
optimal measures or optimal Kantorovich plans. The boundedness of h can be replaced by the assumption that
there is a measure in Π(μ,ν) with respect to which h is integrable.
The Monge problem for the same triple (μ,ν,h) consists in finding a Borel mapping T:X→Y taking μ into ν,
that is ν=μ∘T−1, (μ∘T−1)(B)=μ(T−1(B)) for all Borel sets B⊂Y, for which the integral
[TABLE]
is minimal. In general, there is only infimum Mh(μ,ν) (possibly, infinite),
but in many interesting cases there exist optimal Monge mappings. In any case,
Kh(μ,ν)≤Mh(μ,ν), but if both measures are Radon, μ has no atoms and is separable, and the cost function h is continuous,
then Kh(μ,ν)=Mh(μ,ν) (see [9], [20]). This equality implies that if there is a unique
solution T to the Monge problem, then the image of μ under the mapping x↦(x,T(x)) is an optimal
Kantorovich plan.
General information about Monge and Kantorovich problems can be found
in [1], [10], [21], [22], and [24].
We consider optimal transportation of measures on metric and topological spaces
in the case where the cost function ht and marginal distributions μt and νt depend on a parameter t with
values in a metric space. Kantorovich problems depending on a parameter were investigated in [24], [25], [18], [11], where the questions of measurability were studied.
We address the problem of continuity with respect to the parameter. Here the questions naturally arise about the continuity with respect to t of
the optimal cost Kht(μt,νt) and also about the possibility to select an optimal plan in Π(μt,νt) continuous with respect to
the parameter.
In [12], [13] it was proved that the cost of optimal transportation is continuous with respect to the parameter in the case of continuous
dependence of the cost function and marginal distributions on this parameter. Furthermore, it was shown that it is not always possible to select an optimal plan continuously depending on the parameter t.
However, it is possible to select approximate optimal plans continuous with respect to the parameter.
Continuous dependence on marginals was considered in [4], [23], and [16].
Similar problems may be studied for nonlinear cost functionals (see [17], [2], [3], [14], [19]), see also the recent survey [8].
Introduce the notation and terminology that will be used in this paper.
A nonnegative Radon measure on a topological space X is a bounded Borel measure μ≥0
such that for every Borel set B and every ε>0 there is a compact set K⊂B such that
μ(B\K)<ε (see [5]). If X is a complete separable metric space,
then all Borel measures are Radon.
The space Mr(X) of signed bounded Radon measures on X can be equipped with
the weak topology generated by the seminorms
[TABLE]
where f is a bounded continuous function.
A set M of nonnegative Radon measures on a space X is called uniformly tight,
if for every ε>0 there exists a compact set K⊂X such that
μ(X\K)<ε for all μ∈M.
Let (X,dX) and (Y,dY) be metric spaces. The space X×Y is equipped with the metric
[TABLE]
The weak topology on the spaces of Radon probability measures Pr(X), Pr(Y), Pr(X×Y)
is metrizable by the corresponding Kantorovich–Rubinshtein metrics dKR (also called the
Fortet–Mourier metrics, see [6]) defined by
[TABLE]
where Lip1 is the space of 1-Lipschitz functions. If X is complete, then
(Pr(X),dKR) is also complete and if X is Polish, then Pr(X) is also Polish.
In this paper we study the existence of approximate optimal Monge mappings continuous with respect to the parameter.
Section 2 addresses the case where the measures μ∈Pr(X) and ν∈Pr(Y) are fixed and
h:X×Y×T→[0,∞) is a continuous cost function. In Section 3 we assume that the measure μ∈Pr(X) is fixed and the measures νt∈Pr(Y) continuously depend on t in the weak topology.
We prove that there exist approximate Monge solutions Ttε such that Ttε is continuous in t
in the sense of convergence μ-a.e.: if tn→t as n→∞,
then Ttnε→Ttε μ-a.e. We also generalize this result to the case where the measures μt are continuous in t in the total variation norm and the measures νt are continuous in t in the weak topology.
2. The Monge problem with fixed marginals
In [12] the question was addressed whether it is possible to select an optimal plan continuously depending on the parameter t.
The examples were constructed which show that such a choice is not always possible.
However, the situation improves for approximate optimal plans.
Given ε>0, a measure σ∈Π(μ,ν) will be called
ε-optimal for the cost function h if
[TABLE]
Theorem 2.1** ([12]).**
Let X, Y be complete metric spaces. Let T be a metric space, and for every t∈T
we are given measures μt∈Pr(X) and νt∈Pr(Y) such that
the mappings t↦μt and t↦νt are continuous in the weak topology
(which is equivalent to the continuity in the Kantorovich–Rubinshtein metric). Suppose also
that there is a continuous nonnegative function (t,x,y)↦ht(x,y).
Suppose that for every t there exist nonnegative Borel functions at∈L1(μt) and bt∈L1(νt)
such that
[TABLE]
Then one can select ε-optimal measures σtε∈Π(μt,νt) for the cost functions ht
such that they will be continuous in t in the weak topology for every fixed ε>0.
If for every t there is a unique optimal plan σt, then it is continuous
in t.
In this paper we strengthen the result from [12] looking at approximate optimal Monge mappings continuously depending on the parameter.
First, we consider the particular case where the marginals μ∈Pr(X), ν∈Pr(Y) are fixed and cost functions ht depend on the parameter t.
We prove the following result on the existence of approximate optimal Monge mappings continuously depending on the parameter t.
Theorem 2.2**.**
Let X,Y be completely regular topological spaces. Let μ be a non-atomic Radon probability measure on X, let ν be a Radon probability measure on Y, and the measures μ and ν are concentrated on countable unions of metrizable compact sets (i.e. we may assume that X and Y are Souslin spaces). Let T be a metric space, h:X×Y×T→[0,∞) be a continuous function such that h(x,y,t)≤at(x)+bt(y), where at∈L1(μ), bt∈L1(ν) and
[TABLE]
Then for any ε>0 one can select ε-optimal Monge mappings Ttε for the cost functions ht
such that Ttε is continuous in t in the sense of convergence μ-a.e.: if tn→t as n→∞, then Ttnε→Ttε μ-a.e.
Proof.
We first consider the case where the function h is bounded. We may assume that h≤1.
Let ε>0. Set ε1=ε/5. Let us take a metrizable compact set K~1⊂X such that μ(X∖K~1)<ε1/2.
Since the measure μ is non-atomic and the compact set K~1 is metrizable,
the measure space (K~1,μ∣K~1) is almost homeomorphic to
([0,μ(K~1)],λ), where
λ is Lebesgue measure (see [5, Theorem 9.6.3]). Let φ:[0,μ(K~1)]→K~1 be an almost homeomorphism. Then there exists a compact set
S⊂[0,μ(K~1)] such that
0<λ([0,μ(K~1)]∖S)<ε1/2 and φ∣S is a homeomorphism.
Denote K1=φ(S). Then K1 is a metrizable compact set and the measure space (K1,μ∣K1) is homeomorphic to (S,λ). Moreover, we have
[TABLE]
Let us take a metrizable compact set K2⊂Y such that ν(Y∖K2)≤μ(X∖K1).
Let dK1 be the metric generating the topology on K1.
Let us prove that there exists a continuous (strictly positive) function δ:T→(0,+∞) such that for any
x1,x2∈K1, y∈K2, t∈T we have ∣h(x1,y,t)−h(x2,y,t)∣<ε1 if dK1(x1,x2)<δ(t). Since h is continuous on K1×K2×T, it follows that for any t0∈T there exists a real number
κt0>0 and an open neighbourhood Wt0⊂T
(t0∈Wt0) such that ∣h(x1,y,t)−h(x2,y,t)∣<ε1 for any x1,x2∈K1 with dK1(x1,x2)<κt0 and for any y∈K2, t∈Wt0.
The metric space T posseses a locally finite continuous partition of unity {ψα,α∈A} subordinated to the open cover {Wt,t∈T}, i.e. a set of continuous functions ψα, α∈A, such that 0≤ψα≤1 for any α∈A, suppψα⊂Wτ(α) for some τ(α)∈T, for every point t∈T there exists a neighbourhood W such that W∩suppψα=∅ for at most finite number of indices α∈A, and ∑αψα(t)=1.
Set
[TABLE]
Then the function δ(t) is continuous, since for any point t∈T there exists a neighbourhood W such that δ(t) is equal to the sum of a finite number of continuous functions on W. Let us show that the function δ(t) satisfies the required condition. Fix t0∈T. Let α1,…,αN be all indices from the set A such that ψαi(t0)=0. Then t0∈Wτ(αi) for all i∈{1,…,N}.
The equality ∑αψα(t0)=1 implies that 0<δ(t0)≤max(κτ(α1),…,κτ(αN)). Therefore, by the definition of the numbers κt we have
∣h(x1,y,t0)−h(x2,y,t0)∣<ε1 if x1,x2∈K1, dK1(x1,x2)<δ(t0), y∈K2.
Let us build a partition
[TABLE]
satisfying the following properties:
for any j∈N the mapping t↦ISj(t) (where IB denotes the indicator function of a set B) is continuous in the sense of convergence λ-a.e., that is, for any sequence tn→t, n→∞, we have
ISj(tn)→ISj(t) λ-a.e.,
for any j∈N and for any t∈T we have
∣h(φ(s1),y,t)−h(φ(s2),y,t)∣<ε1 for all s1,s2∈Sj(t), y∈K2.
Since the mapping φ is continuous, as proven above, there exists a continuous function δ~:T→(0,+∞) such that for any s1,s2∈S, y∈K2, t∈T we have ∣h(φ(s1),y,t)−h(φ(s2),y,t)∣<ε1 if ∣s1−s2∣≤δ~(t). Set
[TABLE]
Then S=⨆j=1∞Sj(t). From the definition of the function δ~(t) it follows that the property 2) is satisfied. Let us prove that the property 1) is fulfilled. Let tn→t as n→∞. For any j∈N let us show that ISj(tn)→1 for all s∈S∩((j−1)δ~(t),jδ~(t)).
Fix s∈S, s∈((j−1)δ~(t),jδ~(t)).
Then for all sufficiently large numbers n it holds that s∈((j−1)δ~(tn),jδ~(tn)), since
δ~(tn)→δ~(t). Therefore, ISj(tn)(s)=1 for all sufficiently large n.
Thus for all s∈S∩((j−1)δ~(t),jδ~(t)) and for all i∈N
we have ISi(tn)(s)→ISi(t)(s). Therefore, the property 1) is satisfied.
Set Xj(t)=φ(Sj(t)). Then K1=⨆j=1∞Xj(t). We have
IXj(tn)→IXj(t) μ-a.e., if tn→t, n→∞ (this also implies that
μ(Xj(tn)△Xj(t))→0 as n→∞). Furthermore, for any j∈N and for any t∈T we have ∣h(x1,y,t)−h(x2,y,t)∣<ε1 for all x1,x2∈Xj(t), y∈K2.
Consider the Kantorovich problem with the cost function h(x,y,t) and measures μ∣K1, αν∣K2, where α=μ(K1)/ν(K2)≤1.
By Theorem 2.1 there exist ε-optimal measures πt∈Π(μ∣K1,αν∣K2) for the cost function h(x,y,t) such that πt is continuous in t in the weak topology.
Let νtj be the projection of the measure IXj(t)πt on Y, j∈N. Let us show that νtj is continuous in t
in the weak topology. Let tn→t as n→∞, we show that the measures νtnj converge weakly to νtj. We have ∥IXj(tn)πtn−IXj(t)πtn∥=μ(Xj(tn)△Xj(t))→0, where ∥⋅∥ is the total variation norm. Therefore, it is sufficient to prove that the measures IXj(t)πtn converge weakly to IXj(t)πt.
Let g∈Cb(X×Y), ∣g∣≤1, we show that
[TABLE]
Fix δ>0. Take a compact set Fj and an open set Uj such that Fj⊂Xj(t)⊂Uj and μ(Uj∖Fj)<δ. There exists a continuous function f:X→R such that f=1 on Fj, f=0 outside Uj, 0≤f≤1. Then
[TABLE]
since πtn converge weakly to πt.
Furthermore, we have ∣IXj(t)−f(x)∣≤IUj∖Fj. Therefore,
[TABLE]
From above we obtain
[TABLE]
Hence ∫g(x,y)IXj(t)πtn(dxdy)−∫g(x,y)IXj(t)πt(dxdy)→0.
Therefore, the measures νtnj converge weakly to νtj, i.e. the mapping t↦νtj is continuous in the weak topology.
Since the compact set K2 is metrizable, it posseses the strong Skorohod property (see [6]), that is, for any probability measure
η on K2 there exists a mapping ξη:[0,1]→K2 such that
λ∘ξη−1=η, where λ is Lebesgue measure on [0,1], and if measures ηn converge weakly to
η, then ξηn→ξη λ-a.e.
Since the mapping t↦νtj is continuous in the weak topology for any j∈N, by the strong Skorohod property
for any j∈N there exists a mapping ξt,j:[0,λ(Sj(t))]→K2 such that
[TABLE]
and ξt,j is continuous in t in the sense of convergence λ-a.e. Set
[TABLE]
Then the mapping t↦Ftj is continuous in t in the topology of pointwise convergence: if tn→t as n→∞, then Ftnj(s)→Ftj(s) for any s∈S.
Indeed, ∣Ftnj(s)−Ftj(s)∣≤λ(Sj(tn)△Sj(t))→0 as n→∞.
Set
[TABLE]
Then μ∣Xj(t)∘Tt−1=νtj, since φ−1:K1→S is a homeomorpism which transfers the measure μ∣Xj(t) to the measure λ∣Sj(t) and the mapping Ftj transfers the measure
λ∣Sj(t) to the measure λ∣[0,λ(Sj(t))].
Therefore, μ∣K1∘Tt−1=αν∣K2.
Since the measure μ is non-atomic, there exists a mapping T:X∖K1→Y such that
[TABLE]
Set Tt(x)=T(x) for any x∈X∖K1. Then μ∘Tt−1=ν.
Let us show that the mapping Tt is continuous in t in the sense of convergence μ-a.e. Let tn→t, n→∞. Prove that for any j∈N
[TABLE]
For μ-a.e. x∈Xj(t) it holds that x∈Xj(tn) for all sufficiently large n,
since IXj(tn)→IXj(t) μ-a.e. Therefore, for μ-a.e. x∈Xj(t) we have for all sufficiently large n
[TABLE]
since
Ftnj(φ−1(x))→Ftj(φ−1(x)) due to continuity of Ftj in t
and ξtn,j→ξt,j λ-a.e.
Thus μ({x∈X:Ttn(x)→Tt(x)})=0 and the mapping Tt is continuous in t in the sense of convergence μ-a.e.
Let us show that the mapping Tt is ε-optimal for every t∈T. Fix t∈T.
For any j∈N we have (fix some x0∈Xj(t))
[TABLE]
since μ∣Xj(t)∘Tt−1=νtj and
∣ht(x,y)−ht(x0,y)∣<ε1 for any x∈Xj(t), y∈K2.
Similarly
[TABLE]
Therefore,
[TABLE]
Summing over j∈N, we obtain the inequality
[TABLE]
Moreover, ∫X∖K1ht(x,Ttx)μ(dx)≤μ(X∖K1)<ε1.
Hence
[TABLE]
Let σ∈Π(μ,ν) be an optimal measure in the Kantorovich problem with the cost function ht(x,y) and measures μ,ν. Let μ1 and ν1 be the projections of the measure IK1×K2σ on X and Y respectively.
Set σ~=αIK1×K2σ+ζ, where
ζ∈Π(μ∣K1−αμ1,αν∣K2−αν1).
Then σ~∈Π(μ∣K1,αν∣K2) and hence
[TABLE]
We have ν(K2)−ν1(K2)=σ((X∖K1)×K2)≤μ(X∖K1)<ε1.
Therefore,
[TABLE]
So the mapping Tt is 5ε1-optimal for any t∈T.
Consider now the general case. Let h(x,y,t)≤at(x)+bt(y), where the functions at∈L1(μ) and bt∈L1(ν) satisfy (2.2). Let N∈N. As proven above, for the bounded continuous function min(h,N) there exist
ε/2-optimal Monge mappings Tt which are continuous in t in the sense of convergence μ-a.e.
For any measure σ∈Π(μ,ν) we have
[TABLE]
Take N∈N such that ∫at≥N/2atdμ+∫bt≥N/2btdν<ε/4.
Then the mappings Tt are ε-optimal for the cost function h.
∎
3. The Monge problem with marginals depending on the parameter
Assume that the measure μ∈Pr(X) is fixed and the measures νt∈Pr(Y) continuously depend on t in the weak topology.
We show that one can select approximate optimal Monge mappings continuously depending on the parameter t in the sense of convergence μ-a.e.
Theorem 3.1**.**
Let X,Y be complete metric spaces and let
μ be a non-atomic Radon probability measure on X.
Let T be a metric space, the mapping t↦νt, T→Pr(Y), is continuous in the weak topology,
h:X×Y×T→[0,∞) is a continuous function such that
h(x,y,t)≤at(x)+bt(y), where at∈L1(μ), bt∈L1(νt) and
[TABLE]
Then for any ε>0 one can select ε-optimal Monge mappings Ttε for the cost functions ht and measures μ, νt (i.e. μ∘(Ttε)−1=νt for every t∈T) such that
Ttε is continuous in t in the sense of convergence μ-a.e.: if tn→t as n→∞, then Ttnε→Ttε μ-a.e.
Proof.
The assertion of Theorem 3.1 reduces to the case where h≤1.
Let ε>0. Set ε1=ε/6.
Since the measure μ is non-atomic, there exists a compact set K1⊂X such that μ(X∖K1)<ε1 and (K1,μ∣K1) is homeomorphic to (S,λ), where S⊂[0,1] is a compact set and
λ is Lebesgue measure. Let φ:S→K1 be a homeomorphism, λ∣S∘φ−1=μ∣K1. Let dX and dY be the metrics of X and Y respectively.
Let us prove that there exists a continuous (strictly positive) function δ:T→(0,+∞) and a collection of closed sets Y(t)⊂Y, t∈T, such that for any t∈T we have νt(Y∖Y(t))<ε1 and
∣h(x1,y,t)−h(x2,y,t)∣<ε1 for all x1,x2∈K1 with dX(x1,x2)<δ(t) and for all y∈Y(t).
For any t∈T take a compact set K2(t)⊂Y such that νt(Y∖K2(t))<ε1. Since h is continuous on K1×Y×T, it follows that for any t0∈T there exist real numbers κ(t0)>0, r(t0)>0 and an open neighbourhood W~t0⊂T
(t0∈W~t0) such that ∣h(x1,y,t)−h(x2,y,t)∣<ε1 for any x1,x2∈K1 with dX(x1,x2)<κ(t0) and for any y∈K2(t0)r(t0) (where Br={y∈Y:dY(y,B)≤r} is a closed r-neighbourhood of a set B in the metric space Y), t∈W~t0. Since the mapping t↦νt is continuous in the weak topology and
νt0(Y∖K2(t0))<ε1, there exists an oper neighbourhood Wt0′⊂T (t0∈Wt0′) such that νt(Y∖K2(t0)r(t0))<ε1 for any t∈Wt0′.
Set Wt0=W~t0∩Wt0′.
The metric space T posseses a locally finite continuous partition of unity {ψα,α∈A} subordinated to the open cover {Wt,t∈T}, i.e. a set of continuous functions ψα, α∈A, such that 0≤ψα≤1 for any α∈A, suppψα⊂Wτ(α) for some τ(α)∈T, for every point t∈T there exists a neighbourhood W such that W∩suppψα=∅ for at most finite number of indices α∈A, and ∑αψα(t)=1.
Set
[TABLE]
Then the function δ(t) is continuous, since for any point t∈T there exists a neighbourhood W such that δ(t) is equal to the sum of a finite number of continuous functions on W.
For any t∈T choose an index α(t) from the finite set {α∈A:ψα(t)=0} for which
the value κ(τ(α)) is maximal.
Set
[TABLE]
Let us show that the function δ(t) and the sets Y(t), t∈T, satisfy the required condition.
Fix t0∈T. Let α1,…,αN be all indices from the set A such that ψαi(t0)=0. Then t0∈Wτ(αi) for all i∈{1,…,N}.
Since ∑αψα(t0)=1, we have δ(t0)≤max(κ(τ(α1)),…,κ(τ(αN)))=κ(τ(α(t0))). Therefore, by the definition of the numbers κ(t) we obtain that
∣h(x1,y,t0)−h(x2,y,t0)∣<ε1 if x1,x2∈K1, dX(x1,x2)<δ(t0), y∈Y(t0).
Moreover, νt0(Y∖Y(t0))<ε1, because t0∈Wτ(α(t0)).
Since the mapping φ is continuous, as proven above, there exists a continuous function δ~:T→(0,+∞) and a collection of closed sets Y(t)⊂Y, t∈T, such that for any t∈T we have νt(Y∖Y(t))<ε1 and
∣h(φ(s1),y,t)−h(φ(s2),y,t)∣<ε1 for all s1,s2∈S with ∣s1−s2∣≤δ~(t) and for all y∈Y(t).
As described in the proof of Theorem 2.2, we can construct a partition S\leavevmode =\leavevmode ⨆j=1∞Sj(t) satisfying the following properties:
for any j∈N the mapping t↦ISj(t) is continuous in the sense of convergence λ-a.e., that is, for any sequence tn→t, n→∞, we have
ISj(tn)→ISj(t) λ-a.e.,
for any j∈N and for any t∈T we have
∣h(φ(s1),y,t)−h(φ(s2),y,t)∣<ε1 for all s1,s2∈Sj(t), y∈Y(t).
Set Xj(t)=φ(Sj(t)). Then K1=⨆j=1∞Xj(t). We have
IXj(tn)→IXj(t) μ-a.e., if tn→t, n→∞ (this also implies that
μ(Xj(tn)△Xj(t))→0 as n→∞). Furthermore, for any j∈N and for any t∈T we have ∣h(x1,y,t)−h(x2,y,t)∣<ε1 for all x1,x2∈Xj(t), y∈Y(t).
Set X0(t)=X∖K1.
By Theorem 2.1 there exist ε1-optimal measures πt∈Π(μ,νt) for the cost function h(x,y,t) such that πt is continuous in t in the weak topology. Let νtj be the projection of the measure IXj(t)πt on Y, j∈N∪{0}. Then νtj is continuous in t in the weak topology. Indeed, if tn→t as n→∞, then the measures νtnj converge weakly to νtj, since
the measures πtn converge weakly to πt and μ(Xj(tn)△Xj(t))→0.
The complete metric space Y posseses the strong Skorohod property for Radon measures (see [6]),
that is, for any Radon probability measure η on Y there exists a mapping ξη:[0,1]→Y such that
λ∘ξη−1=η, where λ is Lebesgue measure on [0,1], and if measures ηn converge weakly to η, then ξηn→ξη λ-a.e.
Since the mapping t↦νtj is continuous in the weak topology for any j∈N∪{0}, by the strong Skorohod property for any j∈N∪{0} there exists a mapping ξt,j:[0,μ(Xj(t))]→Y (where μ(Xj(t))=λ(Sj(t)) for any j∈N and μ(X0(t))=μ(X∖K1)) such that
[TABLE]
and ξt,j is continuous in t in the sense of convergence λ-a.e.
Let
[TABLE]
The mapping t↦Ftj is continuous in t in the topology of pointwise convergence: if tn→t as n→∞, then
Ftnj(s)→Ftj(s) for any s∈S. Indeed, ∣Ftnj(s)−Ftj(s)∣≤λ(Sj(tn)△Sj(t))→0 as n→∞.
Set
[TABLE]
Then μ∣Xj(t)∘Tt−1=νtj, since φ−1:K1→S is a homeomorphism which transfers the measure μ∣Xj(t) to the measure λ∣Sj(t) and the mapping Ftj transfers
λ∣Sj(t) to the measure λ∣[0,λ(Sj(t))].
Since the measure μ is non-atomic, there exists a mapping
F:X∖K1→[0,μ(X∖K1)] such that
[TABLE]
Set Tt(x)=ξt,0(F(x)) for any x∈X∖K1.
Then μ∣X∖K1∘Tt−1=νt0.
Therefore, μ∘Tt−1=νt for any t∈T.
Let us show that the mapping Tt is continuous in t in the sense of convergence μ-a.e. Let tn→t, n→∞. Prove that for any j∈N
[TABLE]
Indeed, for μ-a.e. x∈Xj(t) it holds that x∈Xj(tn) for all sufficiently large n,
since IXj(tn)→IXj(t) μ-a.e. Therefore, for μ-a.e. x∈Xj(t) for all sufficiently large n we have
[TABLE]
since Ftnj(φ−1(x))→Ftj(φ−1(x)) due to the continuity of Ftj in t
and ξtn,j→ξt,j λ-a.e.
Moreover,
[TABLE]
Therofore, μ({x∈X:Ttn(x)→Tt(x)})=0 and the mapping Tt is continuous in t in the sense of convergence μ-a.e.
Let us prove that the mapping Tt is ε-optimal for any t∈T. Fix t∈T.
For any j∈N we have (fix some x0∈Xj(t))
[TABLE]
since μ∣Xj(t)∘Tt−1=νtj and
∣ht(x,y)−ht(x0,y)∣<ε1 for any x∈Xj(t), y∈Y(t).
Similarly
[TABLE]
Therefore,
[TABLE]
Summing over j∈N, we obtain the inequality
[TABLE]
Furthermore,
[TABLE]
Therefore,
[TABLE]
Thus the mapping Tt is 6ε1-optimal for every t∈T.
∎
Corollary 3.2**.**
The statement of Theorem 3.1 holds true if we replace the condition that X is a complete metric space by the condition that X is a completely regular topological space and the measure μ is concentrated on a countable union of metrizable compact sets (i.e. we may assume that X is a Souslin space).
Proof.
Following the proof of Theorem 3.1 we construct the sets Y(t) and partitions
K1=⨆j=1∞Xj(t), t∈T. According to Theorem 2.1, consider ε1-optimal measures πt\leavevmode ∈\leavevmode Π(μ∣K1,μ(K1)ν) in the Kantorovich problem for the measures μ∣K1 and μ(K1)ν with the cost function h(x,y,t) such that πt is continuous in t in the weak topology. Set νtj=IXj(t)πt for any j∈N. Then νtj is continuous in t in the weak topology.
Define the mapping Tt on K1 in the same way as in the proof of Theorem 3.1, then we have
μ∣K1∘Tt−1=μ(K1)νt. Take a mapping F:X∖K1→[0,μ(X∖K1)] such that
[TABLE]
Set Tt(x)=ξt(F(x)) for any x∈X∖K1, where ξt:[0,μ(X∖K1)]→Y,
[TABLE]
and ξt is continuous in t in the sense of convergence λ-a.e. Then μ∘Tt−1=νt, Tt is continuous in t in the sense of convergence μ-a.e.
and Tt is ε-optimal for every t∈T.
∎
Consider now the most general case where the measures μt∈Pr(X) and νt∈Pr(Y) continuously depend on t.
Assuming that the measures μt are continuous in t in the total variation norm we prove the existence
of approximate optimal Monge mappings continuously depending on the parameter t in the sense of convergence μt-a.e.
Theorem 3.3**.**
Let X be a complete separable metric space and let Y be a complete metric space. Let T be a metric space,
the mapping t↦νt, T→Pr(Y), is continuous in the weak topology, the mapping t↦μt, T→Pr(X), is continuous in the total variation norm, and the measures μt are non-atomic for all t∈T.
Let
h:X×Y×T→[0,∞) be a continuous function such that h(x,y,t)≤at(x)+bt(y), where at∈L1(μt), bt∈L1(νt) and
[TABLE]
Then for any ε>0 one can select ε-optimal Monge mappings Ttε for the cost functions ht and measures μt, νt (i.e. μt∘(Ttε)−1=νt for every t∈T) such that
Ttε is continuous in t in the sense of convergence μt-a.e.: if tn→t as n→∞, then Ttnε→Ttε μt-a.e.
Proof.
The assertion of Theorem 3.3 reduces to the case where h≤1.
Let ε>0. Set ε1=ε/7.
Since every complete separable metric space is homeomorphic to a Gδ-set in [0,1]∞ (see [15]), we may assume that X⊂[0,1]∞.
The compact metrizable space [0,1]∞ is a continuous image of the Cantor set C, i.e. there exists a surjective continuous mapping f:C→[0,1]∞. By measurable selection theorem (see [5]) there exists a Borel measurable mapping g:[0,1]∞→C such that f(g(x))=x for all x∈[0,1]∞.
Set γt=μt∘g−1, t∈T. Then μt=γt∘f−1 for every t∈T and the measures γt are non-atomic. Moreover, the mapping t↦γt is continuous in the total variation norm, since
∥γt−γτ∥=∥(μt−μτ)∘g−1∥≤∥μt−μτ∥ for any t,τ∈T.
Set S=g(X). Then S is a Borel subset of C.
Let dX and dY be the metrics on X and Y respectively.
Let us prove that there exists a continuous (strictly positive) function δ:T→(0,+∞) and a collection of compact sets X(t)⊂X and closed sets Y(t)⊂Y, t∈T, such that for any t∈T we have
μt(X∖X(t))<ε1, νt(Y∖Y(t))<ε1 and
∣h(x1,y,t)−h(x2,y,t)∣<ε1 for any x1,x2∈X(t) with dX(x1,x2)<δ(t) and for any
y∈Y(t).
For every t∈T take compact sets K1(t)⊂X and K2(t)⊂Y such that
μt(X∖K1(t))<ε1 and νt(Y∖K2(t))<ε1.
Since h is continuous on X×Y×T, for any t0∈T there exist real numbers κ(t0)>0, r(t0)>0 and an open neighbourhood W~t0⊂T
(t0∈W~t0) such that ∣h(x1,y,t)−h(x2,y,t)∣<ε1 for any x1,x2∈K1(t0) with dX(x1,x2)<κ(t0) and for any y∈K2(t0)r(t0) (where Br={y∈Y:dY(y,B)≤r} is a closed r-neighbourhood of a set B in the metric space Y), t∈W~t0. Since the mapping t↦νt is continuous in the weak topology and νt0(Y∖K2(t0))<ε1, there exists an open neighbourhood Wt0′⊂T (t0∈Wt0′) such that νt(Y∖K2(t0)r(t0))<ε1 for any t∈Wt0′.
Since the mapping t↦μt is continuous in the total variation norm, there exists an open neighbourhood Wt0′′⊂T (t0∈Wt0′′) such that
μt(X∖K1(t0))<ε1 for any t∈Wt0′′.
Set Wt0=W~t0∩Wt0′∩Wt0′′.
The metric space T posseses a locally finite continuous partition of unity {ψα,α∈A} subordinated to the open cover {Wt,t∈T}, i.e. a set of continuous functions ψα, α∈A, such that 0≤ψα≤1 for any α∈A, suppψα⊂Wτ(α) for some τ(α)∈T, for every point t∈T there exists a neighbourhood W such that W∩suppψα=∅ for at most finite number of indices α∈A, and ∑αψα(t)=1.
Set
[TABLE]
Then the function δ(t) is continuous, since for any point t∈T there exists a neighbourhood W such that δ(t) is equal to the sum of a finite number of continuous functions on W.
For any t∈T choose an index α(t) from the finite set {α∈A:ψα(t)=0} for which
the value κ(τ(α)) is maximal.
Set
[TABLE]
Let us show that the function δ(t) and the sets X(t), Y(t), t∈T, satisfy the required condition.
Fix t0∈T. Let α1,…,αN be all indices from the set A such that ψαi(t0)=0.
Then t0∈Wτ(αi) for all i∈{1,…,N}.
Since ∑αψα(t0)=1, we have δ(t0)≤max(κ(τ(α1)),…,κ(τ(αN)))=κ(τ(α(t0))).
Therefore, by the definition of the numbers κ(t) we obtain that
∣h(x1,y,t0)−h(x2,y,t0)∣<ε1 if x1,x2∈X(t0), dX(x1,x2)<δ(t0), y∈Y(t0).
Moreover, μt0(X∖X(t0))<ε1 and νt0(Y∖Y(t0))<ε1,
because t0∈Wτ(α(t0)).
Since the mapping f is continuous, the function h(f(s),y,t) is continuous on S×Y×T. As proven above,
there exists a continuous function δ~:T→(0,+∞) and a collection of sets
S(t)⊂S, Y(t)⊂Y, t∈T, such that for any t∈T we have γt(S∖S(t))<ε1, νt(Y∖Y(t))<ε1 and
∣h(f(s1),y,t)−h(f(s2),y,t)∣<ε1 for all s1,s2∈S(t) with ∣s1−s2∣≤δ~(t) and for all y∈Y(t).
As described in the proof of Theorem 2.2, we can construct a partition S\leavevmode =\leavevmode ⨆j=1∞Sj(t) satisfying the following properties:
for any j∈N the mapping t↦ISj(t) is continuous in the sense of convergence γt-a.e., that is, for any sequence tn→t, n→∞, we have
ISj(tn)→ISj(t) γt-a.e.,
for any j∈N and for any t∈T we have
∣h(f(s1),y,t)−h(f(s2),y,t)∣<ε1 for all s1,s2∈S(t)∩Sj(t), y∈Y(t).
Set X(t)=f(S(t)) and Xj(t)=f(Sj(t)), j∈N. Then X=⨆j=1∞Xj(t). We have
IXj(tn)→IXj(t) μt-a.e.., if tn→t, n→∞ (this also implies that
μt(Xj(tn)△Xj(t))→0 as n→∞). Furthermore, for any j∈N and for any t∈T we have ∣h(x1,y,t)−h(x2,y,t)∣<ε1 for all x1,x2∈X(t)∩Xj(t), y∈Y(t).
By Theorem 2.1 there exist ε1-optimal measures πt∈Π(μt,νt) for the cost function h(x,y,t) such that πt is continuous in t in the weak topology. Let νtj be the projection of the measure IXj(t)πt on Y, j∈N.
Let us show that νtj is continuous in t in the weak topology. Let tn→t as n→∞, we show that the measures νtnj converge weakly to νtj.
We have
[TABLE]
since the mapping t↦μt is continuous in the total variation norm.
Let us prove that the measures IXj(t)πtn converge weakly to IXj(t)πt.
Let ζ∈Cb(X×Y), ∣ζ∣≤1, we show that
[TABLE]
Fix δ>0. Take a compact set Fj and an open set Uj such that Fj⊂Xj(t)⊂Uj and μt(Uj∖Fj)<δ. There exist a continuous function χ:X→R such that χ=1 on Fj, χ=0 outside Uj, 0≤χ≤1.
Then
[TABLE]
since the measures πtn converge weakly to πt. Furthermore,
[TABLE]
since ∣IXj(t)−χ∣≤IUj∖Fj and ∣ζ∣≤1.
Therefore,
[TABLE]
Hence we obtain that ∫X×Yζ(x,y)IXj(t)πtn(dxdy)−∫X×Yζ(x,y)IXj(t)πt(dxdy)→0.
Therefore, the measures νtnj converge weakly to νtj, i.e. the mapping t↦νtj is continuous in t in the weak topology.
The complete metric space Y posseses the strong Skorohod property for Radon measures,
that is, for any Radon probability measure η on Y there exists a mapping ξη:[0,1]→Y such that
λ∘ξη−1=η, where λ is Lebesgue measure on [0,1], and if measures ηn converge weakly to η, then ξηn→ξη λ-a.e.
Since the mapping t↦νtj is continuous in the weak topology for any j∈N, by the strong Skorohod property for any j∈N there exists a mapping ξt,j:[0,μt(Xj(t))]→Y (where μt(Xj(t))=γt(Sj(t)) for any j∈N) such that
[TABLE]
and ξt,j is continuous in t in the sense of convergence λ-a.e.
Let
[TABLE]
The mapping t↦Ftj is continuous in t in the topology of pointwise convergence: if tn→t as n→∞, then Ftnj(s)→Ftj(s) for any s∈S.
Indeed,
[TABLE]
Set
[TABLE]
Then μt∣Xj(t)∘Tt−1=νtj, since the mapping g transfers the measure μt∣Xj(t) to the measure γt∣Sj(t) and the mapping Ftj transfers the measure
γt∣Sj(t) to the measure λ∣[0,μt(Xj(t))].
Therefore, μt∘Tt−1=νt for any t∈T.
Let us show that the mapping Tt is continuous in t in the sense of convergence μt-a.e.
Let tn→t, n→∞. Prove that for any j∈N
[TABLE]
Indeed, for μt-a.e. x∈Xj(t) it holds that x∈Xj(tn) for all sufficiently large n,
since IXj(tn)→IXj(t) μt-a.e.
Therefore, for μt-a.e. x∈Xj(t) for all sufficiently large n we have
[TABLE]
since Ftnj(g(x))→Ftj(g(x)) due to the continuity of Ftj in t
and ξtn,j→ξt,j λ-a.e.
Therofore, μt({x∈X:Ttn(x)→Tt(x)})=0 and the mapping Tt is continuous in t in the sense of convergence μt-a.e.
Let us prove that the mapping Tt is ε-optimal for any t∈T. Fix t∈T.
For any j∈N we have (fix some x0∈Xj(t)∩X(t))
[TABLE]
since μt∣Xj(t)∘Tt−1=νtj and
∣ht(x,y)−ht(x0,y)∣<ε1 for any x∈Xj(t)∩X(t), y∈Y(t).
Similarly
[TABLE]
Therefore,
[TABLE]
Summing over j∈N, we obtain that
[TABLE]
Therefore, the mapping Tt is 7ε1-optimal for every t∈T.
∎
Corollary 3.4**.**
The statement of Theorem 3.3 holds true in the case where X is a Souslin space.
Proof.
The Souslin space X is an image of a complete separable metric space X~ under a continuous surjective mapping
f:X~→X. By measurable selection theorem (see [5]) there exists a mapping g:X→X~ such that g is measurable with respect to the σ-algebra generated by Souslin sets and f(g(x))=x for all x∈X.
Set γt=μt∘g−1 for any t∈T. Then μt=γt∘f−1 and the measures
γt are non-atomic. The mapping t↦γt is continuous in the total variation norm,
since ∥γt−γτ∥=∥μt−μτ∥ for any t,τ∈T.
The function h(f(x~),y,t) is continuous on X~×Y×T.
Consider the Kantorovich problem with the cost function h(f(x~),y,t) and measures
γt, νt, t∈T.
By Theorem 3.3 there exist ε-optimal mappings T~t:X~→Y
such that T~t is continuous in t in the sense of convergence γt-a.e. Set Tt(x)=T~t(g(x)).
Then μt∘Tt−1=γt∘T~t−1=νt for any t∈T. The mapping t↦Tt is continuous in t in the sense of convergence μt-a.e. Indeed, if tn→t, n→∞, then
[TABLE]
Let us show that the mapping Tt is ε-optimal for any t∈T.
We have
[TABLE]
Let σ∈Π(μt,νt) be an optimal plan in the Kantorovich problem with the cost function h(x,y,t) and measures μt,νt.
Let σ~ be the image of the measure σ under the mapping (x,y)↦(g(x),y). Then σ~∈Π(γt,νt) and
[TABLE]
Therefore, the minimum in the Kantorovich problem with the cost function h(f(x~),y,t) and measures γt,νt equals the minimum in the Kantorovich problem with the cost function h(x,y,t) and measures μt,νt.
Therefore, the mapping Tt is ε-optimal.
∎