Variational characterization of the regularity of Monge-Brenier maps
Ali Suleyman Ustunel

TL;DR
This paper investigates the regularity of Monge-Brenier maps on Wiener spaces, demonstrating their Sobolev regularity through variational methods and large deviations theory, under specific measure assumptions.
Contribution
It introduces a variational approach to establish Sobolev regularity of Monge-Brenier maps in an abstract Wiener space setting, extending previous results.
Findings
Monge-Brenier maps are Sobolev regular under finite information hypothesis.
Variational methods can be used to analyze the regularity of optimal transport maps.
Results apply to both forward and backward Monge-Brenier maps.
Abstract
On an abstract Wiener space, assume that T is the solution of the quadratic Monge problem associated to the Wiener measure and a second one with a Radon-Nikodym derivative of exponential type. Under the finite information hypothesis, using a variational method, we prove that T minimizes a certain functional originating from the large deviations theory. Applying a variational method a la Euler, we obtain the Sobolev regularity of the backward Monge-Brenier map. A similar result also holds for the forward Monge-Brenier map.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometry and complex manifolds · Geometric Analysis and Curvature Flows · Nonlinear Partial Differential Equations
Variational Characterization of the Regularity of Monge-Brenier Maps
A. S. Üstünel
Abstract: Let be an abstract Wiener space, assume that is the solution of the Monge problem associated to the measures and . Under the finite information hypothesis, using a variational method, we prove that and this result implies the Sobolev regularity of the backward Monge-Brenier map. A similar result also holds for the forward Monge-Brenier map.
Keywords: Entropy, adapted perturbation of identity, Wiener measure, Monge and Monge Kantorovich problems, Monnge potential, Monge-Brenier map.
1. **Introduction **
Let be the probability measure defined by
[TABLE]
such that the relative entropy of w.r.t. the Wiener measure , denoted as is finite. Let be the set of the probability measures on whose first marginals are and the secones ones are . Consider the problem of minimization which defines also a strong Wasserstein distance between and :
[TABLE]
where denotes the Cameron-Martin norm. In the finite dimensional case this problem has been extensively studied since almost three centuries and we refer to the texts [14] and [20] for history and references and also to [3] and [13].
In the infinite dimensional case, where the cost function is very singular, in the sense that the set on which the cost function is finite has zero measure w.r.t. the product measure has been solved in a series of papers ([8, 9, 10]) and the answer can be summarized as follows: There exists a -convex function , in the Gaussian Sobolev space , called Monge potential or Monge-Brenier map such that the above infimum is attained at , i.e., the image of the measure under the map , where , where is the -extended derivative of in the direction of Cameron-Martin space. Moreover, there exists also a dual Monge potential , which has an -extended derivative in the direction of Cameron-Martin space, such that, the map satisfies , hence -a.s. and -a.s. The next important issue in this subject is to show the Sobolev regularity of the Monge-Brenier maps in such a way that one can write the Jacobian functions associated to the corresponding transformations and . In finite dimensional case this problem has been treated by several authors (cf. [4] and the references given in [20]). In the infinite dimensional case there are also some results (cf. [2, 6, 11] ) which are generalizations of the results given in [9, 10]. These results are generally the suitable extensions to the finite dimensional case of those which were developped especially by L. Caffarelli, though we have also given another method to calculate the Jacobian functions in infinite dimensions using the Itô calculus.
In this work we shall present a totally different method, namely, we shall prove the Sobolev regularity of the Monge-Brenier functions using the Calculus of variations. Let us begin by recalling a celebrated variational formula, which holds on any measurable space but we formulate on a Wiener space for the notational simplicity:
[TABLE]
where denotes the set of probability measures on , being the Borel sigma field of , are as described above. The infimum is attained at provided that is finite, cf. [17]. On the other hand, we know from [8] that there exists some , -convex function such that , where we use the same notation for the image of a point and of a measure under a measurable map (here the map under question is ). Consequently the following identity holds true:
[TABLE]
Therefore
[TABLE]
For this infimum to be finite we need that , which implies . Besides the right hand side of the inequality (1.2) is always greater than
[TABLE]
therefore we have equality between all these expressions:
Theorem 1**.**
Assume that , where and is a measurable function. Then the infimum
[TABLE]
is attained at the vector field , where is the unique (up to an additive constant) Monge potential such that and that the -norm of is equal to the Wasserstein distance between and :
[TABLE]
where denotes the set of probability measures on , whose first marginals are and the second ones are .
Note that if we could apply the variational principle above, namely, by taking the derivative of the functional at the minimizing vector field in any admissible direction, we would obtain the following relation:
[TABLE]
where denotes the Gaussian divergence, i.e., the adjoint of the derivative w.r.t. the Gaussian measure and this equation implies Sobolev regularity of . A similar method can be used for the dual Monge potential also. We shall realize this programm in the sequel beginning from the finite dimensions and passing to the infinite dimensional case by a limiting argument. Let us not that this method is applicable in other situations than the Gaussian case also as one can see already in the case of dual potential.
Let us resume the following important observation: this work is devoted to the creation of a variational calculus by parametrizing the formula 1.1 with the vector fields which are derivatives of scalar functionals. In another work, which has already appeared, [18], we have parametrized the same formula with adapted vector fields to obtain totally different results, like the existence, uniqueness and non-existence results of stochastic differential equations with past depending drift coefficients.
2. Preliminaries
Let be a separable Fréchet space equipped with a Gaussian measure of zero mean whose support is the whole space111The reader may assume that , or .. The corresponding Cameron-Martin space is denoted by . Recall that the injection is compact and its adjoint is the natural injection . The triple is called an abstract Wiener space. Recall that if and only if is finite dimensional. A subspace of is called regular if the corresponding orthogonal projection has a continuous extension to , denoted again by the same letter. It is well-known that there exists an increasing sequence of regular subspaces , called total, such that is dense in and in . Let be the -algebra generated by , then for any , the martingale sequence converges to (strongly if ) in . Observe that the function can be identified with a function on the finite dimensional abstract Wiener space , where .
Since the translations of with the elements of induce measures equivalent to , the Gâteaux derivative in direction of the random variables is a closable operator on -spaces and this closure will be denoted by cf., for example [15]. The corresponding Sobolev spaces (the equivalence classes) of the real random variables will be denoted as , where is the order of differentiability and is the order of integrability. If the random variables are with values in some separable Hilbert space, say , then we shall define similarly the corresponding Sobolev spaces and they are denoted as , . Since is a continuous and linear operator its adjoint is a well-defined operator which we represent by . In the case of classical Wiener space, i.e., when , then coincides with the Itô integral of the Lebesgue density of the adapted elements of (cf.[15]).
For any and measurable , we note by
[TABLE]
it is well-known that is a hypercontractive semigroup on , which is called the Ornstein-Uhlenbeck semigroup (cf.[15]). Its infinitesimal generator is denoted by and we call the Ornstein-Uhlenbeck operator (sometimes called the number operator by the physicists). Due to the Meyer inequalities (cf., for instance [15]), the norms defined by
[TABLE]
are equivalent to the norms defined by the iterates of the Sobolev derivative . This observation permits us to identify the duals of the space by , with , where the latter space is defined by replacing in (2.3) by , this gives us the distribution spaces on the Wiener space (in fact we can take as any real number). An easy calculation shows that, formally, , and this permits us to extend the divergence and the derivative operators to the distributions as linear, continuous operators. In fact and continuously, for any and , where denotes the completed Hilbert-Schmidt tensor product (cf., for instance [15]). The following assertion is useful: assume that converges to in , assume further that each each is a probability measure on , then is also a probability and converges to in the weak topology of measures. In particular, a lower bounded distribution (in the sense that there exists a constant such that is a positive distribution) is a (Radon) measure on , c.f. [15].
A measurable function is called -convex (cf.[7]) if
[TABLE]
is convex -almost surely, i.e., if for any , , we have
[TABLE]
almost surely, where the negligeable set on which this inequality fails may depend on the choice of and of . We can rephrase this property by saying that is an -valued convex function on . is called -convex if the map
[TABLE]
is convex on the Cameron-Martin space with values in . Note that all these notions are compatible with the -equivalence classes of random variables thanks to the Cameron-Martin theorem. It is proven in [7] that this definition is equivalent the following condition: Let be a sequence of regular, finite dimensional, orthogonal projections of , increasing to the identity map . Denote also by its continuous extension to and define . For , let and . Then is -convex if and only if
[TABLE]
is -almost surely convex. We define similarly the notion of -concave and -log-concave functions. In particular, one can prove that, for any -log-concave function on , and are again -log-concave [7].
3. Variational calculations
Assume for a while that is smooth; this can be achived by replacing by its regularization defined as
[TABLE]
where is the Ornstein-Uhlenbeck semi-group, is the sigma-algebra generated by and is a complete, orthonormal basis of . Since , if we take the Gateau derivative of at , it should give zero: Let and denote by the Gaussian Jacobian of :
[TABLE]
where is the Ornstein-Uhlenbeck operator , denotes the modified Carleman-Fredholm determinant, where the adjoint is taken w.r.t. the Wiener measure , c.f. [19]. It follows from the change of variables formula, c.f.[19], that , hence
[TABLE]
In particular, thanks to the -convexity of , if we replace by , for small , the shift becomes strongly monotone and it is the solution of the Monge transportation problem for the measure (i.e., the image of under ). Let be defined as
[TABLE]
If such that has small -norm as a Hilbert-Schmidt operator, then is a strongly monotone shift for small , hence it is almost-surely invertible (cf. [19], Corollary 6.4.2). Note moreover that the shift is the unique solution of another Monge problem, namely the one which corresponds to the measure . Here the multiplication with a small permits us to have a sufficiently large set on which we calculate the Gateau derivative while preserving the -convexity of the corresponding Monge potential, namely . Using again the change of variables formula for , we get
[TABLE]
Therefore
[TABLE]
Since minimizes the function between all the absolutely continuous shifts, we should have
[TABLE]
for any with . Since the set of vector fields
[TABLE]
is dense in any , we have proved the following
Theorem 2**.**
In the finite dimensional smooth case, the Monge potential satisfies the following relation
[TABLE]
almost surely, where denotes the Gaussian divergence w.r.t. , i.e., the adjoint of w.r.t. .
**Proof: **In the equation (3) we have a term with trace, we just interpret it as a scalar product on the Hilbert-Schmidt operators on the Cameron-Martin space and the claim follows, for the case , from the definition of as a mapping from Hilbert-Schmidt-valued operators to the vector fields under this scalar product. Hence we have the identity
[TABLE]
Since we have a.s., where and , in probability, where . The justification of the other terms being trivial, the proof is completed.
Lemma 1**.**
Let and let be any fixed element of the Cameron-Martin space , then we have almost surely
[TABLE]
**Proof: **Let , this is a symmetric operator; as is a positive operator, we can write
[TABLE]
trivially, where denotes the Hilbert-Schmidt norm.
Proposition 1**.**
Let be the Monge potential of Monge-Kantorovich problem with the target measure . Assume that is smooth as explained above, then we have
[TABLE]
where denotes the Hilbert-Schmidt norm on .
**Proof: **The proof follows from the calculation of the second moment of the norm of a vector-valued divergence of Theorem 2 combined with the result of Lemma 1.
The next two lemmas give useful stability results of the forward and backward potentials in the finite dimensional situations whenever the target measures are approximated with more regular measures. There are some results in the literature (cg. [5, 20]), but they are of limited applicability. sequel:
Lemma 2**.**
Let be the standard Gaussian measure on , s.t.
[TABLE]
Let be the Monge potentials associated to the Monge-Kantorovitch problem , where . Define as to be , where denotes the Ornstein-Uhlenbeck semigroup on . Let be the Monge potentials corresponding to Monge-Kantorovich problem , where . Then converges to in , converges to in and converges to in
**Proof: **Let be the unique solutions of Monge-Kantorovitch problems for and respectively. From Brenier’s theorem (cf.[3])
[TABLE]
and for any . Similarly
[TABLE]
and for any . As weakly and since is exponentially integrable w.r.t. uniformly in , it is easy to deduce, using the Young inequality, that
[TABLE]
and this implies that (cf.[1], Lemma 8.3)
[TABLE]
where denotes the second order Wasserstein distance on the probability measures on . These relations imply that is bounded in . Moreover, we have
[TABLE]
By the boundedness of in there exists such that converges weakly to (upto a subsequence) in , hence also in . In the sequel we replace by and by to avoid the ambiguities about the constants. We have
[TABLE]
for any
Moreover
[TABLE]
Hence converges to [math] in , consequently is also uniformly integrable in , therefore there exists some which is a weak adherent point of . Therefore
[TABLE]
-a.s. Let and be the convex combinations of the sequences and respectively, which converge strongly in and respectively. Let and . We have then
[TABLE]
for all and
[TABLE]
-almost surely. By the uniqueness of the solution of the Monge-Kantorovitch problem we should have and -a.s. Assume now that is another weak cluster point of , then -a.s., hence -a.s. Hence converges to in . Similarly converges to in , moreover is closable on and and this completes the proof.
Lemma 3**.**
Let be the standard Gaussian measure on , be a probability density such that
[TABLE]
Let be the Monge potentials associated to the Monge-Kantorovitch problem , where . Define as another density, where is approximating the constant . Let be the Monge potentials corresponding to Monge-Kantorovich problem with quadratic cost over Then converges to in , converges to in .
**Proof: **The proof is similar to the proof of Lemma 2. Let and be the transport plans corresponding to the Monge-Kantorovitch problems for and respectively. As in the Lemma 2, we have
[TABLE]
and for any . Since
[TABLE]
we have
[TABLE]
By positivity, we deduce that is bounded in and that is uniformly integrable in . Hence there are weak some adherence points of in denoted as and of in denoted as such that
[TABLE]
-a.s. By taking convex combinations, we can assume these convergences to be in the strong sense. Let us define and as and , where denotes convex combinations, we obtain
[TABLE]
for all and that
[TABLE]
-a.s. By uniqueness, we should have and -a.s. Since this construction holds for any infinite subsequences of and , these sequences have unique weak accumulation points, i.e., they converge weakly in and in respectively. Moreover we know that , hence converges strongly in and hence strongly in .
After these preparations, we can prove our first regularity result:
Theorem 3**.**
Assume that is the Monge-Brenier map which is the solution of Monge problem with the quadratic cost where is the norm of the Cameron-Martin space and with the target measure with for some and let be the inverse map wich maps to . Assume that
[TABLE]
Then we have , where denotes the space of Hilbert-Schmidt operators on . In other words belongs to the -domain of and belongs to -domain of again. Besides we have the following control:
[TABLE]
**Proof: **Note that the relation (3.7) implies the closability of the gradient operator in , hence and are well-defined. For the proof, let be defined as before with the relation . Let us denote by and by forward and backward Monge potentials corresponding to the measure . From Proposition 1, we have
[TABLE]
Since and are inverse to each other, we have
[TABLE]
Substituting this identity in (3.9), we obtain
[TABLE]
Let be the Monge potentials corresponding to the transportation of to the measure . It follows from Lemma 2 that converges to in and that converges to in as and as . Hence we have
[TABLE]
Moreover, taking weak limits first as then as we obtain, due to the weak lower semi-continuity of the norms, that
[TABLE]
Let us show now the regularity of the forward Monge potential : assume first that, we have reduced the problem to the case where everything is smooth using the approximation results that we have proven before. Let be the measure defined by . The following relation holds then true:
[TABLE]
It is important to remark that in the equation 3.11, the infimum is taken over the set of probability measures and in the equation 3.12, the infimum is taken over the perturbations of identity of the form when runs in the set of the gradients of -convex functions, cf. [9]. Moreover, denoting by , we have
[TABLE]
where is the Gaussian Jacobian associated to . Therefore and we get
[TABLE]
Consequently
[TABLE]
We know that the above infimum is attained at , hence we should have
[TABLE]
for any smooth suct that . A similar calculation as performed before implies that
[TABLE]
for any as above. Consequently we have
Theorem 4**.**
The dual Monge potential satisfies the relation
[TABLE]
where denotes the adjoint of w.r.t. the measure .
We need a couple of techical results:
Lemma 4**.**
Let be a smooth vector field, then the following results hold true:
- (1)
. 2. (2)
For any ,
[TABLE] 3. (3)
For any and smooth ,
[TABLE]
Lemma 5**.**
For any smooth , we have
[TABLE]
**Proof: **By the definition of , we have
[TABLE]
Besides (cf.[19]). Hence
[TABLE]
We also have
[TABLE]
Substituting this expression in the above calculation gives
[TABLE]
Theorem 5**.**
Assume that for some , satisfying . Assume moreover that it is -convex for some , in the sense that the mapping
[TABLE]
is a convex map from the Cameron-Martin space to (i.e., the equivalence class of real-valued Wiener functionals under the topology of convergence in probability). Then the forward Monge potential belongs to the Gaussian Sobolev space .
**Proof: **let be defined as . Since is a smooth, -convex function, the corresponding forward potential is also smooth from the classical finite dimensional results (cf. [4], [20]). Let , then we have, from Theorem 4
[TABLE]
From Lemma 5 and denoting by , we get
[TABLE]
Since the second terms at the right of the second line is positive (as we have already observed), we obtain
[TABLE]
Hence
[TABLE]
but and
[TABLE]
We also have
[TABLE]
Consequently we get
[TABLE]
and the claim follows by taking the limit at the r.h.s. and he limit inferior at the l.h.s. even with an explicit bound:
[TABLE]
The next corollary follows from Theorem 3 and from Lemma 5 it is about the regularity of the dual potential :
Corollary 1**.**
Assume that , where denotes the operator norm on . Then belongs to .
We can get rid of the hypothesis of the second order differentiability of by increasing the degree of integrability of . We show first
Lemma 6**.**
Assume that , then
[TABLE]
where is a universal constant.
**Proof: **We can work without loss of generality on the classical Wiener space on which lives the canonical Brownian motion . Let be the fourth order Wasserstein distance, i.e.,
[TABLE]
Let , we can represent it as using the Itô representation theorem, where is an element of such that is -measurable for almost all . Moreover, from the Clark formula, we have
[TABLE]
where is the Lebesgue density of . Define as , the Girsanov theorem implies that the measure belongs to , hence
[TABLE]
where is a universal constant coming from the Burkholder-Davis-Gundy inequality. By the cyclic monotonicity of the -sections of , we know that is the unique solution of the Monge-Kantorovitch problem of fourth degree, hence .
Theorem 6**.**
Assume that , then .
**Proof: **Let be the smooth approximations of defined as and let be the corresponding dual Monge potentials converging to in s.t. . We have
[TABLE]
where denotes the Hilbert-Schmidt inner product. Applying again the Hölder inequality to the remaining terms and using the formula given in Lemma 5, and applying Lemma 6, we get
[TABLE]
Since converges to on the smooth cylindrical functions, the inequality implies that this sequence is bounded in . Using the inequality that we have already proven:
[TABLE]
and using the weak lower semi-continuity, we obtain
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] P.J. Bickel and D.A. Freedman: “Some asymptotic theory for the bootstrap”. Ann. Statis. 9(6), 1196–1217 (1981)
- 2[2] V. I. Bogachev and A. V. Kolesnikov:“ Sobolev regularity for the Monge-Ampère equation in the Wiener space. Kyoto J. Math. 53, no. 4, 713–738 (2013).
- 3[3] Y. Brenier: “Polar factorization and monotone rearrangement of vector valued functions”. Comm. Pure Appl. Math, 44 , 375-417, 1991.
- 4[4] L. A. Caffarelli: “The regularity of mappings with a convex potential”. J. Am. Math. Soc. 5, 99–104 (1992)
- 5[5] V. Chernozhukov: “Uniform convergence of transport maps”. ar Xiv:1412.8434 v 1.
- 6[6] S. Fang and V. Nolot: “Sobolev estimates for optimal transport maps on Gaussian spaces”. J. Funct. Anal. 266, no. 8, 5045–5084 (2014).
- 7[7] D. Feyel and A.S. Üstünel. The notion of convexity and concavity on Wiener space. Journal of Functional Analysis , vol. 176,pp. 400-428, 2000.
- 8[8] D. Feyel and A. S. Üstünel: “Transport of measures on Wiener space and the Girsanov theorem”. Comptes Rendus Mathématiques, Vol. 334 , Issue 1, 1025-1028, 2002.
