Regularity of Schr\"odinger's functional equation in the weak topology and moment measures
Toshio Mikami

TL;DR
This paper investigates the continuity and measurability of solutions to Schrödinger's functional equation under weak topology, and applies these results to construct convex functions with prescribed moment measures via stochastic optimal transportation.
Contribution
It extends previous work by analyzing the problem under weak topology and introduces a method to construct convex functions with specific moment measures using zero noise limits.
Findings
Established continuity and measurability of solutions in weak topology.
Constructed convex functions with given moment measures.
Linked Schrödinger's equation solutions to stochastic optimal transportation.
Abstract
We study the continuity and the measurability of the solution to Schr\"odinger's functional equation, with respect to space, kernel and marginals, provided the space of all Borel probability measures is endowed with the weak topology. This is a continuation of our previous result where the space of all Borel probability measures was endowed with the strong topology. As an application, we construct a convex function of which the moment measure is a given probability measure, by the zero noise limit of a class of stochastic optimal transportation problems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · advanced mathematical theories · Advanced Mathematical Modeling in Engineering
Regularity of Schrödinger’s functional equation in the weak topology and moment measures
††thanks: 2010 Mathematics Subject Classification : Primary 60G30 ; Secondary 93E20
Toshio Mikami
This work was supported by JSPS KAKENHI Grant Numbers JP26400136 and JP16H03948.
Abstract
We study the continuity and the measurability of the solution to Schrödinger’s functional equation, with respect to space, kernel and marginals, provided the space of all Borel probability measures is endowed with the weak topology. This is a continuation of our previous result where the space of all Borel probability measures was endowed with the strong topology. As an application, we construct a convex function of which the moment measure is a given probability measure, by the zero noise limit of a class of stochastic optimal transportation problems.
1 Introduction
E. Schrödinger considered the following problem to find the statistical property of a particle on a finite time interval. Suppose that there exist particles in a set and each particle moves independently, with a given transition probability, to a set , where . Find the maximal probability of such events, provided the numbers of particles in each point , are fixed (see section 7 in [41] and also [40]). Though he did not succeed in finding the maximal probability, he obtained Euler’s equation for the variational problem above. The continuum limit is called Schrödinger’s functional equation (see [5, 9, 22, 24] for the solution of this problem). S. Bernstein [4] generalized Schrödinger’s idea and introduced the so-called Bernstein processes which are also called reciprocal processes. The theory of stochastic differential equation for Schrödinger’s functional equation was given by B. Jamison [25]. The solution is Doob’s h-path process (see [15]) with given two end point marginals. Schrödinger’s problem is also related to the theory of large deviations, the optimal mass transportation problem, entropic estimates and functional inequalities (see, e.g. [1, 2, 11, 12, 16, 18, 23, 26, 27, 28, 30, 31, 35, 36, 37, 43] and the references therein).
We describe E. Schrödinger’s functional equation (see e.g. [24]) in the setting considered in this paper. Let be a -compact metric space and . For Borel probability measures on , find nonnegative -finite Borel measures on for which the following holds:
[TABLE]
It is known that there exists a solution of (1.1) (see [9, 24]). is unique up to a constant though the product measure is unique. Indeed, for any , is also a solution of (1.1). By the uniqueness of the solution to (1.1), we mean that of the product measure . Let be a nondecreasing sequence of compact subsets of such that , where when is compact. When we consider and separately, considering if necessary, we assume that the following holds:
[TABLE]
where
[TABLE]
[TABLE]
[TABLE]
Then and are positive and
[TABLE]
(1.1) can be rewritten as follows: for , ,
[TABLE]
In particular, Schrödinger’s problem (1.1) is equivalent to finding functions and for which (1.6) holds. is unique up to a constant though is unique.
Let and denote the space of all Radon measures and that of all Borel probability measures on , respectively, where a Radon measure means a locally finite and inner regular Borel measure. It is easy to see that and are functionals of , and :
[TABLE]
In [33], we considered the case where is endowed with the strong topology and showed that if is compact, then the following is continuous:
[TABLE]
[TABLE]
and . Here is endowed with the strong topology and and are endowed with the topology induced by the uniform convergence on and , respectively. We also showed that if is -compact, then the following is Borel measurable:
[TABLE]
[TABLE]
As an application of this measurability result, we showed that the coefficients of the mean field PDE system for the h-path process with given two end point marginals are measurable functions of space, time and marginal.
Remark 1.1
(1.2) was assumed in [33] and implies that for ,
[TABLE]
In particular, the measurability of implies that of .
In this paper we consider the case where is endowed with the weak topology and show the continuity and measurability results on and (see Theorem 2.1 and Corollaries 2.1-2.3 in section 2). Our continuity result in the weak topology is useful when one considers the existence of a minimizer of a variational problem (see [10] for the continuity result on optimal transport). Indeed, it is not easy to show that a minimizing sequence is compact in the strong topology. As an application (see Theorem 2.2 in section 2), we give a stochastic optimal transportation approach to moment measures (see [13, 39]). The definition of a moment measure of a convex function is the following.
Definition 1.1
Given a convex function , the following is called the moment measure of :
[TABLE]
Remark 1.2
If is a moment measure of a convex function , then is the unique minimizer of the -Wasserstein distance , provided and is finite (see [7, 8, 43]). Here denotes the delta measure on and for ,
[TABLE]
We describe an application of our continuity result more precisely. Let and let and denote a -dimensional Brownian motion and a progressively measurable -valued stochastic process on a filtered probability space, respectively. Consider the following SDE in a weak sense (see e.g. [20]):
[TABLE]
For ,
[TABLE]
where if the set over which the infimum is taken is empty (see [1, 2, 16, 18, 19, 27] for related problems on large deviations). For ,
[TABLE]
For , ,
[TABLE]
where
[TABLE]
By our weak continuity result of , we can easily prove the existence of a minimizer of from the lower semicontinuities of a relative entropy and of with respect to the weak topology (see (1.20) and also Lemmas 3.4 and 3.5 in section 3). We show that a subsequence of weakly converges, as , to a Borel probability measure such that is convex and is a moment measure of . This is formally implied by the representation of and the SDE for the minimizer of (see (2.11) and (1.17)). We also show that has a subsequence which uniformly converges, as , to , provided is compactly supported.
formally converges, as , to the functional considered in [39] where they take the infimum over instead of . Our approach makes the proof easier than [39] since is compact in the weak topology but can not be applied if we replace by , which we regret.
In the proof of the representation of in (2.11), we also make use of properties of the solution to Schrödinger’s functional equation and the duality theorem for :
[TABLE]
Here the supremum is taken over all classical solutions to the following Hamilton-Jacobi-Bellman PDE:
[TABLE]
(see [30, 31, 34, 42] and the references therein).
[TABLE]
It is known that for any for which , there exists the unique weak solution to the following two end points problem of SDE (see [25] and also [33, 34]):
[TABLE]
is called the h-path process for on with initial and terminal distribution and , respectively. The following is also known:
[TABLE]
Suppose that is finite (see Remark 2.2 in section 2 for a sufficient condition). Then in (1.17) is the unique minimizer of (see [14, 21], [26]-[38], [42], [44] and the references therein). Besides, there exists which is unique up to a constant such that the following holds (see [30, 31, 33, 34, 42] and the references therein and also (1.5)):
[TABLE]
In particular, the following holds:
[TABLE]
Here denotes the relative entropy of two measures: for ,
[TABLE]
Remark 1.3
If is finite, then . Indeed, is the relative entropy of with respect to on and
[TABLE]
Here denotes the convolution of two measures.
In section 2 we state our main results and prove them in section 4 by lemmas which are given in section 3.
2 Main result
In this section we state our main results. We first describe assumptions precisely.
(A1) is a complete -compact metric space.
(A1)’ is a compact metric space.
(A2) .
We remark that is endowed with the weak topology and is endowed with the topology induced by the uniform convergence on every compact subset of .
Under (A1), let be a nondecreasing sequence of functions in such that the following holds:
[TABLE]
(see (1.2)). If , then and we assume that . For , ,
[TABLE]
provided the right hand side is well defined (see (1.7) and also (1.4)).
[TABLE]
The following is the continuity result on , and .
Theorem 2.1
Suppose that (A1) and (A2) hold and that , , , and
[TABLE]
[TABLE]
Then for any ,
[TABLE]
In particular,
[TABLE]
For any which converges, as , to , and for sufficiently large ,
[TABLE]
Since is measurable, Theorem 2.1 implies the following.
Corollary 2.1
Suppose that (A1) and (A2) hold. Then the following are Borel measurable: for ,
[TABLE]
[TABLE]
If is compact, then (see (1.2)). This implies, from Theorem 2.1, the following of which the proof is omitted.
Corollary 2.2
Suppose that (A1)’ and the assumption of Theorem 2.1 except (A1) hold. Then the following holds: for ,
[TABLE]
and for any which converges, as , to ,
[TABLE]
A uniformly bounded sequence of convex functions on a convex neighborhood of a convex subset of is compact in , provided is positive (see e.g., [3], section 3.3). We describe an additional assumption and state a stronger result than above, provided .
(A3.) There exists for which and are convex on for any and any , respectively.
Remark 2.1
If has bounded second order partial derivatives on , then (A3.) holds.
[TABLE]
The following is a stronger convergence result than Corollary 2.2.
Corollary 2.3
Let . Suppose that (A3.) and the assumptions of Corollary 2.2 with hold. Then for any ,
[TABLE]
[TABLE]
As an application of our regularity result, we show that there exists a convex function of which the moment measure is a given probability measure.
Theorem 2.2
For any for which is finite, there exists a minimizer of . For any minimizer of ,
[TABLE]
where is a normalizing constant. Besides, there exists a subsequence of which weakly converges, as , to a probability measure such that is a moment measure of . Suppose, in addition, that is compactly supported. Then there exists a subsequence of which uniformly converges, as , to a probability density function such that is a moment measure of .
Remark 2.2
If and is finite, then is finite. Indeed, from the last equality of (1.20),
[TABLE]
*since, the relative entropy is nonnegative. *
3 Lemmas
In this section we state and prove lemmas. When it is not confusing, we omit the dependence of on .
3.1 Lemmas for the proof of Theorem 2.1 and Corollary 2.3
The following lemma will be used in the proof of Theorem 2.1.
Lemma 3.1
Suppose that (A1) and (A2) hold. Then for any , defined by (1.3),
[TABLE]
(Proof) The proof is done by the following (see (1.3)):
[TABLE]
For and ,
[TABLE]
Lemmas 3.2 and 3.3 will be used to prove Corollary 2.3.
Lemma 3.2
([5], p. 194)* Let . Suppose that (A2) with holds. Then, for any , the following holds (see (1.4) for notation):*
[TABLE]
By the method of proving the convexity of a log moment generating function, we obtain the following.
Lemma 3.3
Let and be a convex subset of and a nonnegative Radon measure, respectively. Suppose that is convex, -a.e.. Then is convex.
(Proof) For and , by Hölder’s inequality,
[TABLE]
3.2 Lemmas for the proof of Theorem 2.2
In this subsection, we prove lemmas for the proof of Theorem 2.2. Lemma 3.3 will be also used in the proof of Theorem 2.2.
[TABLE]
The lower semicontinuity of a relative entropy and the continuity result in Theorem 2.1 imply the following.
Lemma 3.4
Suppose that Theorem 2.1 holds. Then for any , the following is lower-semicontinuous on (see (1.4), (1.7) and (1.16) for notation):
[TABLE]
(Proof) From (1.20),
[TABLE]
(see (1.18) and (2.2) for notation). Since is lower semicontinuous (see [17], Lemma 1.4.3), the proof is over from Theorem 2.1.
The following lemma can be proved by the lower semicontinuity of a relative entropy.
Lemma 3.5
For any , is lower-semicontinuous on in the weak topology.
(Proof)
[TABLE]
The proof is done by the following:
[TABLE]
(see e.g. [17], Lemma 1.4.3).
Lemma 3.6
Suppose that (A1) and (A2) hold. Then for any , defined by (1.3) and sufficiently large , is nondecreasing, and the following holds:
[TABLE]
(Proof) The proof is done by the following (see (1.3) and (2.1)):
[TABLE]
provided the right hand side is positive.
For
[TABLE]
In the following lemma, the boundedness of the set plays a crucial role.
Lemma 3.7
For any and ,
[TABLE]
Suppose that in (2.11) is a minimizer of . Then for ,
[TABLE]
In particular, for any sequence which converges to [math] as , the set has a positive Lebesgue measure, provided and is finite.
(Proof) Let denote the probability density function of the uniform distribution on . Then the following implies (3.11):
[TABLE]
We prove (3.12). We only have to consider the case where is finite and . From (1.20) and (2.11), by Jensen’s inequality,
[TABLE]
Indeed, one can show that is convex from Lemma 3.3 and that is finite and continuous on since is a finite measure on . The last part of this lemma can be shown by Fatou’s lemma from (3.8) in Lemma 3.6 and from the following: for ,
[TABLE]
since is supported on .
For a convex function , the [math]-sublevel set is convex. Roughy speaking, the following lemma can be proved from the fact that a uniformly bounded sequence of convex functions defined on the same open set is compact in the sup norm on any compact subset of the open set (see section 3.3 in [3]).
Lemma 3.8
(i) For a convex set , is a convex function. (ii) For a bounded sequence of convex sets , there exists a closed convex set and a subsequence of such that converges, as , to uniformly on every compact subset of . (iii) For any , the following holds: for sufficiently large ,
[TABLE]
where .
(Proof) (i) For , , , since ,
[TABLE]
Taking the infimum over all , the proof is done.
(ii) Since is bounded, is also locally bounded, which implies that there exists a convex function and a subsequence such that
[TABLE]
uniformly on every compact subset of (see, e.g., [3], section 3.3).
[TABLE]
Then it is easy to see that the set is a closed convex set and .
(iii) We only have to consider the case where . From (ii), for sufficiently large ,
[TABLE]
where
[TABLE]
For , if , then the following which contradicts (3.16) holds: for ,
[TABLE]
Indeed, since is convex, for , there exists such that
[TABLE]
4 Proof of main results
In this section we prove our main results.
(Proof of Theorem 2.1) We first prove (2.5). For the sake of simplicity,
[TABLE]
Since is convergent, is tight. Indeed, for any Borel sets ,
[TABLE]
and a convergent sequence of probability measures on a complete separable metric space is tight by Prohorov’s Theorem (see, e.g., [6]). Here notice that a -compact metric space is separable. By Prohorov’s theorem, take a weakly convergent subsequence and denote the limit by . Then it is easy to see that the following holds:
[TABLE]
From (A2) and (2.3)-(2.4), the following holds: for any ,
[TABLE]
Indeed,
[TABLE]
The rest of the proof of (2.5) is divided into the following (4.3)-(4.4) which will be proved later.
There exists a subsequence and finite measures , such that for sufficiently large and any ,
[TABLE]
From (4.3), for sufficiently large and any Borel sets ,
[TABLE]
(4.4) implies that is a product measure which satisfies (1.1). (4.2) and the uniqueness of the solution to (1.1) implies that (2.5) is true.
We prove (4.3)-(4.4) to compete the proof of (2.5). (4.3) can be proved by the diagonal method, since is tight and since for sufficiently large ,
[TABLE]
has a convergent subsequence from (3.1) in Lemma 3.1 by Prohorov’s Theorem and since any weak limit is a product measure. We prove (4.4). From (4.2) and (4.3), for sufficiently large ,
[TABLE]
From (4.6), for , setting ,
[TABLE]
Substitute (4.7) to (4.6) and let . Then we obtain (4.4). (2.7) can be shown from (2.5) by the following: from (2.1),
[TABLE]
provided the right hand side is positive.
For a compact set , is upper semicontinuous in the weak topology and is hence measurable. Corollary 2.1 can be proved in the same way as in [33] and we omit the proof.
As we mentioned in section 2, we omit the proof of Corollary 2.2. Corollary 2.2 and Lemmas 3.2 and 3.3 immediately imply Corollary 2.3 (see [3], section 3.3) and we omit the proof. Indeed, if any subsequence of a sequence of pointwise convergent continuous functions has a uniformly convergent subsequence, then it is uniformly convergent.
Before we prove Theorem 2.2, we briefly describe the idea of the proof. Theorem 2.1 and Lemmas 3.4 - 3.5 imply the lower semicontinuity of the functional that we minimize in . (3.11) in Lemma 3.7 implies the finiteness of . In particular, the existence of a minimizer of is obtained. (2.11) can be proved by the Duality Theorem (1.14) for and by the fact that the relative entropy of two probability measures is nonnegative and is equal to zero if and only if two probability measures are the same. The characterization of the limit of , as , can be inferred from the following. Roughly speaking, from [28],
[TABLE]
(see (1.9) and (1.11) for notation). Besides, there exists a convex function such that for the minimizer of , as ,
[TABLE]
[TABLE]
In particular,
[TABLE]
(Proof of Theorem 2.2) Since is tight, by Prohorov’s Theorem (see, e.g., [6]), Lemmas 3.4-3.5 and (3.11) in Lemma 3.7 imply the existence of a minimizer of . By (1.14),
[TABLE]
Let denote in (1.19) with . Then
[TABLE]
(see (1.19), (1.5) and Remark 2.2). Indeed, for ,
[TABLE]
since
[TABLE]
[TABLE]
and by Jensen’s inequality,
[TABLE]
(2.11) holds since is a constant (see (1.19)) and since, for ,
[TABLE]
Here
[TABLE]
and the equality holds if and only if
[TABLE]
We prove the second part of Theorem 2.2. For , and ,
[TABLE]
(see (2.1) for notation). Since is compact, and has a weakly convergent subsequence by Prohorov’s theorem in the same way as in the proof of Theorem 2.1 (see [6]). Let and denote the weak limit along the same subsequence, as , of and , respectively. For sufficiently large , by the diagonal method, has a subsequence which is uniformly convergent, as , on every compact subset of (see (3.10) for notation). Indeed, for sufficiently large and small , , are convex from Lemma 3.3, and is uniformly bounded on every compact subset of , from (3.8) in Lemma 3.6:
[TABLE]
Let denote a sequence which converges to [math], as and along which the above sequences are all convergent.
[TABLE]
There exists the limit
[TABLE]
Indeed, is nondecreasing since
[TABLE]
From the last statement of Lemma 3.7, there exists such that , since
[TABLE]
To complete the proof of Theorem 2.2, we show that the following holds:
[TABLE]
[TABLE]
[TABLE]
where is a convex subset of and is a normalizing constant. Notice that is convex and is differentiable a.e. on its domain.
The following implies that (4.16) holds: for sufficiently large ,
[TABLE]
Indeed, from (4.14) and (4.19), for sufficiently large ,
[TABLE]
, -a.s.. To prove (4.19), we first prove that the following holds: for sufficiently large ,
[TABLE]
For , ,
[TABLE]
Then for and ,
[TABLE]
(see (4.15)). Indeed, for , is supported on since and
[TABLE]
[TABLE]
Next we prove that the following holds: for sufficiently large ,
[TABLE]
[TABLE]
Then is open since is convex and finite (see (4.12)-(4.13)) and is continuous. The following implies that (4.23) is true: from (4.13), for sufficiently large ,
[TABLE]
For ,
[TABLE]
Indeed, from (4.14) and (4.20), for sufficiently large such that ,
[TABLE]
(4.14) and the following imply (4.25): from (4.13),
[TABLE]
[TABLE]
and are finite for from (4.12), since from (4.14) and the equality in (4.25),
[TABLE]
For a set and a function ,
[TABLE]
[TABLE]
Then, from (4.25), for ,
[TABLE]
Here denotes the restriction of on and the equality holds if for some , in which case , where for a function ,
[TABLE]
In particular, , -a.s. from (4.16). -a.s. since
[TABLE]
[TABLE]
and since has a probability density function. In the same way, one can show that , -a.s..
[TABLE]
(see (3.10) for notation). Then, from Lemma 3.7,
[TABLE]
Indeed,
[TABLE]
For ,
[TABLE]
Then, from Lemma 3.8, there exists a convergent subsequence in and a closed convex set such that
[TABLE]
[TABLE]
Then we prove that the following holds: for a closed set ,
[TABLE]
The proof of (4.29) is done by the following (4.30)-(4.31) which will be proved later.
[TABLE]
[TABLE]
Notice that, from (4.13)-(4.14) and Lemma 3.7, the following holds:
[TABLE]
We prove (4.30). For sufficiently large ,
[TABLE]
(see (3.10) for notation). Let denote the function with replaced by . Then for , from (4.13) and (4.28),
[TABLE]
since is nondecrerasing.
We prove (4.31).
[TABLE]
Then
[TABLE]
From (4.32), we only have to prove that the following holds:
[TABLE]
[TABLE]
since . For any , sufficiently large and , from Lemma 3.8 and (4.13),
[TABLE]
[TABLE]
[TABLE]
Here, from (3.10) and (4.21) (see also (1.1)),
[TABLE]
(4.12) and (4.13) complete the proof of (4.34).
If is compactly supported, then and for , provided . (4.11)-(4.13) imply that the last statement of Theorem 2.2 holds.
Acknowledgement: This work was supported by JSPS KAKENHI Grant Numbers JP26400136 and JP16H03948. We would also like to thank an anonymous referee for useful suggestions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Adams, N. Dirr, M. A. Peletier, J. Zimmer, From a large-deviations principle to the Wasserstein gradient flow: a new micro-macro passage, Commun. in Math. Phys. 307 , (2011), 791–815.
- 2[2] J. Backhoff, G. Conforti, I. Gentil, C. Léonard, The mean field Schrödinger problem: ergodic behavior, entropy estimates and functional inequalities, ar Xiv:1905.02393 v 1.
- 3[3] I. J. Bakelman, Convex Analysis and Nonlinear Geometric Elliptic Equations, Springer-Verlag, 1994.
- 4[4] S. Bernstein, Sur les liaisons entre les grandeurs alétoires, Verh. des intern. Mathematikerkongr. Zurich 1932, Band 1 , (1932), 288–309.
- 5[5] A. Beurling, An Automorphism of Product Measures, Ann. of Math. 72 , (1960), 189–200.
- 6[6] P. Billingsley, Convergence of Probability Measures, Wiley-Interscience, 1999.
- 7[7] Y. Brenier, Décomposition polaire et réarrangement monotone des champs de vecteurs, C. R. Acad. Sci. Paris Série I, 305 , no. 19, (1987), 805–808.
- 8[8] Y. Brenier, Polar factorization and monotone rearrangement of vector-valued functions, Comm. Pure Appl. Math., 44 , no. 4, (1991), 375–417.
