An optimal transport problem with backward martingale constraints motivated by insider trading
Dmitry Kramkov, Yan Xu

TL;DR
This paper investigates a single-period optimal transport problem with a backward martingale constraint and a covariance-type cost, providing conditions for optimality, uniqueness, and representation, inspired by insider trading models.
Contribution
It introduces a novel optimal transport framework with backward martingale constraints and characterizes optimal plans via maximal monotone sets, extending classical models.
Findings
Optimal plans are supported on maximal monotone sets.
Sharp regularity conditions for uniqueness and map representation.
Connection to insider trading models like the Kyle model.
Abstract
We study a single-period optimal transport problem on with a covariance-type cost function and a backward martingale constraint. We show that a transport plan is optimal if and only if there is a maximal monotone set that supports the -marginal of and such that for every in the support of . We obtain sharp regularity conditions for the uniqueness of an optimal plan and for its representation in terms of a map. Our study is motivated by a variant of the classical Kyle model of insider trading from Rochet and Vila (1994).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
An optimal transport problem with backward martingale
constraints motivated by insider trading.
Dmitry Kramkov111Carnegie Mellon University, Department of Mathematical Sciences, 5000 Forbes Avenue, Pittsburgh, PA, 15213-3890, USA. *Email: *[email protected]
Yan Xu222Carnegie Mellon University, Department of Mathematical Sciences, 5000 Forbes Avenue, Pittsburgh, PA, 15213-3890, USA. *Email: *[email protected]
Abstract
We study a single-period optimal transport problem on with a covariance-type cost function and a backward martingale constraint. We show that a transport plan is optimal if and only if there is a maximal monotone set that supports the -marginal of and such that for every . We obtain sharp regularity conditions for the uniqueness of an optimal plan and for its representation in terms of a map. Our study is motivated by a variant of the classical Kyle model of insider trading from Rochet and Vila (1994).
Keywords:
martingale optimal transport, Kyle equilibrium.
AMS Subject Classification (2010):
60G42, 91B24, 91B52.
1 Introduction
Let be a probability space and be a -dimensional random variable with finite second moment: . Our goal is to
[TABLE]
for the cost function , , and the domain that consists of -measurable random variables such that is a martingale: . A relaxation of the -measurability constraint on leads to the optimal transport problem:
[TABLE]
where and is the family of probability measures on that have as their -marginal: , and make a martingale out of the canonical process: . In view of the martingale constraint, problem (2) admits an equivalent formulation:
[TABLE]
and thus, has a natural connection to the classical Fréchet-Hoeffding inequality and the Wasserstein -distance.
Problem (2) exhibits a backward structure in the sense that the initial marginal is part of the solution. In this regard, it differs from the “standard” single-period martingale transport problem in Beiglböck and Juillet (2016), Beiglböck et al. (2017), Henry-Labordère and Touzi (2016), and Ghoussoub et al. (2019), among others, where both the initial and terminal marginals are fixed. We point out that for our cost function , the standard problem is trivial, as every martingale measure with given marginals and produces the same average cost:
[TABLE]
Our work is motivated by the classical Kyle (1985) equilibrium with insider from financial economics. More precisely, we consider the model from Rochet and Vila (1994), where the insider observes both the terminal value of the risky asset and the order flow of the noise traders; see Section 6 for details. Setting we establish in Theorem 6.3 the equivalence between the existence of equilibrium and that of an optimal map for (1) such that is an optimal plan for (2). Moreover, the components of are naturally identified as equilibrium’s total order and price . To the best of our knowledge, the connection between the Kyle equilibrium and a martingale optimal transport is new.
The main results of the paper are Theorems 2.2 and 4.6. In Theorem 2.2 we prove the existence of an optimal plan for (2) and characterize its support. We show that is optimal if and only if there is a maximal monotone set in that supports the -marginal of and such that
[TABLE]
Geometrically, the support of has the hyperbolic tangent property: it connects to those , that are touched by the hyperbola
[TABLE]
see Figure 1. Surprisingly, as a consequence of (3), the optimal plan possesses properties of solutions to classical unconstrained problems. By Corollary 2.3, the -marginal of is a Fréchet-Hoeffding coupling between its first and second coordinates, while, by Corollary 2.5, is a classical optimal coupling between its - and -marginals.
In Theorem 3.2 we show that the set from (3) is a solution of the dual problem:
[TABLE]
where is the family of all maximal monotone sets in , and that primal and dual problems (2) and (4) have identical values. The dual problem appears in (Rochet and Vila, 1994, Eq. (2.3)), where stands for the graph of a pricing rule. When has a Gaussian or, more generally, elliptically contoured distribution, becomes a line with strictly positive slope; see Example 5.1.
In Theorem 4.1, we show that optimal map and plan problems (1) and (2) have identical values, provided that is atomless. The result is similar to that of Pratelli (2007) for the classical unconstrained case. The existence of an optimal map for (1) that induces an optimal plan for (2) is obtained in Theorem 4.5 under the condition that gives zero mass to the graphs of strictly decreasing Lipschitz functions. This assumption is weaker than the standard regularity condition of the Brenier theorem, see (Ambrosio and Gigli, 2013, Theorem 1.26), that requires to assign zero mass to rotations of the graphs of Lipschitz functions. Our second main result, Theorem 4.6, establishes the uniqueness of solutions to (1) and (2) if, in addition, the (one-dimensional) distribution functions of and are continuous. Examples 5.2 and 5.3 show that the conditions of Theorems 4.1 and 4.5 are sharp.
Being applied to the model of Rochet and Vila (1994), Theorems 4.5 and 4.6 yield sufficient conditions for the existence and uniqueness of equilibria, which are stated in Theorem 6.7. These assumptions generalize those in Rochet and Vila (1994), where is required to have a continuous compactly supported density in . Rochet and Vila (1994) work with dual problem (4) and rely on the properties of the space of closed graph correspondences endowed with the Hausdorff topology.
Finally, Appendix A contains a density result for the Wasserstein spaces, for which we could not find a ready reference, while Appendix B collects the properties of the function from (3).
2 A backward martingale optimal transport problem
We denote by the family of Borel probability measures with finite second moments and by the Borel -algebra on . For a Borel probability measure on , a -integrable -dimensional Borel function , and an -dimensional Borel function , the notation stands for the -dimensional vector of conditional expectations of given under :
[TABLE]
Similarly, . We write a point in as , where and belong to , and think about and as the initial and terminal values of the canonical two-dimensional process.
Let . We denote by the family of probability measures that have as their -marginal and make a martingale out of the canonical process:
[TABLE]
Our goal is to
[TABLE]
for the covariance-type cost function
[TABLE]
Problem (5) belongs to the class of optimal transport problems with backward martingale constraints, in the sense that the initial -marginal is part of the solution. As we shall see in Section 6, such problem naturally arises in the study of the Kyle-type equilibrium with insider.
Remark 2.1*.*
Problem (5) admits several equivalent formulations. For instance, it has same solutions as the one, where we
[TABLE]
The justification comes from the identity
[TABLE]
where the second equality holds by the martingale property of .
For a Borel probability measure on we denote by its support, that is, the smallest closed set with full measure. We recall that a set is monotone if
[TABLE]
A monotone set is maximal if it is not a proper (or strict) subset of a monotone set. We denote by the family of maximal monotone sets in . It is well-known that if and only if is the graph of the subdifferential of a proper closed convex function on .
For we define a function
[TABLE]
Such functions will play a key role in our study. Their properties are collected in Appendix B. In particular, Lemma B.1 states that takes values in and .
The main results of the paper are Theorems 2.2 and 4.6. Theorem 2.2 establishes the existence of an optimal plan for (5) and shows the structure of its support. Theorem 4.6 contains a uniqueness result.
Theorem 2.2**.**
Let . An optimal plan for (5) exists. For a probability measure the following conditions are equivalent:
- (a)
* is an optimal plan for (5).* 2. (b)
If points and belong to , then
[TABLE] 3. (c)
There is such that
[TABLE]
Moreover, if is a maximal monotone set satisfying (8) and is the -marginal of , then contains and
[TABLE]
Figure 1 illustrates the properties of the support of an optimal plan stated in Theorem 2.2. Let and belong to and be such that and the points and lie, respectively, strictly above and strictly below the maximal monotone set from item (c). As Lemma 2.11 shows, item (b) means that the hyperbolas
[TABLE]
do not cross. The geometric interpretation of item (c) is that these hyperbolas are tangent to .
Before proceeding with the proof of Theorem 2.2, we establish rather surprising connections between an optimal martingale plan for (5) and the solutions of classical unconstrained optimal transport problems. If and are Borel probability measures on , then denotes the family of all couplings of and , that is, the family of Borel probability measures on with -marginal and -marginal . For , the Wasserstein 2-metric is given by
[TABLE]
Corollary 2.3**.**
Let , be the -marginal of an optimal plan for (5), and be the -marginal of , . Then is a solution of the optimal transport problem:
[TABLE]
or, equivalently,
[TABLE]
Proof.
It is well-known that problems (9) and (10) have same solutions and that an element of is such a solution if and only if its support belongs to a cyclically monotone set. By Theorem 2.2, there is a monotone set that contains the support of . Since every monotone set in is also cyclically monotone, the result follows. ∎
Remark 2.4*.*
Let be a maximal monotone set from Theorem 2.2 and be its projection on -coordinate. If is atomless, then the increasing function
[TABLE]
taking values in , defines an optimal map solution to (10):
[TABLE]
The function is a pricing rule in a version of the Kyle equilibrium with insider studied in Section 6.
Corollary 2.5**.**
Let , be an optimal plan for (5), and be the -marginal of . Then is a solution of the optimal transport problem:
[TABLE]
Proof.
By Theorem 2.2, there is such that
[TABLE]
where . Lemma B.1 shows that the -conjugate function
[TABLE]
takes values in and . By Theorem 2.2, and thus,
[TABLE]
Since , we deduce that
[TABLE]
and the optimality of for (11) follows. ∎
Remark 2.6*.*
We point out that the assertions of the corollaries are not sufficient for the optimality of . Indeed, let be the simplest martingale measure, whose -marginal is the Dirac measure concentrated at the mean . In this case, the families and are singletons and hence, and are trivial solutions to (10) and (11). An elementary analysis of (8) shows that such is optimal for (5) if and only if the support of belongs to a line with negative or infinite slope.
The rest of the section is devoted to the proof of Theorem 2.2, which we divide into lemmas. We start with the existence part and recall some basic facts on the Wasserstein distance ; see (Ambrosio and Gigli, 2013, Theorem 2.7 and Proposition 2.4). If and are in , then if and only if for every continuous function on with quadratic growth:
[TABLE]
where is a constant. A set is pre-compact under if and only if
[TABLE]
Lemma 2.7**.**
The family is a convex compact set in under the Wasserstein metric .
Proof.
The martingale property of is equivalent to the identity
[TABLE]
for every bounded and continuous function on . The convexity and the closedness of under readily follow. It only remains to be shown that is pre-compact under or, equivalently, that
[TABLE]
For we have that
[TABLE]
and then that
[TABLE]
Finally, we obtain that
[TABLE]
and the result follows. ∎
Lemma 2.8**.**
An optimal plan for (5) exists.
Proof.
Let be a sequence in such that
[TABLE]
By Lemma 2.7, is compact under . Hence, there is a subsequence that converges to under . Since the cost function is continuous and has quadratic growth, we deduce that
[TABLE]
Thus, is an optimal plan. ∎
The implication (a)(b) of Theorem 2.2 is proved in Lemma 2.10 and relies on the following first-order optimality condition.
Lemma 2.9**.**
Let be an optimal plan for (5). Then
[TABLE]
for every such that .
Proof.
We first establish (12) for a Borel probability measure on that has a bounded density with respect to :
[TABLE]
We choose a non-atom of and define the probability measure
[TABLE]
where is the Dirac measure concentrated at . For sufficiently small the probability measure
[TABLE]
is well-defined and has the same -marginal as . We define the conditional expectation and observe that the law of under belongs to . The optimality of for (5) or, equivalently, for (6) implies that
[TABLE]
Standard computations based on Bayes formula show that
[TABLE]
where , , and . Since for some constant , we deduce that and . It follows that
[TABLE]
In view of (13), the first-order term is negative. It can be written as
[TABLE]
and the result follows.
In the general case, where and , we use the approximation result from Appendix A. By Theorem A.1, there are Borel probability measures on that have bounded densities with respect to and converge to under . By what we have already proved,
[TABLE]
Since the integrands are continuous functions with quadratic growth, we can pass to the limit as and obtain (12). ∎
Lemma 2.10**.**
Let be an optimal plan for (5). Then condition (b) of Theorem 2.2 holds.
Proof.
Lemma 2.9 yields inequality (12) for the probability measure
[TABLE]
where , , and is the Dirac measure concentrated at , . Elementary computations show that for such (12) becomes (7). ∎
The equivalence of assertions (b) and (c) of Theorem 2.2 is a special case of Lemma 2.12, whose proof relies on the following geometric interpretation of (7). Figure 1 visualizes the arguments.
Lemma 2.11**.**
Let and , , be points in such that . Then (7) holds if and only if for all , , the graphs of the hyperbolas
[TABLE]
do not intersect:
[TABLE]
Proof.
The result follows from the identity:
[TABLE]
where and . ∎
Lemma 2.12**.**
For a set the following conditions are equivalent:
- (i)
If points and belong to , then (7) holds. 2. (ii)
There is such that
[TABLE]
Proof.
We observe first that under either (i) or (ii),
[TABLE]
Indeed, under (i) this inequality follows from (7), while under (ii) it holds because . We define the open sets
[TABLE]
where, for ,
[TABLE]
The boundaries of , , are, respectively, upper and lower envelopes of the graphs of increasing hyperbolas and thus, are maximal monotone sets.
By Lemma 2.11, item (i) holds if and only if the sets and are disjoint:
[TABLE]
On the other hand, item (ii) holds if and only if the closed set
[TABLE]
contains a maximal monotone set . As
[TABLE]
every such set separates and . Hence, its existence yields (14) and then (i). Conversely, if the sets and are disjoint, then their boundaries belong to . As the boundaries are maximal monotone sets, we obtain (ii). ∎
The remaining assertions of the theorem follow from Lemma 2.14. A key role is played by inequality (15).
Lemma 2.13**.**
Let and be its -marginal. For every we have that
[TABLE]
and then that
[TABLE]
Proof.
We only need to prove the inequality with conditional expectations. From Lemma B.1 we know that . As , we deduce that for every :
[TABLE]
and taking over a dense countable set of obtain the result. ∎
The following lemma completes the proof of the theorem.
Lemma 2.14**.**
Let , be its -marginal, and be such that
[TABLE]
Then in (16) we actually have the equality, is an optimal plan for (5), the set contains , and
[TABLE]
Proof.
We shall write for . From (15) and (16) we deduce that is a solution to (5), that in (16) we have an equality, and that
[TABLE]
Lemma B.1 states that and . It follows that . Being a closed set, contains . In particular, if , then . It follows that
[TABLE]
Accounting for (16), we deduce that
[TABLE]
Hence, for every we can find a sequence that converges to and such that , . Being a pointwise infinum of continuous functions, the function is upper semi-continuous. It follows that
[TABLE]
and we obtain (17). ∎
3 Dual problem
In view of Theorem 2.2 and Lemma 2.13, a natural dual problem to (5) is to
[TABLE]
Such problem appears in Rochet and Vila (1994) in connection to their study of Kyle-type equilibrium with insider; see Section 6. They use a direct method based on the properties of the space of closed graph correspondences and assume that has a compactly supported density.
We recall that , , and thus the family of all maximal monotone sets in is in one-to-one correspondence with the family of functions
[TABLE]
Hence, (18) is equivalent to the problem, where we
[TABLE]
A technical inconvenience of the set is the absence of convexity. It turns out that the set of functions dominated by the elements of is not only convex, but also admits a self-contained description related to item (b) of Theorem 2.2.
Lemma 3.1**.**
Let be a Borel function. Then for some if and only if
[TABLE]
The set of such functions is convex.
Proof.
The result follows directly from Lemma 2.12, where we take
[TABLE]
Clearly, the family of functions satisfying (19) is convex. ∎
Theorem 3.2**.**
Let . We have that
[TABLE]
where the lower and upper bounds are attained at respective solutions to (5) and (18). A probability measure and a maximal monotone set are such solutions if and only if
[TABLE]
In this case, contains the support of the -marginal of . Moreover, and are uniquely defined on , that is,
[TABLE]
for any other solution to (18). In particular, and are unique if .
Proof.
With an exception of the uniqueness part, all other assertions follow directly from Theorem 2.2 and Lemmas 2.13 and 2.14.
Let be a solution to (5), and be solutions to (18) and denote and . From (20) we deduce that the functions and coincide on , the projection of on -coordinates. Since every is the limit of a sequence , Lemma B.3 yields that
[TABLE]
We have proved the uniqueness of on . The uniqueness of on holds as . ∎
4 Optimal maps
For simplicity of notations, we slightly modify the setup. We start with a 2-dimensional random variable having a finite second moment: . As usual, we identify random variables that differ only on a set of measure zero. Our goal is to
[TABLE]
for the same cost function and the domain
[TABLE]
We denote and observe that for every . Thus, optimal plan problem (5) may be viewed as a Kantorovich-type relaxation of optimal map problem (21). In general,
[TABLE]
and the inequality may be strict and an optimal map may not exist as Examples 5.2 and 5.3 show.
The main results of this sections are Theorems 4.1, 4.5, and 4.6. Theorem 4.1 yields the equality in (22) provided that is atomless. Theorem 4.5 shows the existence of an optimal map if is -regular in the sense of Definition 4.4. Theorem 4.6 establishes the uniqueness of optimal plan and map if, in addition, every component has a continuous distribution function. The last two theorems play a key role in the study of equilibrium in Section 6.
We shall use the notations from Appendix B related to the function , where . In particular, stands for the differential operator associated with the cost function :
[TABLE]
where is the set of points where is differentiable. We denote by the union of the vertical and horizontal line segments of :
[TABLE]
Clearly, the sets are countable. Finally, we define
[TABLE]
The following result is similar to that of Pratelli (2007) obtained for the classical unconstrained optimal transport problem.
Theorem 4.1**.**
Let and suppose that is atomless. Then plan and map problems (5) and (21) have identical values:
[TABLE]
The proof of the theorem relies on some lemmas.
Lemma 4.2**.**
Let , be an optimal plan for (5), and be a maximizer for (18). If , then the probability measure
[TABLE]
has the martingale property: .
Proof.
We write for . We shall show that , that is, that
[TABLE]
for every bounded Borel function on . The martingale property for the second coordinate has a similar proof.
Let be the union of the horizontal line segments of . If , then Theorem 3.2 yields that and . If, in addition, , then and, subsequently, . Hence, (24) holds if
[TABLE]
Hereafter, we fix . Let . If , then and thus, . Conversely, if , the relative interior of , then Lemma B.4 yields that and then, as , that . Hence,
[TABLE]
where and are the boundary points of such that . Accounting for the martingale property of , we obtain that
[TABLE]
Let be such that . If , then Lemma B.4 yields that . If , then and thus, . Finally, if , then . It follows that
[TABLE]
where at the last step we used the martingale property of . The case of the left boundary is similar. We have proved (25). ∎
Lemma 4.3**.**
Let and and be random variables such that takes values in , , and . If the law of is atomless, then for every there is a random variable such that , , and is -measurable.
Proof.
We fix and denote , , and
[TABLE]
Theorems B.6 and B.12 show that , where is either a point or the graph of a strictly decreasing function. Of course, we can choose the sets so that
[TABLE]
For every we shall construct a two-dimensional random variable and a Borel function such that
[TABLE]
Given the sequence of such pairs , , we define
[TABLE]
where . We have that and . Moreover, in view of Theorem B.6, . Hence, (26) is all we need to obtain.
Using the conditional probabilities with respect to events , we can reduce the general case to the situation where
[TABLE]
for some strictly decreasing function . Since is atomless, every component has a continuous distribution function , . It follows that has the uniform distribution on and , where is the pseudo-inverse function to :
[TABLE]
In particular, is -measurable.
Lemma B.16 yields Borel functions , , such that either or . As the functions
[TABLE]
are continuous and increasing, the random variable
[TABLE]
has the uniform distribution on . Clearly, and the indicators are -measurable. It follows that and are also -measurable. Setting
[TABLE]
we obtain that has the same law as , that , and that is -measurable. ∎
Proof of Theorem 4.1.
Let be an optimal plan for (5). By extending, if necessary, the underlying probability space we can assume that for some random variable . As , we have that . Theorem 2.2 yields such that and .
We denote and observe that is another optimal plan. Indeed, by Lemma 4.2,
[TABLE]
and therefore, for a bounded Borel function on ,
[TABLE]
It follows that and thus, . By the construction of , we have that and the optimality of follows. This fact allows us to assume from the start that . Then, and satisfy the assumptions of Lemma 4.3.
Let and be the random variable yielded by Lemma 4.3. As is -measurable, the conditional expectation is also -measurable. Thus, there is a Borel function such that . Since and have identical laws, . As
[TABLE]
we deduce that
[TABLE]
The result follows, because is any positive number. ∎
Let be the family of graphs of strictly decreasing functions defined on closed intervals of such that both and its inverse are Lipschitz functions:
[TABLE]
for some constant . To make statements shorter we allow for a degenerate case where the domain of is just a point. Thus, .
Definition 4.4**.**
A Borel probability measure on is -regular if , .
The following theorem establishes the existence of optimal maps that induce optimal plans.
Theorem 4.5**.**
Let and suppose that is -regular. Let be a maximizer for (18) and denote and . Then
[TABLE]
is an optimal map for (21), is an optimal plan for (5), and the law of is -regular. Moreover, if is an optimal map and is an optimal plan, then
[TABLE]
Proof.
Let be an optimal plan. From Theorem 3.2 we deduce that if , then . In particular, . By Theorem B.12, the exception set belongs to the union of and of a countable family of sets from . Since is -regular, we have that
[TABLE]
It follows that the random variable is well-defined.
Theorem B.6 shows that if , then is the only element of . It follows that and . In view of Theorem 3.2, is an optimal plan if it has the martingale property: .
If and , then Theorems 3.2 and B.6 yield that . Since and have common -marginal satisfying (29), they coincide outside of , that is, (28) holds.
Let be a bounded Borel function on . As , we deduce that
[TABLE]
where the last two equalities follow from the martingale property of and Lemma 4.2, respectively. Thus, . We have proved that is an optimal plan and, in particular, that is an optimal map. The uniqueness property (27) for optimal maps follows directly from the corresponding property (28) for optimal plans.
It only remains to be shown that is -regular. As and the intersection of with any set from is a point, is -regular if and only if it is atomless. Assume to the contrary, that for some and define the Borel probability measure
[TABLE]
Being -regular, the measure is atomless. Hence,
[TABLE]
From the optimality of we deduce that
[TABLE]
and then that . The martingale property of yields that . The last two properties of and the fact that outside of imply the existence of such that
[TABLE]
By Lemma B.5, belongs to the graph of a strictly decreasing linear function and thus, belongs to . As is -regular, we arrive to a contradiction: . ∎
We now state the main uniqueness result of the paper, which can be viewed as an adaptation of the classical Brenier theorem to our setting. We point out that our regularity assumption on is weaker than the standard condition of the Brenier theorem, which requires to assign zero mass to rotations of the graphs of Lipschitz functions.
Theorem 4.6**.**
Let and suppose that is -regular and the (one-dimensional) laws of and are atomless. Let be a maximizer for (18) and denote . Then or, in more detail,
[TABLE]
is the unique optimal map for (21) and the law of is the unique optimal plan for (5). Moreover, the law of is -regular and the laws of and are atomless.
Proof.
We omit from the notations (23) related to its vertical and horizontal line segments. As the law of is atomless and the set is countable, we deduce that
[TABLE]
Except the continuity of the distribution functions for and , all other assertions follow directly from Theorem 4.5.
We shall prove that the law of is atomless. If , then the set is a singleton: . By Theorem 4.5, the law of is -regular and, in particular, atomless. It follows that .
Let . Lemma B.4 shows that if and , then and subsequently, . As , , and the law of is atomless, we obtain that
[TABLE]
where the last step holds by the continuity of the law of . ∎
5 Examples
Example 5.1** (Linear optimal map).**
Let be a random variable in such that , , and
[TABLE]
The latter property holds if the distribution of is Gaussian or, more generally, elliptically contoured.
We denote and define and
[TABLE]
Elementary computations show that and
[TABLE]
Being the graph of an increasing linear function, . Setting , we deduce from Theorem 3.2 that and are respective solutions to (18) and (5). Moreover, as is the only element of such that , the characteristic property (20) yields that is the unique optimal plan. In particular, is the unique optimal map for (21).
Example 5.2** (Optimal map may not yield optimal plan).**
Let be a random variable taking values in , , and with probability . Direct computations show that the points
[TABLE]
belong to the set , that
[TABLE]
and that the probability measure
[TABLE]
belongs to , where . Being the graph of an increasing hyperbola, . By Theorem 2.2, is an optimal plan for (5). The value of this problem is
[TABLE]
On the other hand, let , that is, is -measurable and . We write , . If all are distinct, then and . If they are the same point, then and
[TABLE]
Finally, if precisely two of the elements of coincide: , where is a permutation of , then , , and
[TABLE]
As and , the value function of the optimal map problem (21) is given by , which is strictly less than , the value of the optimal plan problem (5).
Example 5.3** (Optimal map may not exist).**
Let and be independent symmetric random variables in with having a continuous distribution function and taking values in . We define a 2-dimensional random variable
[TABLE]
The components and have continuous distribution functions and, in particular, is atomless. By Theorem 4.1, the plan and map problems (5) and (21) have identical values. We shall prove that there is a unique optimal plan, which is not induced by a (-measurable martingale) map, and hence, shall show that an optimal map does not exist.
To this end, we define a 2-dimensional random variable
[TABLE]
We observe that takes values in the set
[TABLE]
consisting of two upward-slopping lines and thus, belonging to . Direct computations show that and
[TABLE]
By Theorem 2.2, the law of is an optimal plan and is a dual maximizer. We shall proceed to show that this is the only optimal plan and that it is not induced by a map from .
From the construction of we deduce the equality of the sets:
[TABLE]
It follows that
[TABLE]
Hence, is not -measurable.
Let be an optimal plan and be its -marginal. By Theorem 3.2, and
[TABLE]
The random variable takes values in , where
[TABLE]
Elementary computations show that for the set of such that consists of two points and such that
[TABLE]
For , , the three points are distinct and
[TABLE]
On the other hand, by the martingale property of and the fact that , we have that
[TABLE]
and therefore, the conditional probabilities
[TABLE]
For a bounded Borel function on we then obtain that
[TABLE]
Hence, is unique if and only if is unique. We observe now that the map is one-to-one. Thus, for a Borel set ,
[TABLE]
and the uniqueness of follows.
6 Equilibrium with insider
We consider a single-period financial market. There are a bank account with zero interest rate and a stock. The stock value at maturity is represented by a random variable . The stock price at initial time is the result of the interaction between noise traders, an insider, and market makers, where
The noise traders place an order for stocks; is a random variable. 2. 2.
The insider knows the value of both and and places an order for stocks. The trading strategy is a -measurable random variable. 3. 3.
The market makers observe only the total order . They quote the price according to a pricing rule , which is a Borel function .
Definition 6.1**.**
An equilibrium is defined by a trading strategy and a pricing rule such that
Given the total order , the price is efficient in the sense that
[TABLE] 2. 2.
Given the pricing rule , the order maximizes insider’s profit:
[TABLE]
with the convention .
Remark 6.2*.*
Up to minor technical differences, our notion of equilibrium coincides with the one in Rochet and Vila (1994). It differs from the classical equilibrium from Kyle (1985) in the ability of the insider to observe noise traders’ order flow . In the model of Kyle (1985), the insider maximizes over all -measurable random variable .
The following result links the existence of equilibrium with the existence of an optimal map for (21) that induces an optimal plan for (5).
Theorem 6.3**.**
Let and denote . An equilibrium exists if and only if there is an optimal map for (21) such that the law of is an optimal plan for (5). Insider’s profit is unique and given by
[TABLE]
where is a maximizer for (18).
Moreover, there are equilibrium and optimal map such that the pricing rule is an increasing function, the total order , and the price .
We divide the proof of the theorem into lemmas.
Lemma 6.4**.**
Let , , and be an optimal map for (21) such that is -measurable and the law of is an optimal plan for (5). Then there is an increasing function such that and is an equilibrium with .
Proof.
By construction, . Hence, we only need to verify the profit maximization condition for the order . Theorem 2.2 yields such that and
[TABLE]
Let be the projection of on the first or -coordinate. Clearly, is an interval. As is -measurable, there is an increasing function on such that and for . By construction,
[TABLE]
We now extend to an increasing function from to by setting its values to on the left and to on the right of . As , Lemma B.2 yields that takes values in the closure of . Under the standing convention: , we obtain that
[TABLE]
Hence, is an equilibrium with . ∎
Lemma 6.5**.**
Let be a Borel function and
[TABLE]
with the convention: . Then there is such that .
Proof.
Given and , we denote and deduce that
[TABLE]
where in the middle we used the negative parts to account for the possibility that . The result now follows from Lemma 3.1. ∎
Lemma 6.6**.**
Let , , and be an equilibrium with the total order and the price . Then with is an optimal map for (21), the law of is an optimal plan for (5), and
[TABLE]
Proof.
From the definition of the equilibrium we obtain that
[TABLE]
where , . We claim that
[TABLE]
As the integrability properties of are unknown, we use a localization argument. For from the martingale properties and we deduce that
[TABLE]
Taking the limit as , we obtain (31) by the dominated convergence theorem.
Lemma 6.5 yields such that . From Lemma 2.13 we deduce that
[TABLE]
In view of (31) and since belongs to , we obtain that
[TABLE]
It follows that is an optimal plan, is an optimal map, and . Finally, Theorem 3.2 yields that
[TABLE]
and we obtain (30). ∎
Proof of Theorem 6.3.
If is an optimal map, then with is an optimal map as well and is -measurable. By Theorem 3.2,
[TABLE]
for every maximizer to (18). In particular, is the same random variable for every optimal map . After these observations, the proof follows from Lemmas 6.4 and 6.6. ∎
We now state sufficient conditions for the existence and uniqueness of equilibrium. Theorem 6.7 generalizes a result from Rochet and Vila (1994), where the distribution of has a compact support and a continuous density.
Theorem 6.7**.**
Let and suppose that the law of is -regular. Then an equilibrium exists.
If, in addition, the laws of and are atomless, then insider’s order , the total order , and the price are unique. Moreover, is the unique optimal map for (21) and is the unique optimal plan for (5).
For the proof we need a lemma.
Lemma 6.8**.**
Let be a Borel function and
[TABLE]
with the convention: . There is a countable set such that if and , then
[TABLE]
Proof.
If , then
[TABLE]
Clearly,
[TABLE]
Thus, if the increasing functions and are continuous and strictly increasing at , then . To conclude the proof we just observe that the set of arguments, where an increasing function is discontinuous, and the set of values, where it is not strictly increasing, are countable. ∎
Proof of Theorem 6.7.
If is -regular, then Theorem 4.5 yields an optimal map such that the law of is an optimal plan. By Theorem 6.3, there is an equilibrium .
If the laws of and are atomless, then, by Theorem 4.6, the optimal map is unique. Lemma 6.6 shows that is the unique equilibrium price: , where .
Let be the function defined in Lemma 6.8 and be a maximizer for (18). From the definition of the equilibrium and Theorem 6.3 we deduce that
[TABLE]
If , then the total order is clearly unique. If , then by Lemma 6.8 and the continuity of the distributions of and . Thus, the total order and insider’s order are unique. By Theorem 6.3, the uniqueness of and implies that is an optimal map. Hence . ∎
Appendix A Closure of probability measures with bounded densities in
Let . We denote by the space of Borel probability measures on with finite -th moments equipped with the Wasserstein metric:
[TABLE]
where is the family of Borel probability measures on with -marginal and -marginal . We recall that is a complete separable metric space and that in if and only if for every continuous function with polynomial -th growth:
[TABLE]
Let and be the family of Borel probability measures on that have bounded densities with respect to :
[TABLE]
Clearly, . The following result, used in the proof of our main Theorem 2.2, describes the closure of under .
Theorem A.1**.**
Let and . Then the closure of in has the form:
[TABLE]
Proof.
If in , then weakly and thus,
[TABLE]
for every closed set . In particular, if , then
[TABLE]
and hence, . It follows that is closed in . Clearly, is convex.
If is compact, then restricted to the convergence under is equivalent to the weak convergence and thus, the family of probability measures in with finite support is dense. Being a closed convex set, is then the closure of in if and only if every Dirac measure concentrated at is the weak limit of a sequence . The sequence with
[TABLE]
where is the ball of radius centered at , has the required properties.
If is not compact, we approximate by the sequence given by
[TABLE]
for some . We have that and under . By what we have already proved, each belongs to the closure of in , where
[TABLE]
As , , we deduce that the sequence belongs to the closure of in . Same property holds for its -limit and the result follows. ∎
Appendix B Properties of the function
Let be a maximal monotone set: . In this appendix, we collect the properties of the function
[TABLE]
used throughout the paper.
Lemma B.1**.**
The function and its -conjugate
[TABLE]
take values in , , and
[TABLE]
Proof.
Let . As , we have that , , and thus, . Conversely, if , then the maximal monotone set crosses the interior of either the upper-left or the lower-right quadrants relative to . If belongs to such intersection, then
[TABLE]
We have shown that on and on . It follows that
[TABLE]
If , then and thus, . Conversely, if , then , , and therefore, . ∎
We associate with the closed convex function
[TABLE]
Clearly, and have same domains:
[TABLE]
For a convex set we denote by , , , and its respective closure, interior, relative interior and relative boundary.
Lemma B.2**.**
The domain of is convex. If is either horizontal or vertical line, then . Otherwise, has a non-empty interior:
[TABLE]
where is the projection of on -coordinate, . If , then the relative interiors of the horizontal and vertical parts of containing also belong to .
Proof.
Being convex, the function has convex domain. As , the domain of is also convex.
We observe that is either a point or an interval. If , then is a vertical line: . For we have that and
[TABLE]
Thus, . The case where is a point and thus, is a horizontal line is identical.
We assume now that , where . If , then the set is bounded and therefore,
[TABLE]
Conversely, suppose that does not belong to the closure of , say ; other cases are covered similarly. Then and hence,
[TABLE]
We have proved (32).
For the last assertion of the lemma, we assume that and take and with . Given that , we have to show that . Indeed, otherwise there is a sequence such that
[TABLE]
Since , the sequence is bounded and . It follows that
[TABLE]
and we obtain a contradiction. ∎
The closed convex function is lower semi-continuous on and is continuous on the interior of its domain. The following result shows that and are continuous relative to their full domains.
Lemma B.3**.**
If and , then and .
Proof.
It is sufficient to consider the case of the function and take . If , then the result holds by the lower semi-continuity:
[TABLE]
Thus, we assume that or, equivalently, that . By Lemma B.2, the relative interiors of the horizontal and vertical parts of containing belong to . Hence, there is a closed triangle in that contains , for sufficiently large . Being convex, the function is continuous on this triangle and the result follows. ∎
We define a multi-valued function
[TABLE]
taking values in the closed (possibly empty) subsets of , and denote
[TABLE]
Let , , be the union of vertical and horizontal line segments of ; see (23). As the set is fixed, we write simply
[TABLE]
The following lemma shows that for the set can intersect only at . We denote , the scalar product of .
Lemma B.4**.**
Let and . If and , then belongs to the boundary of and
[TABLE]
Proof.
Without loss of generality we can assume that and that stays above . Then the increasing hyperbola
[TABLE]
contains and lays below , which is only possible if is the right boundary of the horizontal line segment . In this case,
[TABLE]
and the result follows. ∎
For we denote by the line segment connecting and :
[TABLE]
Lemma B.5**.**
Let and belong to and stay above and below , respectively. If , then belongs to the line segment connecting and .
Proof.
The conditions of the lemma imply that the increasing hyperbolas
[TABLE]
contain and stay below and above , respectively. Hence, they have identical tangent lines at . Elementary computations show that the slope of the tangent line is given by
[TABLE]
and the result follows. ∎
For the derivative is defined in the classical sense. For the derivative exists if it is the limit: , for every sequence that converges to . We write
[TABLE]
By we denote the differential operator associated with the cost function :
[TABLE]
Finally, let be the union of the vertical and horizontal line segments of and denote
[TABLE]
We observe that
[TABLE]
Theorem B.6**.**
We have that
[TABLE]
Conversely, the set difference has at most two points and these points belong to different linear parts of . If , then is the only element of and
[TABLE]
We divide the proof of the theorem into lemmas. We write if , . If and , then denotes the segment of bounded by and :
[TABLE]
Lemma B.7**.**
Let . Then if and only if . In this case, is the only element of .
Proof.
From the structure of in Lemma B.2 we deduce the existence of such that , , and , where is the rectangle with the diagonal . Every such that belongs to . Hence,
[TABLE]
As is compact, is non-empty. If , then
[TABLE]
It follows that belongs to , the subdifferential of at . Differentiability of (equivalently, of ) at then implies that is a singleton and
[TABLE]
Conversely, let be the only element of and be such that . We have to show that . We take a unit vector in and define a sequence in such that
[TABLE]
Let be an index such that , . By the first part of the proof, for the set is non-empty and belongs to the compact . Moreover, if then . It follows that
[TABLE]
and then that . As is the only element of and
[TABLE]
every convergent subsequence of goes to and then . Hence, and, as is an arbitrary unit vector in , we obtain that . ∎
Lemma B.8**.**
Let and . Then
[TABLE]
and , . The slope of the line segment is negative and has the form:
[TABLE]
Proof.
We fix and denote . From the description of in Lemma B.2 we deduce that . Without loss in generality we can assume that . Then the hyperbola
[TABLE]
contains and stays below , while the hyperbola
[TABLE]
contains and stays below . It follows that is the only element of . Lemma B.7 yields that . The last part of the lemma follows directly from the definition of and the fact that . ∎
The following corollary of Lemma B.8 will also be used in the proof of Theorem B.12.
Lemma B.9**.**
Let and be distinct points in and , . Then either or the line segments and do not intersect.
Proof.
If and have common interior point , then Lemma B.8 yields that . ∎
Lemma B.10**.**
Let . Then is the only element of .
Proof.
In view of Lemma B.7 we can further assume that . Let be a sequence in that converges to . By Lemma B.7, is the only element of . From the construction of on and Lemma B.3 we deduce that
[TABLE]
Hence, . On the other hand, if , then Lemma B.8 allows us to choose the sequence so that . Hence, . ∎
Lemma B.11**.**
The set difference has at most two points and these points belong to different linear parts of .
Proof.
From Lemma B.2 we deduce that , where and is the interior of the projection of on the -coordinate. Without loss of generality we can assume that . Let and be such that , and , . We are going to show that . By doing so, we shall prove that the interior of each linear part of has at most one element of .
Let be a sequence in that converges to . Then and there is such that . In view of Lemma B.7, is the only element of . If , then stays strictly above the line segment and, as , we can assume that same property holds for . By Lemmas B.8 and B.10, the line segment has negative slope and can intersect only at . It follows that belongs to the compact set . Continuity of from Lemma B.3 yields that any convergent subsequence of goes to the unique . Hence, and , by the definition of on .
Similar arguments show that if the “corner” point and there are that belong to the interiors of different linear parts of , then . ∎
Proof of Theorem B.6.
From Lemmas B.7 and B.10 we deduce that
[TABLE]
Lemma B.2 shows that the boundary of is contained in the union of two lines and that each of these lines is either vertical or horizontal. It follows that
[TABLE]
and we obtain (34). Lemma B.11 states the structure of . Let . Accounting for (33) we deduce that
[TABLE]
Lemmas B.7 and B.10 now yield that is the only element of . Finally, identity (35) holds by the definition of . ∎
We recall that denotes the family of graphs of strictly decreasing functions defined on closed intervals of such that and its inverse are Lipschitz functions. We allow for a degenerate case where the domain of is just a point. Thus, .
Theorem B.12**.**
The exception set
[TABLE]
where is a countable union of sets in and is the union of horizontal and vertical line segments of .
We divide the proof into lemmas. For and the points in , we denote by the closed curved triangle bounded by the line segments , , and the segment of ; see Figure 2. If , then ; otherwise .
Lemma B.13**.**
Let be distinct points in , let be in , and denote , .
- (a)
If , then . 2. (b)
If and , then the intersection of and is at most one point, which is then either or .
Proof.
If either (a) or (b) fails to hold, then there are line segments , , that intersect only at an interior point. We obtain a contradiction with Lemma B.9. ∎
Lemma B.13 (a) yields a partial order relation on : if for some in .
Lemma B.14**.**
If belong to and , then
[TABLE]
that is, is the graph of a strictly decreasing function on such that and its inverse are Lipschitz functions.
Proof.
We illustrate the proof on Figure 2. Without loss of generality we can assume that and are distinct points that stay above . Let be in . We have that . If belongs to the line segment , then Lemma B.8 yields that
[TABLE]
and then that . Same lemma shows that the line segment has a negative slope and thus, belongs to . The case, where is identical.
Hereafter, we assume that or, equivalently, that . Being a chord of the concave hyperbola
[TABLE]
which touches from below, the line segment stays below . It follows that belongs to the interior of the triangle with vertices . Hence, there are unique and such that the line segments and intersect at .
We observe that the convex polygon with the vertices contains every such that and thus, contains . Being convex, is bounded on . Hence, is bounded on as well. Moreover, as stays away from , same boundedness property holds for . If , then Lemmas B.9 and B.10 show that belongs to the union of and . In particular, and then also are bounded on . From Lemma B.8 we deduce the existence of negative constants and such that
[TABLE]
Let be distinct. Lemma B.13 yields that either or . Assuming that we deduce the existence of such that and . The slope of is then bounded from below by the slope of and from above by the slope of , and thus is bounded in between by and :
[TABLE]
Hence, the set has the required Lipschitz properties.
It remains to be shown that the set is connected or, equivalently, that for every pair of distinct points in there is , which is different from and and such that . Without loss of generality we can take and . We shall find the required in .
Let , . From the non-intersection property of Lemma B.9 and the continuity of on its domain, we deduce that
If and , then , . 2. 2.
If is such that and , , then .
Similar properties (with obvious modifications in the first item) hold when is replaced with . These properties readily yield the unique such that intersects with both and . Clearly, is different from both and and , thanks to Lemma B.13. ∎
Lemma B.15**.**
The set
[TABLE]
if not empty, is a countable union of sets in . More precisely,
[TABLE]
for some in , .
Proof.
Clearly, , where
[TABLE]
Let . We denote by the set of minimal elements of with respect to the order relation . In other words, if any such that coincides with . From Lemma B.13 we deduce that is countable. Let . If is not a minimal element, then there is such that , . Being contained in for some in , the set is bounded. By the continuity of , this set is closed and hence, contains some . It follows that
[TABLE]
Finally, for , Lemmas B.13 and B.14 show that is the graph of a strictly decreasing function such that and are locally Lipschitz. The result readily follows. ∎
Proof of Theorem B.12.
By Theorem B.6 representation (36) holds if we add to the set given by (37) at most 2 points. Lemma B.15 yields the result. ∎
Lemma B.16**.**
Let be given by (37) and
[TABLE]
Then is countable and there are Borel functions , , such that
[TABLE]
Proof.
In view of Lemma B.15, it is sufficient to prove the result for the sets and , where in . Let be distinct elements of . The functions
[TABLE]
map to and are monotone with respect to the order relations on and on . Thus, their respective sets of discontinuities are countable. From Lemma B.13 we deduce that and from the continuity of that , . The proof readily follows. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ambrosio and Gigli (2013) Luigi Ambrosio and Nicola Gigli. A user’s guide to optimal transport. In Modelling and optimisation of flows on networks , volume 2062 of Lecture Notes in Math. , pages 1–155. Springer, Heidelberg, 2013. doi: 10.1007/978-3-642-32160-3˙1 . URL https://doi.org/10.1007/978-3-642-32160-3_1 . · doi ↗
- 2Beiglböck and Juillet (2016) Mathias Beiglböck and Nicolas Juillet. On a problem of optimal transport under marginal martingale constraints. Ann. Probab. , 44(1):42–106, 2016. ISSN 0091-1798. doi: 10.1214/14-AOP 966 . URL https://doi.org/10.1214/14-AOP 966 . · doi ↗
- 3Beiglböck et al. (2017) Mathias Beiglböck, Marcel Nutz, and Nizar Touzi. Complete duality for martingale optimal transport on the line. Ann. Probab. , 45(5):3038–3074, 2017. ISSN 0091-1798. doi: 10.1214/16-AOP 1131 . URL https://doi.org/10.1214/16-AOP 1131 . · doi ↗
- 4Ghoussoub et al. (2019) Nassif Ghoussoub, Young-Heon Kim, and Tongseok Lim. Structure of optimal martingale transport plans in general dimensions. Ann. Probab. , 47(1):109–164, 2019. ISSN 0091-1798. doi: 10.1214/18-AOP 1258 . URL https://doi.org/10.1214/18-AOP 1258 . · doi ↗
- 5Henry-Labordère and Touzi (2016) Pierre Henry-Labordère and Nizar Touzi. An explicit martingale version of the one-dimensional Brenier theorem. Finance Stoch. , 20(3):635–668, 2016. ISSN 0949-2984. doi: 10.1007/s 00780-016-0299-x . URL https://doi.org/10.1007/s 00780-016-0299-x . · doi ↗
- 6Kyle (1985) Albert S. Kyle. Continuous auctions and insider trading. Econometrica , 53:1315–1335, 1985.
- 7Pratelli (2007) Aldo Pratelli. On the equality between Monge’s infimum and Kantorovich’s minimum in optimal mass transportation. Ann. Inst. H. Poincaré Probab. Statist. , 43(1):1–13, 2007. ISSN 0246-0203. doi: 10.1016/j.anihpb.2005.12.001 . URL https://doi.org/10.1016/j.anihpb.2005.12.001 . · doi ↗
- 8Rochet and Vila (1994) Jean-Charles Rochet and Jean-Luc Vila. Insider Trading without Normality. The Review of Economic Studies , 61(1):131–152, 01 1994. ISSN 0034-6527. doi: 10.2307/2297880 . URL https://doi.org/10.2307/2297880 . · doi ↗
