This paper characterizes when multiple probability measures and distributions are compatible by using heterogeneity orders and extends the results to stochastic processes, with applications to portfolio selection under constraints.
Contribution
It introduces a new heterogeneity order criterion for compatibility of measures and distributions, generalizes to stochastic processes, and applies to portfolio optimization problems.
Findings
01
Compatibility depends on heterogeneity comparison of measures and distributions.
02
Compatibility characterized by dominance in heterogeneity order via convex order.
03
Results extend to stochastic processes with practical portfolio applications.
Abstract
In this paper, we characterize compatibility of distributions and probability measures on a measurable space. For a set of indices J, we say that the tuples of probability measures (Qiβ)iβJβ and distributions (Fiβ)iβJβ are {compatible} if there exists a random variable having distribution Fiβ under Qiβ for each iβJ. We first establish an equivalent condition using conditional expectations for general (possibly uncountable) J. For a finite n, it turns out that compatibility of (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) depends on the heterogeneity among Q1β,β¦,Qnβ compared with that among F1β,β¦,Fnβ. We show that, under an assumption that the measurable space is rich enough, (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) are compatible if and only if (Q1β,β¦,Qnβ) dominates (F1β,β¦,Fnβ) in a notion ofβ¦
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization Β· Stochastic processes and financial applications Β· Economic theories and models
Full text
Distributional Compatibility for Change of Measures
Jie Shen
Department of Statistics and Actuarial Science, University of Waterloo, Canada
Yi Shen
Department of Statistics and Actuarial Science, University of Waterloo, Canada
Bin Wang
RCSDS, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, China
Ruodu Wang
Department of Statistics and Actuarial Science, University of Waterloo, Canada
(26thΒ February 2024)
Abstract
In this paper, we characterize compatibility of distributions and probability measures on a measurable space.
For a set of indices J, we say that the tuples of probability measures (Qiβ)iβJβ and distributions (Fiβ)iβJβ are compatible if there exists a random variable having distribution Fiβ under Qiβ for each iβJ.
We first establish an equivalent condition using conditional expectations for general (possibly uncountable) J.
For a finite n, it turns out that compatibility of (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) depends on the heterogeneity among Q1β,β¦,Qnβ compared with that among F1β,β¦,Fnβ.
We show that, under an assumption that the measurable space is rich enough, (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ)
are compatible if and only if (Q1β,β¦,Qnβ) dominates (F1β,β¦,Fnβ) in a notion of heterogeneity order, defined via multivariate convex order between the Radon-Nikodym derivatives of (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) with respect to some reference measures.
We then proceed to generalize our results to stochastic processes, and conclude the paper with an application to portfolio selection problems under multiple constraints.
Keywords: change of measure, compatibility, heterogeneity order, optimization.
1 Introduction
1.1 The main problem
Change of probability measures is found ubiquitous in problems where multiple probability measures appear, with extensive theoretical treatment and applications in the fields of probability theory, statistics, decision theory, simulation, and finance.
A key feature of a change of measure is that the distribution of a random variable is transformed to another one, and this serves many theoretical as well as practical purposes, such as in the modification of a Brownian motion drift (e.g.Β [21]) or in importance sampling (e.g.Β [24; 14]).
In view of this, a question seems natural to us: how much would the distribution change?
We formulate this question below.
Question (A) arises naturally if one has statistical (distributional) information about a random variable X under P, but yet she is concerned about the behaviour of X under another measure Q.
A general version of question (A), the vocal focus of this paper, is the following.
Question (B) is henceforth referred to as the compatibility problem for the n-tuples of measures (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ).
We give an analytical answer to question (B), and hence (A).
More generally, we also address the compatibility of two infinite collections of measures.
Before describing our findings, let us look at a few intuitive cases of (B).
Suppose that (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) are compatible, that is, (B) has an affirmative answer.
In case that Q1β,β¦,Qnβ are identical, it is clear that the respective distributions of a random variable under each Qiβ, i=1,β¦,n are the same; thus F1β=β―=Fnβ. In case that Q1β,β¦,Qnβ are mutually singular, the respective distributions of a random variable under Qiβ, i=1,β¦,n can be arbitrary.
In case that F1β,β¦,Fnβ are mutually singular measures on (R,B(R)), Q1β,β¦,Qnβ have to be also mutually singular.
From the above observations, it then seems natural to us that whether (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) are compatible depends on the heterogeneity (in some sense) among Q1β,β¦,Qnβ compared to that of F1β,β¦,Fnβ. More precisely, Q1β,β¦,Qnβ need to be more heterogeneous than F1β,β¦,Fnβ to allow for compatibility.
The main objective of this paper, question (B), has several deep connections to fundamental problems in finance and economics. We summarize some notable relevant points below111We thank Marcel Nutz for suggesting the second and the third connections, and Fabio Maccheroni for helpful discussions leading to the fourth connection..
Risk assessment under multiple scenarios. In the evaluation of capital requirement for market risks, one often needs to assess risk models under different probability measures, e.g.Β stressed and non-stressed scenarios.
The evaluation of a risk would then be a combination of distributions obtained under various scenarios.
For a theoretical treatment of this approach and its relation to the Fundamental Review of the Trading Book, we refer to [27].
A natural question in this context is whether one can find a risk model (represented by a random variable or a stochastic process) that has specified distributions under corresponding scenarios.
For instance, one may be interested in simulating from a risk model which has a specific dynamic under a non-stressed scenario and
another dynamic (e.g.Β with different parameters) under a stressed scenario.
The existence of such a risk model is precisely question (B); see Section 4.2 for results on Brownian motions.
For some other questions in the same spirit, we refer to [11; 12] where the authors address a few questions on the existence of certain models satisfying given constraints, which are raised by practitioners from the financial industry.
In Section 5, we present a portfolio selection problem with constraints under multiple scenarios.
Simultaneous mass transport. By definition, question (B) is equivalent to the existence of a Monge mass transport from Qiβ to Fiβ for all i=1,β¦,n simultaneously.
Optimal mass transport is an active topic with various applications in mathematical finance, in particular in the calculation of model-independent bounds;
we refer to [16], [5; 7] and [6] for recent advances.
In the study of optimal transport, one typically looks at an optimal transport for one pair of measures.
The existence of such a transport is trivial for one pair of measures, which corresponds to question (B) for n=1.
In this paper, we deal with the case n>1, and thus simultaneous mass transport. Existence is no longer a trivial issue, and it has to be studied before one could discuss optimality.
Admittedly, we are not aware of immediate applications of simultaneous mass transport in finance. Nevertheless, this paper serves as a starting point for future studies in this direction.
Consequentialism in decision theory. The fourth connection is found in decision theory.
An Anscombe-Aumann act ([1]) is a vector of distributions, resulting from a lottery (a random variable) under a set of beliefs (a collection of probability measures Q1β,β¦,Qnβ).
Question (B) is equivalent to the existence of a given Anscombe-Aumann act for a pre-specified set of beliefs.
Decision theorists often study preferences over the set of all Anscombe-Aumann acts without specifying the measures Q1β,β¦,Qnβ, and this is referred to as an axiom of consequentialism (see e.g.Β [2]).
Such an approach assumes that any choice of an act always exists, which is guaranteed by assuming the mutual singularity of (Q1β,β¦,Qnβ)
(Proposition 3.7 (iv) and Theorem 3.17). However, mutual singularity is not the case for many parametric models of beliefs.
As such, the results in our paper are helpful to a better understanding of the decision-theoretical framework of consequentialism.
We first define the main concept of this paper, compatibility problem for the two groups of measures (Qiβ)iβJββM1β and (Fiβ)iβJββF, where J is a possibly infinite set of indices.
We note that F is the distribution of X under Q if and only if F=QβXβ1.
Below we establish our first result, which leads to an equivalent condition for compatibility of (Qiβ)iβJββM1β and (Fiβ)iβJββF.
X* has distribution Fiβ under Qiβ for iβJ.*
2. (ii)
For all QβM1β dominating (Qiβ)iβJβ, the probability measure F=QβXβ1 dominates (Fiβ)iβJβ, and for all iβJ,
[TABLE]
3. (iii)
For some QβM1β dominating (Qiβ)iβJβ, the probability measure F=QβXβ1 dominates (Fiβ)iβJβ, and (2.1) holds.
Proof.
(i)β(ii): By definition, X is such that Qiβ(XβA)=Fiβ(A) for AβB(R) and iβJ. Let QβM1β such that QiββͺQ, iβJ. For any AβB(R), if F(A)=0, then Q(XβA)=0. Since QiββͺQ, Qiβ(XβA)=Fiβ(A)=0, we have FiββͺF for iβJ.
We can verify that for any AβB(R) and iβJ,
[TABLE]
Therefore,
[TABLE]
(ii)β(iii): Trivial.
(iii)β(i): Suppose that (2.1) holds and F dominates (Fiβ)iβJβ. One can easily verify that, for all AβB(R) and iβJ,
[TABLE]
Therefore, X has distribution Fiβ under Qiβ, iβJ, thus (Qiβ)iβJβ and (Fiβ)iβJβ are compatible.
β
Remark 2.3**.**
In the case where the index set J={1,β¦,n} is finite, a probability measure QβM1β dominating (Q1β,β¦,Qnβ) always exists, as we can take, for example, Q=n1β(Q1β+β―+Qnβ). As such, the existence assumption in Theorem 2.2 can be removed when J is finite.
In the special case of n=2 and Q1ββͺQ2β, one can take Q=Q2β in Theorem 2.2, and the two-dimensional equality in (2.1) reduces to a one-dimensional equality
[TABLE]
3 Characterizing compatibility via heterogeneity order
In this section, we explore analytical conditions for compatibility of (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) based on their Radon-Nikodym derivatives with respect to some reference probability measures, which are much easier to verify than Theorem 2.2.
3.1 Preliminaries on convex order
For an arbitrary probability space (Ξ,S,P), denote by L1(Ξ;Rn) the set of all integrable n-dimensional random vectors defined on (Ξ,S,P).
Multivariate convex order is a natural notion of heterogeneity order, as defined below.
As mentioned in the introduction, compatibility intuitively concerns the heterogeneity among (Q1β,β¦,Qnβ) compared to (F1β,β¦,Fnβ).
The following lemma, based on Theorem 2.2, yields a possible way of characterizing the comparison between the two tuples of measures.
More precisely, a necessary condition for compatibility is built on a convex order relation between the random vectors (dFdF1ββ,β¦,dFdFnββ) and (dQdQ1ββ,β¦,dQdQnββ) for some reference probability measures FβF and QβM1β.
Lemma 3.3**.**
If (Q1β,β¦,Qnβ)βM1nβ and (F1β,β¦,Fnβ)βFn are compatible, then
for any QβM1β dominating (Q1β,β¦,Qnβ), there exists FβF dominating (F1β,β¦,Fnβ), such that
[TABLE]
Moreover, F in (3.1) can be taken as QβXβ1, where X is a random variable with distribution Fiβ under Qiβ, i=1,β¦,n.
Using the language of heterogeneity order, Lemma 3.3 says that in order for compatibility of (Q1β,β¦,Qnβ)βM1nβ and (F1β,β¦,Fnβ)βFn, a necessary condition is (F1β,β¦,Fnβ)βͺ―hβ(Q1β,β¦,Qnβ). Before discussing the sufficiency of this condition, we first establish some properties of heterogeneity order.
The following lemma implies that the choice of the reference measures P and Q in (3.2) is irrelevant; in fact, they can be conveniently chosen as the averages of the corresponding measures.
Take the convex function f:RnβR, f(x1β,β¦,xnβ)=(x1β+β―+xnβ)2. It follows from the definition of convex order that
[TABLE]
On the other hand,
[TABLE]
Hence, dPβdP1ββ+β―+dPβdPnββ has zero variance under Pβ, which implies that it is Pβ-almost surely equal to n.
In other words, Pβ=n1ββi=1nβPiβ on all sets with positive Pβ-measure. Noting that moreover Pβ dominates (P1β,β¦,Pnβ), we must have Pβ=n1ββi=1nβPiβ.
Therefore, (3.2) holds for P=n1ββi=1nβPiβ and Q=n1ββi=1nβQiβ.
where, again, Xβ²=0 implies Yβ²=0, and in this case Xβ²Yβ²β is set to be 0.
Hence
[TABLE]
Therefore, we must have
[TABLE]
Ξ·-almost surely.
Define measure Pβ² by dPdPβ²β(z)=EΞ·[Xβ²β£Zβ²=Z(z)]=:V(z). Note that since
[TABLE]
Pβ² is a probability measure. Then we have (dPβ²dP1ββ,β¦,dPβ²dPnββ)=VZβ. Define probability measure Ξ·β²β² by dΞ·dΞ·β²β²β=EΞ·[Xβ²β£Zβ²]. Since the relation between Zβ², Ξ· and Ξ·β²β² is in parallel with that between Z, P and Pβ², we have
[TABLE]
However, for any test function g,
[TABLE]
hence =Zβ²β£Ξ·β²β²βZβ²β£Ξ·β²β\buildreldβ. Thus, EΞ·[Xβ²β£Zβ²]Zβ²β, as a function of Zβ², also has the same distribution under Ξ·β² and Ξ·β²β². Consequently, we have
[TABLE]
Also, recalling that =(X,Y)β£Qβ²β(Xβ²,Yβ²)β£Ξ·β²β\buildreldβ,
[TABLE]
The proof is finished by noting that
[TABLE]
and applying Lemma 3.2 with random vectors EΞ·[Xβ²β£Zβ²]Zβ²β, Xβ²Yβ²β and measure Ξ·β².
β
Remark 3.6**.**
While the definition of heterogeneity order is given simply by using the convex order between the Radon-Nikodym derivatives of the measures, Lemma 3.5 shows that the choice of the reference measures, hence the exact form of the Radon-Nikodym derivatives, does not affect the order. This explains our motivation to introduce the notion of heterogeneity order as a partial order between two groups of measures rather than between two groups of random variables.
Some simple and intuitive properties of heterogeneity order are summarized in the following proposition. These properties justify the term βheterogeneityβ in the order βͺ―hβ.
(iii) Let P=n1ββi=1nβPiβ and Q=n1ββi=1nβQiβ.
(P1β,β¦,Pnβ)βͺ―hβ(Q1β,β¦,Qnβ) implies that, for each i=1,β¦,n,
[TABLE]
Note that Q(dQiβ/dQ=0)=0 as Q1β,β¦,Qnβ are equivalent. By Lemma 3.2, we know P(dPiβ/dP=0)=0, which implies PβͺPiβ.
Thus, P1β,β¦,Pnβ are equivalent.
and (dPdP1ββ,β¦,dPdPnββ) takes values in S.
Furthermore,
[TABLE]
By the Choquet-Meyer Theorem ([9]; see Section 10 of [20]), stating that among random vectors distributed in a simplex, the maximal elements with respect to convex order are supported over the vertices of the simplex, we have
[TABLE]
(v) Using the notation in (iv),
(dPdP1ββ,β¦,dPdPnββ)
takes values in the vertices of the simplex S,
and (dQdQ1ββ,β¦,dQdQnββ) takes values in S.
Therefore, by the Choquet-Meyer Theorem again, in order for (P1β,β¦,Pnβ)βͺ―hβ(Q1β,β¦,Qnβ) to hold,
(dQdQ1ββ,β¦,dQdQnββ) has to be distributed over the vertices of the simplex S, and therefore, Q1β,β¦,Qnβ are mutually singular.
β
3.3 Almost compatibility
In Section 3.2, we see that a necessary condition for compatibility of (Q1β,β¦,Qnβ)βM1nβ and (F1β,β¦,Fnβ)βFn is (F1β,β¦,Fnβ)βͺ―hβ(Q1β,β¦,Qnβ). A natural question is whether (and with what additional assumptions) the above condition is also sufficient. This boils down (via Theorem 2.2) to the question of, given
[TABLE]
where F=n1ββi=1nβFiβ and Q=n1ββi=1nβQiβ,
constructing a random variable X with distribution F under Q such that
It seems then natural to assume that each of Q1β,β¦,Qnβ is atomless.
Below we give a counter example showing that this condition is still insufficient.
Example 3.8 suggests that the atomless condition, combined with the heterogeneity order (F1β,β¦,Fnβ)βͺ―hβ(Q1β,β¦,Qnβ),
is not sufficient for compatibility of (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ).
Nevertheless, in this section we show that, assuming Q1β,β¦,Qnβ are atomless, (F1β,β¦,Fnβ)βͺ―hβ(Q1β,β¦,Qnβ)
is sufficient for almost compatibility, a weaker notion than compatibility, which we introduce below.
Denote by DKLβ(β β₯β ) the Kullback-Leibler divergence between probability measures. Recall that DKLβ(Pβ₯Q) is defined as β«log(dP/dQ)dP for PβͺQ.
The following theorem characterizes almost compatibility via heterogeneity order in Definition 3.4,
assuming each of Q1β,β¦,Qnβ is atomless.
Theorem 3.10**.**
*Suppose that (Q1β,β¦,Qnβ)βM1nβ, (F1β,β¦,Fnβ)βFn and each of Q1β,β¦,Qnβ is atomless. (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) are almost compatible if and only if (F1β,β¦,Fnβ)βͺ―hβ(Q1β,β¦,Qnβ).
*
The proof of Theorem 3.10 is a bit lengthy, and is postponed to Appendix A.2 of the paper.
Remark 3.11**.**
The Kullback-Leibler divergence in Definition 3.9 is not the only possible choice to provide an equivalent condition in Theorem 3.10. Indeed, the condition for necessity can be weakened to the convergence in probability of dFi,Ξ΅β/dFiβ to 1 as Ξ΅β0, by using Fatouβs lemma and the fact that a sequence converging in probability has a subsequence converging almost surely; the proof for sufficiency implies results as strong as the uniform convergence of dFi,Ξ΅β/dFiβ to 1. Consequently, the Kullback-Leibler divergence used in the definition of the almost compatibility can be replaced by a series of other conditions, including:
(i)
dFi,Ξ΅β/dFiββp1;
2. (ii)
dFi,Ξ΅β/dFiββa.s.1;
3. (iii)
Fi,Ξ΅β converges to Fiβ in total variation, and Fi,Ξ΅ββͺFiβ;
4. (iv)
among others, without altering the result of Theorem 3.10.
Almost compatibility has a practical implication for optimization problems.
Suppose that Q1β,β¦,Qnβ are atomless. For optimization problems of the form
[TABLE]
where Ο:Fβ[ββ,β] is a functional,
it suffices to consider
[TABLE]
as long as Ο is continuous with respect to any of the convergence types listed in Remark 3.11.
3.4 Equivalence of heterogeneous order and compatibility
In view of the discussions in Section 3.3, (F1β,β¦,Fnβ)βͺ―hβ(Q1β,β¦,Qnβ) is not sufficient for compatibility of (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ), but sufficient for almost compatibility if each of Q1β,β¦,Qnβ is atomless.
In this section, we seek for a
slightly stronger condition on the n-tuple (Q1β,β¦,Qnβ), under which compatibility and almost compatibility coincide.
Clearly, if (Q1β,β¦,Qnβ) is conditionally atomless, then each of Q1β,β¦,Qnβ is atomless, since a continuous random variable under Q is also continuous under each Q1β,β¦,Qnβ.
With the help of conditional distributions, we first note that the independence in Definition 3.12 is not essential and can be replaced by continuity of the conditional distribution. Moreover, similarly to heterogeneity order, the reference probability measure Q can always be taken as Q=n1ββi=1nβQiβ.
Proposition 3.14**.**
For (Q1β,β¦,Qnβ)βM1nβ, the following are equivalent:
(i)
(Q1β,β¦,Qnβ)* is conditionally atomless.*
2. (ii)
Note that (iii) has two versions: one states the existence of Q and the other specifies Q.
It is trivial to see that (ii) implies (i) and both versions of (iii). It remains to show (iii)β(i)β(ii).
We first show (i)β(ii). Assume (Q1β,β¦,Qnβ) is conditionally atomless. As a result, there exist Qβ²βM1β and a random variable X, such that X and Y:=(dQβ²dQ1ββ,β¦,dQβ²dQnββ) are independent under Qβ².
For i=1,β¦,n, AβB(R) and BβB(Rn),
[TABLE]
The independence between X and Y also implies that
[TABLE]
Thus, X has the same distribution under Qiβ, i=1,β¦,n. Let Q=n1ββi=1nβQiβ, and note that X also has the same distribution under Q. Moreover,
[TABLE]
which means that X and Y are independent under Qiβ for i=1,β¦,n. For any AβB(R) and BβB(Rn),
[TABLE]
and hence X and Y are independent under Q. As a result, X is also independent of
[TABLE]
under Q, where β₯β β₯1β is the Manhattan norm on Rn. Therefore, we conclude that X and (dQdQ1ββ,β¦,dQdQnββ) are independent under Q.
As a byproduct of the above proof, we note that if a random variable X is independent of (dQdQ1ββ,β¦,dQdQnββ) under a probability measure Q, then X is also independent of (dQdQ1ββ,β¦,dQdQnββ) under each of Q1β,β¦,Qnβ. Moreover, X has the same distribution under Q1β,β¦,Qnβ and Q.
Remark 3.16**.**
Right before the publication of this paper, a new preprint [10] introduces the concept of a conditionally atomless Ο-field which turns out to be closely related to our notion of conditionally atomless measures in Definition 3.12. For the connection and the differences between the two formulations, see the discussions in [10]222We thank Freddy Delbaen for pointing out the preprint and for very useful discussions..
It turns out that the assumption that (Q1β,β¦,Qnβ) is conditionally atomless allows for such a construction.
Theorem 3.17**.**
Suppose that (Q1β,β¦,Qnβ)βM1nβ is conditionally atomless and (F1β,β¦,Fnβ)βFn.
(Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) are compatible if and only if (F1β,β¦,Fnβ)βͺ―hβ(Q1β,β¦,Qnβ).
The key step to prove Theorem 3.17 is the following lemma, which might be of independent interest.
Necessity is guaranteed by Lemma 3.3. We only show sufficiency.
Suppose that (F1β,β¦,Fnβ)βͺ―hβ(Q1β,β¦,Qnβ). We shall show that (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) are compatible.
By Lemma 3.5,
As shown in Theorem 3.17, compatibility is closely related to heterogeneity order βͺ―hβ, and hence it defines a partial order. The direction of the order comes from the fact that a measurable mapping needs not to be a bijection. As multiple points are mapped to a same image, the βheterogeneityβ between measures decreases. However, if we require the mapping to be a bijection, then compatibility becomes an equivalence relation. Indeed, in this case Theorem 3.17 would be applicable to both directions, which means that (3.2) holds for both directions, with P=n1ββi=1nβPiβ and Q=n1ββi=1nβQiβ. As a result, we must have
[TABLE]
Moreover, the proof of Theorem 3.17 actually shows that, assuming both tuples of measures are conditionally atomless, the above condition is not only necessary but also sufficient to guarantee the existence of a bijection linking (P1β,β¦,Pnβ) to (Q1β,β¦,Qnβ).
Remark 3.20**.**
As a simple consequence of Theorem 3.17, in the case where n=2 and Q1ββͺQ2β, if (Q1β,Q2β) and (F1β,F2β) are compatible, then F1ββͺF2β and
[TABLE]
The converse is also true if, in addition, (Q1β,Q2β) is conditionally atomless. Therefore, the heterogeneity order condition becomes one-dimensional, and is easy to check. Chapter 3 of [23] contains several classic methods to check Xβ£Pββͺ―cxβYβ£Qβ for arbitrary random variables X and Y and probability measures P and Q.
Below we discuss a few special cases of compatible (Q1β,β¦,Qnβ)βM1nβ and (F1β,β¦,Fnβ)βFn based on the heterogeneity order condition, in particular in the context of Proposition 3.7 and Theorem 3.17. We shall see how our main results are consistent with natural intuitions.
Assume that Q1β,β¦,Qnβ are identical. The natural intuition is that the respective distributions F1β,β¦,Fnβ of a random variable under Q1β,β¦,Qnβ have to be identical as well. Indeed, by Lemma 3.3, compatibility implies (F1β,β¦,Fnβ)βͺ―hβ(Q1β,β¦,Qnβ). By Proposition 3.7 (ii), F1β,β¦,Fnβ are identical.
Assume that F1β,β¦,Fnβ are mutually singular.
The natural intuition here is that the probability measures Q1β,β¦,Qnβ have to be also mutually singular to allow for compatibility.
Similarly to the previous case, this is justified by Theorem 3.17 and Proposition 3.7 (v).
Assume that F1β,β¦,Fnβ are identical, and (Q1β,β¦,Qnβ) is conditionally atomless.
Proposition 3.7 (i) gives (F1β,β¦,Fnβ)βͺ―hβ(Q1β,β¦,Qnβ).
It follows from Theorem 3.17 that (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) are compatible.
We conclude that, as long as (Q1β,β¦,Qnβ) is conditionally atomless, for any distribution FβF, there exists a random variable X which has distribution F under each of Qiβ, i=1,β¦,n. Indeed, as (Q1β,β¦,Qnβ) is conditionally atomless, there exists Q dominating (Q1β,β¦,Qnβ) and an F-distributed random variable X under Q independent of (dQdQ1ββ,β¦,dQdQnββ). Remark 3.15 then implies that X also has distribution F under each Q1β,β¦,Qnβ.
Assume that Q1β,β¦,Qnβ are equivalent. Intuitively, the respective distributions F1β,β¦,Fnβ of any random variable under Q1β,β¦,Qnβ have to be equivalent.
This fact is implied by Proposition 3.7 (iii).
Remark 3.21**.**
A notion similar to heterogeneity order is
useful
in comparison of statistical experiments, an area of study originated by Blackwell ([3; 4]); the interested reader is referred to [17] and [26] for summaries.
4 Distributional compatibility for stochastic processes
4.1 General results
In this section we extend our results to stochastic processes with sample paths which are continuous from right with left limits (cΓ dlΓ g).
For a (finite or infinite) closed interval IβR, let D(I) be the Skorokhod space on I, i.e., the space of all cΓ dlΓ g functions defined on I. Let DIβ be the Borel Ο-field of the Skorokhod topology J1β. Denote by GIβ=M1β(D(I)) the set of probability measures on (D(I),DIβ). Our first step is to generalize the definition of compatibility to this setting, which follows in a natural way.
The following is a parallel result to Theorem 2.2, which shares the same proof.
Proposition 4.2**.**
Let IβR be a closed interval, (Qiβ)iβJββM1β and (Giβ)iβJββGIβ.
A stochastic process X has distribution Giβ under Qiβ for iβJ if and only if for all QβM1β dominating (Qiβ)iβJβ, G=QβXβ1 dominates (Giβ)iβJβ, and for all iβJ,
Suppose that (Q1β,β¦,Qnβ)βM1nβ is conditionally atomless, IβR is a closed interval, and (G1β,β¦,Gnβ)βGInβ.
(Q1β,β¦,Qnβ) and (G1β,β¦,Gnβ) are compatible if and only if (G1β,β¦,Gnβ)βͺ―hβ(Q1β,β¦,Qnβ).
The proof of Theorem 4.3 sheds light upon a more general result, where (G1β,β¦,Gnβ) are probability measures defined on a Polish space Y equipped with the Borel Ο-field. In particular, let {yiβ}i=1,2,β¦β be a dense subset of Y, then the sequence of partitions
[TABLE]
β=0,1,β¦, can be used to replace {[h2ββ,(h+1)2ββ),\leavevmodeΒ hβZ}β=0,1,β¦β in the proof of Lemma 3.18. The rest follows exactly in the same way as in that proof. Consequently, a general version of Theorem 3.17 can be stated using the setting of a Polish space instead of R. However, due to the lack of a natural order and metric as in R, a rigorous proof directly for the general case of a Polish space would be notationally heavy and also less intuitive for the readers who are not familiar with Polish spaces. As such, we present Theorem 3.17 under the setting of R, which is also the focus of this paper, and use this remark for a discussion for the general setting, after seeing the proof of Theorem 4.3.
4.2 Relation to the Girsanov Theorem
In this section we investigate how much the drift of a Brownian motion may vary under a change of measure as in the classic Girsanov Theorem. We keep in mind that, the distribution of a Brownian motion (with respect to its natural filtration) with a deterministic drift process only depends on this drift. On the other hand, Brownian motions with stochastic drift processes are not identified by the distribution of the drift processes. Due to this reason, we consider only Brownian motions with deterministic drift processes here.
Throughout this section,
let PβM1β and B={Btβ}tβ[0,T]β be a P-standard Brownian motion. Furthermore,
for a [0,T]-square integrable deterministic process ΞΈ={ΞΈtβ}tβ[0,T]β,
define
[TABLE]
and let GΞΈβ be the distribution measure of a Brownian motion with drift process ΞΈ.
The Girsanov Theorem says that B is a Brownian motion with drift process ΞΈ and volatility 1 under QΞΈβ (certainly, this statement is also true for adapted drift processes). Thus, (P,QΞΈβ) and (G0β,GΞΈβ) are compatible.
It is clear that distribution measures of Brownian motions with different non-random volatility terms are mutually singular, and hence they are not compatible with (P,QΞΈβ).
A next question is whether there exists a P-standard Brownian motion which has a deterministic drift process ΞΌ={ΞΌtβ}tβ[0,T]β under QΞΈβ.
We are interested in the values of ΞΌ such that (G0β,GΞΌβ) and (P,QΞΈβ) above are compatible. Here we do not assume that (P,QΞΈβ) is conditionally atomless, which means that there might not be any random source other than B.
Theorem 4.5**.**
Suppose that the deterministic processes ΞΈ={ΞΈtβ}tβ[0,T]β and ΞΌ={ΞΌtβ}tβ[0,T]β are [0,T]-square integrable, and ΞΌtβξ =0 almost everywhere on [0,T]. (P,QΞΈβ) and (G0β,GΞΌβ) are compatible if and only if
[TABLE]
Proof.
(i) Necessity.
By the Girsanov Theorem, we know that (G0β,GΞΌβ) and (P,QΞΌβ) are compatible. Using Proposition 4.2 for n=2, we have
[TABLE]
Suppose that (P,QΞΈβ) and (G0β,GΞΌβ) are compatible. Note that
Let a stochastic process B^={B^tβ}tβ[0,T]β be given by dB^tβ=dBtββΞΈtβdt. By the Girsanov Theorem,
B^ is a QΞΈβ-standard Brownian motion.
Define
[TABLE]
where Ξ²={Ξ²sβ}sβ[0,Ξ±Tβ]β is given by Ξ²Ξ±tββ=ΞΌtβΞΈΞ±tβββ, tβ[0,T].
W={Wtβ}tβ[0,T]β is clearly a Gaussian process, EP[Wtβ]=0, and
[TABLE]
Therefore, W is a P-standard Brownian motion. Furthermore, for tβ[0,T],
[TABLE]
where the last equality is due to (4.1).
As β«0tβΞ²Ξ±sββdB^Ξ±sββ defines a QΞΈβ-standard Brownian motion, we conclude that W has distribution GΞΌβ under QΞΈβ, and hence (P,QΞΈβ) and (G0β,GΞΌβ) are compatible.
β
We list Theorem 4.5 for the case of a constant drift term below, and look more closely at the construction of the desired stochastic process.
To study (5.1), we assume that (P,Q,R) are conditionally atomless and Q,RβͺP. By Theorem 3.17, the compatibility in (5.1) is equivalent to the heterogeneity order (F,G,H)βͺ―hβ(P,Q,R). An application of Lemma 3.5 (iii) shows this is equivalent to
[TABLE]
for some Fβ² such that F,G,HβͺFβ². As dFβ²dFββFβ²ββͺ―cxβ1, Fβ² must be the same as F. Hence, an equivalent condition is
Hence, we relax the reliance of Wβ² in the optimization problem, and (5.1) can be rewritten as
[TABLE]
where the minimum is taken subject to the constraints
[TABLE]
Under Pβ², given the joint distribution of Xβ² and Yβ², the conditional distributions Xβ²β£Yβ²=y and Zβ²β£Yβ²=y are both fixed for Pβ²-almost every y. Hence, by the Hardy-Littlewood inequality (in the form of Remark 3.25 of [22]), the sub-problem, for fixed (Xβ²,Yβ²),
[TABLE]
where Kyβ is the set of all random variables Zβ² satisfying (Z^{\prime}|Y^{\prime}=y)\big{|}_{P^{\prime}}\buildrel\mathrm{d}\over{=}(\frac{\mathrm{d}R}{\mathrm{d}P}|\frac{\mathrm{d}Q}{\mathrm{d}P}=y)\big{|}_{P}, has a simple solution such that Zβ² given Yβ²=y and \mathds1{XβDaβ}β given Yβ²=y are counter-monotonic. Consequently, we have
[TABLE]
where f(β β£y) is the left-quantile of the distribution function of dPdRβ given dPdQβ=y under P, and
where FYβ is the distribution of \frac{\mathrm{d}Q}{\mathrm{d}P}\big{|}_{P}.
Clearly, Ξ¦ is determined by the joint distribution of (Xβ²,Yβ²) under Pβ².
By this argument, we relax the reliance of Zβ² in the optimization problem (5.2).
To summarize, the results in this paper allow us to transform the original optimization problem (5.1) into
Problem (5.3) can be seen as a generalized martingale mass transportation problem (e.g.Β [5]).
In a classic two-period martingale mass transportation problem, the objective is to minimize EP[Ο(X,Y)] for some cost function Ο:R2βR over (X,Y)
where the distributions of X and Y under some measure P are known, and EP[Yβ£X]=X.
Note that our constraints (5.4) are the same as in the classic problem. The only difference between (5.3) and the classic problem is that our objective Ξ¦
does not have the form of an expected value of Ο(X,Y). Rather, Ξ¦ is determined by the joint distribution of (X,Y).
Hence, Ξ¦ can be seen as a generalized cost functional in a mass transportation problem.
Meanwhile, a lower bound for the optimal value of (5.3) can be obtained by considering an optimization problem with a weaker constraint:
[TABLE]
where Kβ² is the set of all random variables Y satisfying Y|_{P^{\prime}}\buildrel\mathrm{d}\over{=}\frac{\mathrm{d}Q}{\mathrm{d}P}\big{|}_{P}, and
[TABLE]
Note that since only the joint distribution of X and Y matters, here we take X as given and reduce the problem to an optimization solely over Y. Denote by Yβ an optimal solution of (5.5), and let pβ=pX,Yββ. Then for any two points y1β, y2β and Ξ»β(0,1) satisfying pβ(y1β),pβ(y2β),pβ(Ξ»y1β+(1βΞ»)y2β)β(0,1), a variational argument leads to
the first order condition
[TABLE]
which implies that f(pβ(y)β£y) must be linear in y when pβ(y) is between 0 and 1. Combining this with the constraints
[TABLE]
and
[TABLE]
generically gives a unique solution, which is a local minimum by checking the second order condition. Note that similar to (5.2), (5.5) can be rewritten as minEPβ²[Z\mathds1{XβDaβ}β], where the minimum is taken over all the (Y,Z) such that =(dPdQβ,dPdRβ)β£Pβ(Y,Z)β£Pβ²β\buildreldβ and EPβ²[Y\mathds1{XβDaβ}β]=EPβ²[X\mathds1{XβDaβ}β].
As the objective EPβ²[Z\mathds1{XβDaβ}β] is linear and the feasible region is convex (with respect to mixture),
the local minimum must also be the global minimum for the optimization problem (5.5), providing a lower bound for the optimal value in (5.3).
pX,Yβ(y) can be derived and then it can be verified that the first order condition (5.6) is met. Consequently, the dependence given by (X,Y) is indeed optimal for problem (5.3), and the corresponding optimal value can be calculated. We omit the detail as the rest is purely computational.
Acknowledgements
The authors are grateful to the Editor, the Associate Editor, two referees, Michel Baes, Fabio Bellini, Paul Embrechts, Fabio Maccheroni, Tiantian Mao, Alfred MΓΌller, Marcel Nutz, Jan Obloj, Sidney Resnick, Ludger RΓΌschendorf, Alexander Schied and Xiaolu Tan for various helpful suggestions and discussions on an earlier version of the paper.
J.Β Shen acknowledges financial support from the China Scholarship Council.
Y.Β Shen and R.Β Wang acknowledge financial support by the Natural Sciences and Engineering Research Council (NSERC 2014-04840, RGPIN-2018-03823, RGPAS-2018-522590) of Canada.
R.Β Wang is also grateful to FIM at ETH Zurich for supporting his visit in 2017, during which part of this paper was written.
From the definition of F1β and Q1β, we have, for Ξ»-almost surely tβ[0,1],
β£4X(t)β2β£=2t.
It follows that X(t)=(t+1)/2 or X(t)=(1βt)/2 for all tβ[0,1].
Write
Necessity. Assume that (Q1β,β¦,Qnβ) and (F1β,β¦,Fnβ) are almost compatible. This means that for any Ξ΅>0, there exists (F1,Ξ΅β,β¦,Fn,Ξ΅β) such that DKLβ(Fi,Ξ΅ββ₯Fiβ)<Ξ΅ for i=1,β¦,n, and (Q1β,β¦,Qnβ) is compatible with (F1,Ξ΅β,β¦,Fn,Ξ΅β). Define probability measures
Since DKLβ(Fi,Ξ΅ββ₯Fiβ) converges to 0, by Pinskerβs inequality,
Fi,Ξ΅β converges to Fiβ in total variation, which is equivalent to dFi,Ξ΅β/dFiβ converging in L1β£Fiββ to 1. Hence for any sequence Ξ΅mββ0,
there exists a subsequence, which we still denote as Ξ΅mββ0 by a slight abuse of notation, such that dFi,Ξ΅mββ/dFiβ converge to 1Fiβ-almost surely. It is easy to check that we have dFΞ΅mββ/dF converge to 1 as well. (A.1) then implies that
Since dFΞ΅mββdFi,Ξ΅mββββ[0,n], and f is convex hence continuous, β£f(dFΞ΅mββdF1,Ξ΅mβββ,β¦,dFΞ΅mββdFn,Ξ΅mβββ)β£ is bounded. Let b be an upper bound of it. Because FΞ΅mββ converges in total variation to F, we have
[TABLE]
uniformly, where Ξ΄(β ,β ) is the total variation distance. Moreover, by dominated convergence, we have
Let Idβ be the identity random variable on (R,B(R)). For β=0,1,β¦ and hβZ, denote by Οβ,hmβ(z) the conditional probability under F of the event Idββ[h2ββ,(h+1)2ββ) given Zmβ=z:
[TABLE]
Then for any β=0,1,β¦, Ak,jmβ can be further divided into disjoint subsets Ak,j,β,hmβ, such that Q(A^{m}_{k,j,\ell,h})$$=Q(A^{m}_{k,j})\varphi^{m}_{\ell,h}(\exp(j2^{-m})). Moreover, the partitions can be made such that {Ak,j,ββ²,hmβ}hβZβ is a refinement of {Ak,j,β,hmβ}hβZβ for any ββ²>β and any given m,k,j. Define Xm,ββ(Ο)=h2ββ for ΟβAk,j,β,hmβ, and Xmβ=limββββXm,ββ. The limit exists since it is easy to check that Xm,ββ is increasing with respect to β. Note that Xm,ββ is conditionally independent of Ymβ given Zmβ²β²β, hence Xmβ is also conditionally independent of Ymβ given Zmβ²β²β.
By construction, for any AβRn, β=0,1,β¦, and hβZ,
where the first equality holds since Xmβ is independent of Ymβ given Zmβ²β²β, and the fourth equality holds because QβXmβ1β=F and ZmββXmβ=Zmβ²β²β. Symmetrically,
Since this holds for any AβB(B), we conclude that QiββXmβ1β is absolutely continuous with respect to Fiβ, and dQiββXmβ1β/dFiββ[exp(β2βm+1),exp(2βm+1)]. It is easy to see that DKLβ(QiββXmβ1ββ₯Fiβ) converges to 0 as mββ.
β
Bibliography27
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1Anscombe and Aumann [1963] Anscombe, F. J., Aumann, R. J.: A definition of subjective probability. Annals of Mathematical Statistics 34 (1), 199β205 (1963)
2Battigalli et al. [2017] Battigalli, P., Cerreia-Vioglio, S., Maccheroni, F., Marinacci, M.: Mixed extensions of decision problems under uncertainty. Economic Theory 63 (4), 827β866 (2017)
3Blackwell [1951] Blackwell, D.: Comparison of experiments. Proceedings of the Second Berkeley Symposium on Mathematical Statistics and Probability 1 , 93β102 (1951)
4Blackwell [1953] Blackwell, D.: Equivalent comparisons of experiments. Annals of Mathematical Statistics 24 (2), 265β272 (1953)
5BeiglbΓΆck et al. [2013] BeiglbΓΆck, M., Henry-LabordΓ¨re, P., Penkner, F.: Model-independent bounds for option prices: a mass transport approach. Finance and Stochastics 17 (3), 477β501 (2013)
6BeiglbΓΆck and Juille [2016] BeiglbΓΆck, M., Juillet, N.: On a problem of optimal transport under marginal martingale constraints. Annals of Probability 44 (1), 42β106 (2016)
7BeiglbΓΆck et al. [2017] BeiglbΓΆck, M., Nutz, M., Touzi, N.: Complete duality for martingale optimal transport on the line. Annals of Probability 45 (5), 3038β3074 (2017)
8Cambou and Filipovic [2017] Cambou, M., Filipovic, D.: Model uncertainty and scenario aggregation. Mathematical Finance 27 (2), 534β567 (2017)