Centers of probability measures without the mean
Giovanni Puccetti, Pietro Rigo, Bin Wang, Ruodu Wang

TL;DR
This paper explores the concept of centers in probability measures lacking finite means, revealing that certain sums of Cauchy distributions can be constant within specific bounds, extending mixability theory.
Contribution
It characterizes the set of centers for completely and jointly mixable distributions without finite means, including a surprising result for sums of Cauchy variables.
Findings
Existence of centers for sums of Cauchy distributions within specific bounds.
Extension of mixability concepts to distributions without finite mean.
Counterintuitive bounds for sum constants of Cauchy random variables.
Abstract
In the recent years, the notion of mixability has been developed with applications to optimal transportation, quantitative finance and operations research. An -tuple of distributions is said to be jointly mixable if there exist random variables following these distributions and adding up to a constant, called center, with probability one. When the distributions are identical, we speak of complete mixability. If each distribution has finite mean, the center is obviously the sum of the means. In this paper, we investigate the set of centers of completely and jointly mixable distributions not having a finite mean. In addition to several results, we show the (possibly counterintuitive) fact that, for each , there exist standard Cauchy random variables adding up to a constant if and only if
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFuzzy Systems and Optimization · Functional Equations Stability Results · Advanced Algebra and Logic
Centers of probability measures without the mean
Giovanni Puccetti111Department of Economics, Management and Quantitative Methods, University of Milano, Italy.
Pietro Rigo222Department of Mathematics, University of Pavia, Italy.
Bin Wang333Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.
Ruodu Wang444Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Canada.
Abstract
In the recent years, the notion of mixability has been developed with applications to optimal transportation, quantitative finance and operations research. An -tuple of distributions is said to be jointly mixable if there exist random variables following these distributions and adding up to a constant, called center, with probability one. When the distributions are identical, we speak of complete mixability. If each distribution has finite mean, the center is obviously the sum of the means. In this paper, we investigate the set of centers of completely and jointly mixable distributions not having a finite mean. In addition to several results, we show the (possibly counterintuitive) fact that, for each , there exist standard Cauchy random variables adding up to a constant if and only if
[TABLE]
MSC2000 subject classification: Primary 60E05, Secondary 90B30.
Keywords: Cauchy distribution; Complete mixability; Joint mixability; Multivariate dependence
1 Introduction
In the recent years, the field of complete and joint mixability [18] has been rapidly developing. Mixability serves as a building block for the solutions of many optimization problems under marginal-distributional constraints. Applications are found in optimal transportation [14], quantitative finance [6, 2] and operations research [8, 1].
In this paper, we study the set of centers of completely and jointly mixable distributions. Our main result (Theorem 4.2) is that standard Cauchy random variables can add up to a constant if and only if
[TABLE]
Even if apparently innocuous, the proof (or at least our proof) of such a result is quite involved.
To be formal, let be the collection of probability measures on having one-dimensional marginals , where are probability measures on . Denote by the standard Cauchy distribution. Then, for any and , there exists \lambda\in\Gamma\bigl{(}\mu,\dots,\mu) such that
[TABLE]
if and only if satisfies condition (1). In general, for any probability measure on , the set of satisfying (2) for some is compact (Proposition 3.1) and is finite in case is discrete (Proposition 3.2).
The existence of a probability measure with given marginals and satisfying (2) is meaningful with respect to the notions of complete mixability and joint mixability, as introduced in [17] and [20]. Let be probability measures on . The -tuple is said to be jointly mixable (JM) if condition (2) holds for some and some . In this case, is called a center of . Similarly, is -completely mixable (-CM) if there exist and satisfying condition (2). In this case, is an -center of . Clearly, coincides with the mean of provided the latter exists and is finite. In this sense, the notion of -center can be seen as a generalization of the notion of mean. The term -center is used to stress the dependence on . However, the “-" notation will be dropped when clear from the context.
The historical motivation for investigating mixability was to minimize var\bigl{(}\sum_{i=1}^{n}X_{i}\bigr{)}, where are real random variables with given marginal distributions. In fact, the idea of building random variables with constant sum, or at least whose sum has minimum variance, goes back to [7], where complete mixability of the uniform distribution was shown. Random sums with minimal variance were further investigated in [15], where complete mixability of symmetric unimodal distributions was established. Complete mixability and joint mixability of distributions with monotone densities are characterized in [17] and [18], respectively. From an analytical viewpoint, mixability can be seen as an extension of the concept of countermonotonicity (negative dependence) in dimensions and further mathematical properties are collected in [11].
In this paper, in addition to the results mentioned above, various other useful facts are proved. Amongst them, we mention Example 2.6, which provides the first (to our knowledge) explicit construction of two joint (complete) mixes having the same marginal distributions and different centers.
A last remark (connected with our main result) is that, still today, the Cauchy distribution continues to exhibit some rather unexpected properties; see e.g. [9].
Notation
Throughout this paper, is a positive integer. For any , we say “a probability measure on " to mean “a probability measure on the Borel -field of ". We write to mean that is the probability distribution of the random variable and to mean that and have the same law. We always denote by the Borel -field of and by (with or without indices) a probability measure on . Also, for any set , stands for the point mass at .
For and the -th coordinate of is denoted by . If is a probability measure on , the -th one-dimensional marginal of is the probability measure on given by A\mapsto\lambda\big{(}\bigl{\{}x\in\mathbb{R}^{n}:x_{i}\in A\bigr{\}}\big{)}.
Finally, all random variables are defined on a common probability space .
2 (Non)-Uniqueness of the center
A joint mix for with center is an -tuple of real random variables such that , , and . Similarly, is a -complete mix for with center if ,, and .
Not all probability measures on are completely mixable. For instance, is necessarily symmetric if it is 2-CM and centered at 0. Or else, is not -CM for any if the support of is bounded above (below) but not below (above). For a broad list of jointly and completely mixable distributions we refer to [11] and the references therein. Here, we start by noting that the existence of a joint mix always delivers the existence of a complete mix (with average marginal distribution) and that a complete mix with a given marginal can always be taken to be exchangeable.
Proposition 2.1**.**
- (i)
If is JM with center , then is -CM with center . 2. (ii)
Each -CM probability measure on admits an exchangeable -complete mix.
Proof.
(i) Let be a joint mix for with center . Define
[TABLE]
where is a uniform random permutation of independent of . A uniform random permutation of is a random permutation such that for each permutation of . Then, . By independence of and and recalling that , one obtains
[TABLE]
for each and , where is the set of all permutations of . Therefore, is -CM with center and is a joint mix for .
(ii) Given an -CM probability on , take in (i). Then, is an exchangeable joint mix for .
The next example, even if obvious, is helpful in proving Theorem 4.2 below.
Example 2.2**.**
Let and be probability measures on . Suppose is -CM and is -CM, where . Define for and for . Then, is clearly JM, so that
[TABLE]
is -CM by Proposition 2.1. In particular, is -CM for any .
An intriguing question is whether the center of mixable distributions is unique. Obviously, if is JM and each has finite mean, then has a unique center , namely . Analogously, if is -CM and has finite mean, is the only center of . Uniqueness of the center is also clear for , since is 1-CM if and only if it is degenerate.
In view of [16], if and are real random variables such that exists (finite or infinite) then depends only on the marginal distributions of and , in the sense that provided , and exists. It follows that the center is unique for . This fact also admits an obvious direct proof: if and are joint mixes for with and , then
[TABLE]
which clearly implies . More generally, one obtains the following result.
Proposition 2.3**.**
Suppose that is JM and at least of have finite mean. Then, the center of is unique.
Proof.
Let and let be a joint mix for with center . Without loss of generality, assume that have finite mean. Then, is integrable and . Thus, is finite, so that only depends on on and (because of [16]). Hence,
[TABLE]
is the only center of .
Another uniqueness criterion can be obtained by increasing to but replacing in the above proposition the existence of the mean with the slightly weaker condition
[TABLE]
Proposition 2.4**.**
Suppose that is JM and at least of satisfy condition (3). Then, the center of is unique.
Proof.
Let be a joint mix for with center . Without loss of generality, assume that satisfy condition (3). Then, also satisfies condition (3). In fact, for every , one obtains
[TABLE]
Hence, as .
Now take iid copies of , denoted by . For , let , , and . By condition (3) and the weak law of large numbers, we have , as , for fixed . It follows that
[TABLE]
Therefore, is unique.
Contrary to the cases and , a JM -tuple of distributions may have more than one center of .
Recall that the standard Cauchy distribution is the probability measure on with density with respect to the Lebesgue measure. Let Cauchy() denote the distribution of , where and has the standard Cauchy distribution. In [3], Chen and Shepp show the existence of two Cauchy() random variables and a constant such that is Cauchy(). Thus, and are both joint mixes for the triplet \bigl{(}Cauchy(), Cauchy(), Cauchy(4\sqrt{2})\bigr{)} with centers and , respectively. From Proposition 2.1, one also obtains a 3-CM probability measure with an interval of centers.
Example 2.5** (A probability measure with an interval of centers).**
Let and . By [3], the triplet is JM with centers and for some . Take two independent joint mixes and for with centers and , respectively. Fix and define . Using characteristic functions, it is straightforward to see that and . Hence, is a joint mix for with center . Since is arbitrary, Proposition 2.1 implies that each point in the interval is a 3-center of . Note however that such does not belong to the Cauchy family of distributions.
The general question of whether the center of a JM -tuple of distributions is always unique was stated as an open problem in [17, 11, 19]. During the writing of the present paper, we became aware of the Chen-Shepp example in [3] providing an early negative answer to the question.
However, the Chen-Shepp example, while implying non-uniqueness of the center, does not provide an explicit construction for it depends on (the existence of) an orthogonal projection. Furthermore, the value of is not explicitly given and it is not clear if and how it can be computed. We next give an example of couplings having the same marginal distributions but different sums. To the best of our knowledge, this is the first explicit construction of two joint (complete) mixes having the same marginal distributions and different centers.
Example 2.6** (Center of a JM triplet is not unique).**
Let be a random variable with a geometric distribution with parameter , that is, , , and let be a Bernoulli random variable with parameter independent of . Let
[TABLE]
and
[TABLE]
Then, , and . Furthermore, and
[TABLE]
for each . Similarly, . Thus, if denotes the distribution of and that of , the triplet is JM with centers 0 and 1. From this example and Proposition 2.1, it also follows that the probability measure is 3-CM with centers 0 and 1/3. In Example 3.6 below we shall see that [math] and are actually the only -centers of .
3 The set of centers of mixable distributions
Let be the set of those such that
[TABLE]
for some . With a slight abuse of terminology, we call the -center of . Clearly, if and only if is -CM and each is the distribution of a -complete mix for . We also denote by the function
[TABLE]
Proposition 3.1**.**
Let be -CM. Then, is a compact metric space and the function is continuous. In particular, the set of -centers of is compact. Thus, there exist such that and are -centers of but no point in is an -center of .
Proof.
Let be the set of all probability measures on , equipped with the topology of weak convergence, and let for . Since and is a Polish space, is compact if and only if it is closed and tight. Fix and such that weakly as . Then, since for all and the coordinate maps are continuous for all . Similarly, since is continuous, weakly. In addition, implies for all . Since is a closed subset of , it follows that for some and as . Hence, and . This proves that is closed and is continuous. Finally, is tight since all its elements have the same one-dimensional marginals (all equal to ). Thus, is a compact metric space.
If is -CM with an unique center, then in Proposition 3.1. In case , a natural question is: which points in the interval are centers of ?
A probability measure supported by the integers, such as in Example 2.6, has at most finitely many centers because of Proposition 3.1. This conclusion can be actually generalized to any discrete distribution.
Proposition 3.2**.**
If is JM and each is discrete, then has finitely many centers.
Proof.
Since is discrete, there is a finite set such that and . Let be a joint mix for with center and . Then,
[TABLE]
where A_{1}+\dots+A_{n}=\bigl{\{}\sum_{i=1}^{n}x_{i}:x_{i}\in A_{i},1\leq i\leq n\bigr{\}}. Since , it follows that belongs to the finite set .
The situation is quite different for diffuse distributions, which can have infinitely many centers; see for instance Example 2.5. A more interesting case is exhibited by Theorem 4.2 below, where is the standard Cauchy and each point in is an -center of .
We next obtain two useful bounds for and in Proposition 3.1. Define the quantile functional
[TABLE]
For , define also the average quantile functional
[TABLE]
Note that, for fixed and , the map is continuous with respect to weak convergence.
Proposition 3.3**.**
Suppose that is JM with center . Then, for any such that , one obtains
[TABLE]
Proof.
The first inequality follows from the second by noting that is a center of , where for each . Hence, we only prove the second inequality.
By applying , in Theorem 1 of [4] (with their notation for , , ), we obtain, for any integrable random variables , with , that
[TABLE]
for all .
Take a joint mix for with center and a sequence satisfying
[TABLE]
where stands for convergence in distribution. Then, and for all . By continuity of the average quantile functional with respect to weak convergence, it follows that
[TABLE]
Finally, by taking , one obtains
[TABLE]
which concludes the proof.
Letting and , Proposition 3.3 has the following useful consequence.
Corollary 3.4**.**
If is -CM with center , then , where
[TABLE]
Example 3.5** (The mean inequality).**
A remarkable consequence of Corollary 3.4 is the mean inequality (Proposition 2.1(7) of [17]), arguably the most important necessary condition for complete mixability, which is also sufficient for probability measures with monotone densities. Let and be the left and right end-points of . Assume and are finite and denote by the mean of . If is -CM, Corollary 3.4 yields . Hence, for , we have that that is
[TABLE]
On the other hand,
[TABLE]
Therefore, inequality (5) yields , one side of the mean inequality in [17] (the other follows similarly).
Example 3.6** (Example 2.6 revisited).**
The probability measure defined in Example 2.6 is 3-CM with 3-centers 0 and . We now prove that [math] and are actually the only 3-centers of . For , one can compute that
[TABLE]
By Corollary 3.4, it follows that and . Let be a complete mix for . Since , and , then a.s. Thus, to see that 0 and 1/3 are the only -centers of , it suffices to show that . Since a.s., then .
Another consequence of Proposition 3.3 is that a distribution with an infinite mean cannot be -CM for any . The following corollary can be shown by letting , in (4), so that the left-hand side of (4) goes to infinity.
Corollary 3.7**.**
If have means , respectively, and for at least one , then is not JM. In particular, a probability measure on with an infinite mean is not -CM for any .
We conclude this section by characterizing the set of -centers of a probability measure based on a duality argument.
Let be a probability measure on . Recall that for and write to denote the set . By definition, a real number is an -center of if and only if , where
[TABLE]
Based on Theorem 5 of [13] and Remark 2 in [7], has the dual representation
[TABLE]
where denotes the class of bounded, Borel-measurable functions such that for all . The value of (6) is not easy to compute in general. However, restricting to a subset of (as done for instance in [5]) leads to an upper bound for .
We consider the following class of piecewise-linear functions defined, for , as
[TABLE]
Since for all , we obtain
[TABLE]
where . If , then is not an -center of . Therefore, for to be an -center of , it is necessary that
[TABLE]
The above inequality is another necessary condition for the center of -CM probability measures, in addition to that of Corollary 3.4. These two necessary conditions are not equivalent in general.
4 The Cauchy distribution
From now on, we let the standard Cauchy distribution. It is shown in [15] that is -CM with center 0, for each , as it is symmetric and unimodal. In this section, we characterize the set of -centers of . We start by observing that such set is a closed interval contained in .
Example 4.1**.**
As in Proposition 3.1, let and be the minimum and the maximum of the set of -centers of . Let and be two independent complete mixes for such that
[TABLE]
Fix and define for . Then, for each and so that is a center of . Hence, , namely, each point in is a center of .
Next, on noting that , one obtains
[TABLE]
By Corollary 3.4,
[TABLE]
Since (for is symmetric) one also obtains .
Example 4.1 says that . Our main result is that this inclusion is an equality.
Theorem 4.2**.**
For every , the set of -centers of the standard Cauchy distribution is the interval
[TABLE]
The rest of this section is devoted to the proof of Theorem 4.2.
For each , let denote the collection of -CM probability measures with center . We first need two lemmas of possible independent interest. The first states that is closed under arbitrary mixtures, generalizing Theorem 3.2 of [10].
Lemma 4.3**.**
Let be any probability space and, for each , let . Suppose that is a -measurable map, for each , and define
[TABLE]
Then, .
Proof.
Let be the field on generated by the measurable rectangles , where for all , and let be any map. By Theorem 6 of [12], is a -additive probability on provided it is a finitely additive probability and A\mapsto\gamma\bigl{\{}x\in\mathbb{R}^{n}:x_{i}\in A\bigr{\}} is a -additive probability on for all .
Let . For each , since , there is such that . Define
[TABLE]
where is a finitely additive extension of to the power set of . Then, is a finitely additive probability on and
[TABLE]
for all and all , where the third equality holds because is -measurable. Hence, is -additive on . Let be the only -additive extension of to the Borel -field of . Since , to conclude the proof it suffices to see that . In fact, since is open, it is a countable union of open rectangles, that is, with for all . Since for all , one obtains
[TABLE]
Therefore , for is a countable union of -null sets.
The second lemma, which slightly generalizes Theorem 2.4 of [17], provides conditions for a certain probability measure to be -CM.
Lemma 4.4**.**
Let , , and a probability measure on which admits a non-increasing density (with respect to the Lebesgue measure). Then, is -CM if and only if
[TABLE]
Proof.
Let . As noted in Example 3.5, a necessary condition for complete mixability is the mean inequality in [17], and condition (7) is precisely the mean inequality for . Thus, (7) holds if is -CM. Conversely, suppose (7) holds. Let , where and stands for the uniform distribution on the interval . By (7), satisfies the mean inequality. By Corollary 2.9 of [17], since has non-increasing density and meets the mean inequality, is -CM. Further, as , the mean of converges to and weakly. Thus,
[TABLE]
because of Theorem 3.1 of [10].
We are now ready to prove Theorem 4.2.
Proof of Theorem 4.2.
Recall that (see [15]) for all . Since the case is trivial, we assume . Fix By Example 4.1, it suffices to show that . In turn, by Lemma 4.3, it suffices to prove that can be written as
[TABLE]
where for all and is a probability measure on .
Let , , be the standard Cauchy density and the function on given by
[TABLE]
Also, let be a function such that, for each :
- (I)
2. (II)
, 3. (III)
.
The existence of such will be verified at the end of the proof. For the moment, we assume that exists.
For , let be the finite measure on with density
[TABLE]
Since and for all and , one obtains
[TABLE]
for each . Note also that , for
[TABLE]
Denote
[TABLE]
[TABLE]
Since , , conditions (II)-(III) imply for all with and . Hence, for each , one can define
[TABLE]
where denotes the uniform distribution on the interval . Such are the probability measures that we use in (8).
Next, fix and define
[TABLE]
Then, is continuous and satisfies, by direct calculation,
[TABLE]
where is a co-finite set (possibly depending on ). Since is locally integrable (with respect to the Lebesgue measure) it follows that
[TABLE]
for all . In particular,
[TABLE]
Let be the probability measure on such that for all . Then, condition (9) yields
[TABLE]
Therefore,
[TABLE]
To prove , it is fundamental to note that has mean . Define in fact
[TABLE]
By condition (I), for all . Computing , one obtains
[TABLE]
Therefore, has mean for all .
Having noted this fact, fix and define
[TABLE]
Such has mean and is -CM by Example 2.2. Hence, . If , then is a convex combination of and . Since has mean , it follows that .
Suppose now that . Since , the mean of is not less than . Since has mean , it follows that
[TABLE]
has a mean smaller than or equal to , namely, . By this fact and , one can define
[TABLE]
Since and have mean and
[TABLE]
then has mean as well. By Lemma 4.4, is -CM (condition (7) follows from having mean ). Therefore, . Finally, since for and is convex (by Lemma 4.3), one obtains .
To conclude the proof, it remains only to prove that a -function satisfying conditions (I)-(II)-(III) actually exists. Define
[TABLE]
Then, is a -function on and
[TABLE]
where denotes the set . Fix . If , there is such that for every . Thus,
[TABLE]
Hence, the map is continuous, strictly decreasing on , and
[TABLE]
It follows that, for each , there exists a unique number satisfying and A\bigl{(}t,\,h(t)\bigr{)}=0. It remains to see that , is and .
We begin with . Since whenever , it suffices to see that . Define
[TABLE]
Then,
[TABLE]
Observe now that, since is convex (see Remark 4.5),
[TABLE]
is an increasing function of . Therefore,
[TABLE]
and, by rearranging terms,
[TABLE]
If , then , so that the right-hand member of (10) is bounded above by 1. Hence, implies . Thanks to this fact and , one concludes that there is some (possibly ) such that for and for . Since and have the same sign and
[TABLE]
one finally obtains . Therefore, for all .
To prove is , recall that is the only real number such that and . Also, implies . Thus, is because of the implicit function theorem.
We finally prove . Since and
[TABLE]
it suffices to show that for . Since
[TABLE]
it can be assumed . In this case,
[TABLE]
Recall now that for and for where . Hence, for . If , since and is decreasing on , then
[TABLE]
On the other hand, it is easily seen that
[TABLE]
Therefore,
[TABLE]
It follows that , and again . To summarize, for all , namely, satisfies conditions (I)-(II)-(III). This concludes the proof.
Remark 4.5*.*
The proof of Theorem 4.2 is valid for other probability measures in addition to the Cauchy distribution. Fix in fact a probability measure on which admits a density with respect to the Lebesgue measure. If is strictly positive, differentiable, symmetric and strictly unimodal (that is, on and on ), and if is a convex function, then any real number satisfying
[TABLE]
is an -center of . This follows from replacing and by and , respectively, in the proof of Theorem 4.2. Therefore, taking an arbitrary differentiable, symmetric, strictly positive and strictly convex function , by defining a density for some normalizing constant one always finds a distribution fulfilling the requirements of the proof of Theorem 4.2. However, apart from the Cauchy distribution, we do not know of any natural example of such . This is due to the convexity of , which is a quite restrictive requirement. For instance, suppose that has the power form for some constants . In this case, leads to the standard Cauchy distribution, implies that the mean of is finite (so that 0 is the unique center), and implies that is not convex.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Bellini, O. E. Karaşan, and M. Ç. Pınar, Joint mixability of some integer matrices , Discrete Optim. 20 (2016), 90–104.
- 2[2] C. Bernard, L. Rüschendorf, and S. Vanduffel, Value-at-risk bounds with variance constraints , Forthcoming in J. Risk Insurance , DOI: 10.1111/jori.12108, 2015.
- 3[3] R. Chen and L. A. Shepp, On the sum of symmetric random variables , Amer. Statist. 37 (1983), no. 3, 237.
- 4[4] P. Embrechts, H. Liu, and R. Wang, Quantile-based risk sharing , Preprint, available at https://ssrn.com/abstract=2744142, 2016.
- 5[5] P. Embrechts and G. Puccetti, Bounds for functions of dependent risks , Finance Stoch. 10 (2006), no. 3, 341–352.
- 6[6] P. Embrechts, G. Puccetti, and L. Rüschendorf, Model uncertainty and Va R aggregation , J. Bank. Financ. 37 (2013), no. 8, 2750–2764.
- 7[7] N. Gaffke and L. Rüschendorf, On a class of extremal problems in statistics , Math. Operationsforsch. Statist. Ser. Optim. 12 (1981), no. 1, 123–135.
- 8[8] U-U. Haus, Bounding stochastic dependence, joint mixability of matrices, and multidimensional bottleneck assignment problems , Oper. Res. Lett. 43 (2015), no. 1, 74–79.
