Bivariate distributions with ordered marginals
Sebastian Arnold, Ilya Molchanov, Johanna F. Ziegel

TL;DR
This paper characterizes dependency structures between two stochastically ordered variables using copulas, providing extremal measures, explicit joint distributions, and extensions to multivariate and partially ordered spaces.
Contribution
It offers a comprehensive characterization of copulas compatible with stochastic order, including extremal measures and maximum entropy distributions, with multivariate extensions.
Findings
Closed-form extremal Kendall's tau and Spearman's rho
Explicit maximum entropy joint distribution
Extensions to multivariate and partially ordered spaces
Abstract
This paper provides a characterization of all possible dependency structures between two stochastically ordered random variables. The answer is given in terms of copulas that are compatible with the stochastic order and the marginal distributions. The extremal values for Kendall's and Spearman's for all these copulas are given in closed form. We also find an explicit form for the joint distribution with the maximal entropy. A multivariate extension and a generalization to random elements in partially ordered spaces are also provided.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Bivariate distributions with ordered marginals
Sebastian Arnold
Ilya Molchanov
Johanna F. Ziegel
Institute of Mathematical Statistics and Actuarial Sciences, University of Bern, Switzerland
Abstract
This paper provides a characterization of all possible dependency structures between two stochastically ordered random variables. The answer is given in terms of copulas that are compatible with the stochastic order and the marginal distributions. The extremal values for Kendall’s and Spearman’s for all these copulas are given in closed form. We also find an explicit form for the joint distribution with the maximal entropy. A multivariate extension and a generalization to random elements in partially ordered spaces are also provided.
keywords:
copula , diagonal section , differential entropy, nonparametric correlation , stochastic order.
MSC:
[2010] Primary 60E15 , Secondary 62H20
1 Introduction
Let and be two random variables, such that is stochastically larger than . This means that , , for their cumulative distribution functions (cdfs) and , respectively. It is well known that this is the case if and only if and can be realized on the same probability space, so that almost surely. The objective of this paper is to characterize all random vectors such that and have given cdfs and .
As a first observation, we establish a representation of joint distributions of ordered random variables as distributions of the order statistics sampled from an exchangeable bivariate law.
Theorem 1**.**
A random vector with marginal cdfs and satisfies if and only if and for a random vector with exchangeable components and such that and for all , , where
[TABLE]
Proof*.*
The vector obtained as the random permutation of is exchangeable and its marginal distributions are . Furthermore,
[TABLE]
Conversely, if and are order statistics from , then a.s., and
[TABLE]
This theorem complements already known results deriving the distribution of order statistics from general multivariate laws, see [1, 2, 3].
If the supports of and are disjoint intervals, then any dependency structure between them is possible. Otherwise, restrictions are necessary, e.g. and cannot be independent. In Section 2, we give a complete description of the joint distribution of and . This description is given in terms of copulas and their diagonal sections. In Section 3, we identify bounds on the joint distribution of in terms of concordance ordering. Then in Section 4, we determine the smallest possible nonparametric correlation coefficients. The joint distribution of with the maximal entropy is found in Section 5, followed by examples in Section 6. A multivariate extension and a generalization to random elements in partially ordered spaces are presented in Section 7.
2 Characterization of stochastically ordered copulas
A (bivariate) copula is the cdf of a random vector with standard uniformly distributed marginals. The joint cdf of each random vector can be written as
[TABLE]
for a copula with and being the marginal cdfs. A copula is called symmetric on a set if for all . For , symmetry of the copula is equivalent to the pair being exchangeable.
The following theorem provides a characterization of all dependence structures that are compatible with the stochastic ordering of the marginals.
Theorem 2**.**
Let be a random vector with marginals and having cdfs and , respectively. Then if and only if for all and the joint cdf of is given by
[TABLE]
for all , where is given by (1) and is a symmetric copula on the range of such that
[TABLE]
Proof*.*
Sufficiency. Let be distributed according to the symmetric bivariate cdf . By construction, and are identically distributed with cdf . Furthermore, . The distribution of is given by (2), so sufficiency follows from Theorem 1 because .
Necessity. Let be as in Theorem 1. Then, any copula of satisfies
[TABLE]
for all . ∎
Diagonal sections of copulas, i.e. the functions that arise as , , for some copula are characterized by the following properties, see [4].
Definition 1**.**
A function is a diagonal section if
(D1)
, ;
(D2)
it is increasing;
(D3)
for all ;
(D4)
for all .
For an increasing function , denote by
[TABLE]
the generalized inverse of , where . Note that
[TABLE]
for all from the range of , see [5, Proposition 2.3(4)]. For notational convenience, we set , .
The following result follows from the representation (3) of the diagonal section of the copula and identity (4).
Corollary 3**.**
Let be given by (1). The function
[TABLE]
given by the composition of and the generalized inverse of , is the restriction of a diagonal section to the range of .
We can also provide a converse to Corollary 3.
Proposition 4**.**
Let be a diagonal section. Then there are cdfs and such that and with defined at (1).
Proof*.*
We can extend to an increasing function on with range . Its generalized inverse is left-continuous. For , we define
[TABLE]
The function is increasing and right-continuous, , and for all . Let , . Then, by (D3),
[TABLE]
Set for and for . The function is a cdf by the above arguments. By (D4), we obtain . We have , . It remains to be checked that . The function is constant on for any . This implies, for any ,
[TABLE]
Equation (3) specifies the diagonal section of the copula for all from the range of . If both and are non-atomic, then this range is and so the diagonal section of is uniquely specified.
Example 1**.**
An Archimedean generator is a decreasing convex function with and , see [6, Theorem 6.3.2]. Note that we define an Archimedean generator following [7]. For the Archimedean copula , , with generator , the diagonal section is .
There are many parametric families of Archimedean generators. For example, the Gumbel family of copulas is generated by , . Then, the cdf constructed in the proof of Proposition 4 is , , with .
Example 2** (Identical distributions).**
If and are identically distributed, then . In this case, the diagonal section of in Theorem 2 is given by for all from the range of . For , by (3),
[TABLE]
hence, . Therefore, by (2), the joint law of satisfies , meaning that almost surely.
Example 3** (Discrete distributions).**
Assume that and have discrete distributions, say supported on with masses and , respectively, and such that . The range of is . The condition (3) on in Theorem 2 is
[TABLE]
While there are clearly many copulas that satisfy this constraint, the condition is sufficient to uniquely determine the joint law of . By (2), .
Example 4** (Disjoint supports).**
Assume that is uniformly distributed on and on . In this case, all kinds of dependency structures between and are allowed. For and , (2) yields that
[TABLE]
As prescribed by (3), for and for . This is the diagonal section of the Fréchet-Hoeffding lower bound.
It is not a contradiction that any copula yields a possible bivariate law of such that almost surely, but in the representation of Theorem 2, there are restrictions on the diagonal of the symmetric copula . The copula is the copula of the random permutation of , and as such it cannot put any mass on the squares or .
3 Pointwise bounds on the joint cdf
By Theorem 2, the range of all possible bivariate cdfs of random vectors with given marginals and and such that a.s. depends on the choice of a symmetric copula satisfying (3), equivalently, having the diagonal section (5) on the range of . For a general diagonal section , the following result holds.
Theorem 5** ([8, 9]).**
Each copula with diagonal section satisfies
[TABLE]
where
[TABLE]
is the Bertino copula. The copula has diagonal section .
Denote
[TABLE]
Theorem 6**.**
Each random vector with marginal cdfs and , and such that a.s., has a joint cdf satisfying
[TABLE]
where both bounds are attained, and
[TABLE]
Proof*.*
The upper bound in (8) is the Fréchet–Hoeffding one; it corresponds to complete dependence between and , so that and for a standard uniformly distributed random variable .
For the lower bound, let be a diagonal section which is equal to at (5) on the range of . We continuously extend to the closure of . Continuity of implies that is equal on . The function is bounded below by defined as
[TABLE]
where and . This function is itself a diagonal section which is equal to on . For all it holds that . Therefore, Theorem 5 and (2) imply for ,
[TABLE]
Since
[TABLE]
we can restrict the infimum in (10) to . Hence,
[TABLE]
The last equality holds because always holds (see [5]) and only happens if there is an such that is constant on . But if is constant on some interval, then is necessarily also constant on this interval. ∎
The lower bound in (8) corresponds to the Bertino copula and so yields the least possible dependence between and . If the function at (7) is unimodal, this corresponds to the assumption that the Bertino copula (6) is simple, compare [10]. The unimodality condition (which also appears in Theorem 10) applies in many examples, and simplifies the structure of the distribution considerably.
Corollary 7**.**
Assume that the function is unimodal, that is, increases on and decreases on for some . Then
[TABLE]
Proof*.*
Let . By the unimodality, the infimum of over is attained at one of the end-points or . Therefore,
[TABLE]
which yields (11). ∎
Example 5** (Disjoint supports – Example 4 continued).**
We assume that is uniformly distributed on and on . Then,
[TABLE]
which is clearly unimodal. Therefore, by Corollary 7, and for
[TABLE]
which corresponds to choosing the Fréchet-Hoeffding lower bound as the dependence structure for .
As shown by Rogers in [11], if is unimodal, the distribution given by (11) maximizes the payoff (or transportation cost) over all strictly convex decreasing functions . Without unimodality assumption, the joint distribution maximizing the payoff is given by
[TABLE]
This joint distribution satisfies ; it provides the joint distribution with the largest mass concentrated on the diagonal, see [12, Th. 7.2.6].
The following result concerns the support of the random vector with distribution in the case when the cdfs and are continuous. In the general case, the support of is more intricate to describe.
Lemma 8**.**
Suppose that and are continuous. The support of the distribution given at (9) is the set
[TABLE]
where and are the supports of the distributions and , respectively, and
[TABLE]
Here, , denotes the topological boundary of , .
Proof*.*
Let have distribution . Let and . Then,
[TABLE]
The right hand side can only be strictly positive if
[TABLE]
which is the case whenever and . Thus, only points with belonging to and may be in the diagonal parts of the support of . If and is small enough, then
[TABLE]
so cannot belong to the support of .
Conversely, assume that . Since is continuous, the infimum in (15) is attained at some , if , then , hence because . One can argue analogously if . If , then we distinguish two cases. If or , then one can argue as previously. Here denotes the interior of , . If and
[TABLE]
then which yields the claim concerning the diagonal part of the support of .
Now assume that and . Then
[TABLE]
where
[TABLE]
The probability in (16) is strictly positive if and only if or . The point belongs to the support of if and only if for all small enough. Letting converge to zero, we find that a necessary condition is that
[TABLE]
hence . It is not hard to check that the conditions on and in are necessary and sufficient to ensure that is fulfilled for all small enough. ∎
The set from (8) is illustrated in the top-left panel of Fig. 1 using points sampled from .
Example 6** (Disjoint supports – Example 4 continued).**
We assume that is uniformly distributed on and on . The function and the distribution are given at (12) and (13), respectively. The support of is given by
[TABLE]
This follows from Example 5 or Lemma 8. The set at (8) consists of three parts. The first set is , the second set is empty, and the third set is because is the empty set.
4 Nonparametric correlation coefficients
Dependence measures quantitatively summarize the degree of dependence between two random variables and . Kendall’s tau and Spearman’s rho are arguably the two most well-known measures of association whose sample versions are purely based on ranks. If the marginal distributions of and are continuous then the population versions of Kendall’s tau and Spearman’s rho only depend on the copula of . In this section, we assume that both, and are continuous, and hence the copula of is uniquely defined. We refer the reader to [13] for details concerning problems that arise in the case of arbitrary marginal distributions.
Kendall’s tau of a copula is given by
[TABLE]
and, Spearman’s rho is given by
[TABLE]
These two correlation coefficients are monotonic with respect to pointwise, or, concordance ordering of copulas [13]. They take the value one for the joint cdf given by the upper bound in (8). For the copula with given by (9), these dependence measures attain their lowest values calculated as follows.
Theorem 9**.**
Suppose that and are continuous. The smallest possible Kendall’s tau of that satisfies the conditions of Theorem 2 is
[TABLE]
Proof*.*
Writing yields that
[TABLE]
where has cdf given by (9). By Lemma 8, the support of is given by the set at (8). On the set ,
[TABLE]
hence the result. ∎
Theorem 10**.**
Suppose that and have continuous cdfs and with the same support and that the function from (7) is unimodal, strictly increases on and strictly decreases on for some . Then the smallest possible Spearman’s rho of satisfying the conditions of Theorem 2 is
[TABLE]
where and for and .
Proof*.*
Spearman’s rho is given by
[TABLE]
The set from (8), consists of 3 pieces: with the push-forward of , ; with the distribution being the image of the measure on with push-forward of ; and with the distribution being the push-forward of . The push-forward is the image of the measure on the line by the specified map, e.g., the third part if the image of the measure on with under the map .
The result is obtained by splitting the above expectation into these 3 parts. ∎
5 Maximum entropy distributions
We assume that both and have full supports on a (possibly infinite) interval and that their cdfs and are absolutely continuous with densities and . Amongst all joint absolutely continuous laws of with given marginals and and such that a.s., we characterize those maximizing the differential entropy (see [14, Ch. 8]) given by
[TABLE]
These copulas correspond to the least informative (most random) joint distributions, equivalently, to the distributions minimizing the Kullback–Leibler divergence with respect to the uniform distribution. We use the common convention . Independently of our work, maximum entropy distributions of order statistics in the multivariate case have been studied in [15].
Note that the function from (1) is absolutely continuous with density . By Theorem 2, the joint law of is absolutely continuous if and only if the associated symmetric copula is absolutely continuous. We denote its density by . By the symmetry of ,
[TABLE]
Therefore, maximizing over all is equivalent to maximizing over all symmetric copulas with diagonal section . Note that the smallest entropy arises as the limit by considering absolutely continuous distributions approximating the distribution of and for a uniformly distributed .
Butucea et al. [16] characterize the maximum entropy copula with a given diagonal section . We recall some of their notation in order to be able to state our result. For a diagonal section with for all , define for , ,
[TABLE]
and for , set . Butucea et al. [16, Proposition 2.2] show that is the density of a symmetric copula with diagonal section . Note that the derivative of exists almost everywhere as is Lipschitz continuous. For a general diagonal section , due to its continuity, the set is the union of disjoint open intervals , for an at most countable index set . Note that and . For , define
[TABLE]
where is given at (22), and
[TABLE]
Based on the results of [16], we arrive at the following theorem. Recall that .
Theorem 11**.**
Let be a random vector with marginals and satisfying . Suppose that and have identical support being a (possibly unbounded) interval , and that their cdfs and are absolutely continuous with densities and . If
[TABLE]
then
[TABLE]
where the supremum is taken over all possible joint laws of . The maximum is attained when the joint density of is given by
[TABLE]
and is defined at (23). If (24) does not hold, then .
Proof*.*
By substitution,
[TABLE]
The result now follows from [16, Th. 2.5] in combination with (21) and Theorem 2. ∎
The condition for all is equivalent to for all . If this condition holds, then and the formula for the entropy maximizing in Theorem 11 simplifies to
[TABLE]
for , .
6 Examples
Example 7**.**
Let be uniformly distributed on , and let be distributed as the maximum of and another independent uniformly distributed random variable, that is, . In this case is unimodal with the maximum at , and for . The top-left panel of Fig. 1 shows a sample from the distribution . It is easily seen that these values belong to the set given by (8) which consists here of the diagonal of the square and the lower part of the off-diagonal.
The smallest values for Kendall’s tau and Spearman’s rho are and , respectively. The joint density with the maximal entropy is given by
[TABLE]
A sample from this distribution is shown in the bottom panel of Fig. 1. Note that in this example it is easy to simulate from the distribution , and also from the distribution with density given at (25). To simulate a random vector with distribution , generate a random variable which is uniformly distributed on and set
[TABLE]
A random vector with distribution given by the density at (25) is obtained by simulating independent random variables both uniformly distributed on and defining
[TABLE]
Example 8**.**
Theorem 1 establishes a relationship between the distribution of and the order statistics of a suitably chosen exchangeable pair . Assume that and are independent. Then . By Theorem 1, , whence
[TABLE]
Then
[TABLE]
is always unimodal. If is continuous the maximum attained at any lower quartile of . The smallest possible Kendall’s tau equals ; it does not depend on . If we assume additionally that the support of is an interval, Theorem 10 applies and the equation used to find turns into . Substituting this in (19) yields that Spearman’s rho equals for all . If is absolutely continuous with density the maximum entropy is attained on the density , , where .
Example 9**.**
Let be uniform on , and let with . Then , , and
[TABLE]
for , see (2). In this case, (9) yields that
[TABLE]
The function is unimodal and attains its maximum at . The smallest Kendall’s tau is
[TABLE]
Note that if , if , and as .
We are not able to provide an explicit formula for Spearman’s rho in terms of but Fig. 2 shows and as a function of .
Example 10** (Unimodal densities).**
Let be a random variable with cdf and unimodal density whose support is . Let , , with , hence , . Then Theorem 9 yields for Kendall’s tau of the distribution at (9)
[TABLE]
which is the convolution of and .
Let us additionally assume that is symmetric about its mode at zero. Then , and the unimodal function has its maximum at . Hence, the function in Theorem 10 is given by . If is the standard Gaussian cdf, then , and, therefore,
[TABLE]
In particular, if , if , and if , then .
For Spearman’s rho, we numerically computed the integrals in (19) for . The values of and as functions of are displayed in Fig. 3.
Example 11** (Exponential marginal distributions).**
Let be an exponential random variable with cdf , and let , , with . Considering , shows that we are in the same setting as in Example 10 because Kendall’s tau and Spearman’s rho are invariant under monotone transformations of the marginals and the stochastic ordering is preserved if we transform both marginals with the same increasing function.
However, we can also compute and directly. Kendall’s tau is given by
[TABLE]
Note that if , if , and as .
The function
[TABLE]
is unimodal on with maximum at
[TABLE]
Therefore, we can use (19) to compute . Considering the increasing transformation , , we see that and only depend on . Fig. 4 provides plots of and as functions of .
7 Generalizations
A multivariate version of Theorem 1 is the following.
Theorem 12**.**
A random vector with marginal cdfs satisfies if and only if where are the order statistics of a random vector with exchangeable components and such that for
[TABLE]
Proof*.*
Let be an exchangeable random vector that satisfies the above condition. By [17, Proposition 4.4.1], we have that
[TABLE]
Conversely, if the vector is obtained as the random permutation of , then it is exchangeable and the formula for is essentially the inversion of the first equality in the above equation. ∎
A variant of Theorem 1 applies to random elements in a lattice with partial order , and with being the maximum and being the minimum operation. Endow with the -algebra generated by for all . Since these events form a -system, the values , , uniquely determine the distribution of an -valued random element .
In this case, Theorem 1 admits a direct generalization. Namely a.s. if and only if and for a pair of exchangeable random elements in such that
[TABLE]
and .
Acknowledgement
The problem of characterizing bivariate copulas with stochastically ordered marginals was brought to the attention of the second author by Nicholas Kiefer from the Economics Department at Cornell University.
The authors are grateful to the referees for spotting mistakes in the original version of this paper and suggesting numerous improvements.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Navarro, F. Spizzichino, On the relationships between copulas of order statistics and marginal distributions, Statist. Probab. Lett. 80 (2010) 473–479.
- 2[2] R. Lebrun, A. Dutfoy, Copulas for order statistics with prescribed margins, J. Multivariate Anal. 128 (2014) 120–133.
- 3[3] M. Dietz, S. Fuchs, K. D. Schmidt, On order statistics and their copulas, Statist. Probab. Lett. 117 (2016) 165–172.
- 4[4] F. Durante, R. Mesiar, C. Sempi, Copulas with given diagonal section: some new results, in: Proc. of the Joint 4th Conf. of the European Society for Fuzzy Logic and Technology and the 11th Rencontres Francophones sur la Logique Floue et ses Applications, 2005, pp. 931–936.
- 5[5] P. Embrechts, M. Hofert, A note on generalized inverses, Math. Method. Oper. Res. 77 (2013) 423–432.
- 6[6] B. Schweizer, A. Sklar, Probabilistic Metric Spaces, North-Holland Publishing Co., New York, 1983.
- 7[7] A. Mc Neil, J. Nešlehová, Multivariate archimedean copulas, d 𝑑 d -monotone functions and l 1 subscript 𝑙 1 l_{1} -norm symmetric distributions, Ann. Stat. 37 (2009) 3059–3097.
- 8[8] R. B. Nelsen, J. J. Quesada Molina, J. A. Rodríguez Lallena, M. Úbeda Flores, Best-possible bounds on sets of bivariate distribution functions, J. Multivariate Anal. 90 (2004) 348–358.
