Stochastic comparisons of sample mean differences for multivariate random variables
Xuehua Yin, Dan Zhu, Chuancun Yin

TL;DR
This paper extends stochastic ordering results of the Gini index from multivariate normal risks to more general multivariate elliptical risks, analyzing their dispersion matrices and tail probabilities.
Contribution
It generalizes existing stochastic ordering results for Gini indexes to multivariate elliptical risks and revises large deviation results for these risks.
Findings
Conditions on dispersion matrices ensure monotonicity of Gini index
Established stochastic ordering for multivariate elliptical risks
Revised large deviation results for Gini indexes
Abstract
In this paper, we establish the stochastic ordering of the Gini indexes for multivariate elliptical risks which generalized the corresponding results for multivariate normal risks. It is shown that several conditions on dispersion matrices and the components of dispersion matrices of multivariate normal risks for the monotonicity of the Gini index in the usual stochastic order proposed by Samanthi, Wei and Brazauskas (2016) and Kim and Kim (2019) are also suitable for multivariate elliptical risks. We also study the tail probability of Gini index for multivariate elliptical risks and revised a large deviation result for the Gini indexes of multivariate normal risks in Kim and Kim (2019).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Statistical Distribution Estimation and Applications · Probability and Risk Models
Stochastic ordering of Gini indexes for multivariate elliptical random variables
Chuancun Yin
*School of Statistics, Qufu Normal University
Shandong 273165, China
*e-mail: [email protected]
Abstract
In this paper, we establish the stochastic ordering of the Gini indexes for multivariate elliptical risks which generalized the corresponding results for multivariate normal risks. It is shown that several conditions on dispersion matrices and the components of dispersion matrices of multivariate normal risks for the monotonicity of the Gini index in the usual stochastic order and increasing convex order proposed by Samanthi, Wei and Brazauskas (2016) and Kim and Kim (2019) are also suitable for multivariate elliptical risks. We also study the tail probability of Gini index for multivariate elliptical risks and revised a large deviation result for the Gini indexes of multivariate normal risks in Kim and Kim (2019).
Keywords: Gini index; increasing convex order; large deviation; multivariate elliptical risk; multivariate normal risk; usual stochastic order
1 Introduction
Since Corrado Gini introduced an index to measure concentration or inequality of incomes (see Gini (1936), for English translation of the original article), it has been studied extensively because of its importance in many fields such as economics, actuarial science, finance, operations research, queuing theory and statistics; see, for example, Denuit et al. (2005), McNeil et al. (2005), Brazauskas et al. (2007), Goovaerts et al. (2010), Frees et al. (2011, 2014), Samanthi et al. (2017), to name but a few. Recently, there is a growing interest in the study and applications of the Gini index; See e.g. Samanthi, Wei and Brazauskas (2016), Kim and Kim (2019) and the references therein. Let be portfolios of risks or random variables, these portfolios can be independent or dependent. To check whether or not the risk measures ’s are all equal. Brazauskas et al. (2007) and Samanthi et al. (2017) proposed a nonparametric test statistic based on the following Gini index:
[TABLE]
It is easy to see that can be rewritten in terms of order statistics:
[TABLE]
where denotes the th smallest component of , i.e. . Note that the expression (2.2) in Samanthi et al. (2016) lost a negative sign before summation sign.
The comparison of Gini indexes of multivariate elliptical risks also shows its own independent interest. For example, for the ordering of Gini indexes of multivariate normal risks, Samanthi et al. (2016) proposed the following conjecture.
Conjecture 1. Let random vector follow a multivariate normal distribution . Then its Gini index decreases in the sense of usual stochastic order as the covariance matrix increases componentwise with diagonal elements remaining unchanged.
Samanthi et al. (2016) pointed out that proving Conjecture 1 is a challenging task. They partially completes this task and claim that generalizes the conclusion to elliptical distributions, yet still leaves some open problems. Recent paper of Kim and Kim (2019) shows that this conjecture is true when . However, this conjecture is not true when . By using the positive semidefinite ordering of covariance matrices, they obtain the usual stochastic order of the Gini indexes for multivariate normal risks and generalized to the scale mixture of multivariate normal risks. In this paper we generalize the main results in Samanthi et al. (2016) and Kim and Kim (2019) from multivariate normal risks and scale mixture of multivariate normal risks to multivariate elliptical risks and scale mixture of multivariate elliptical risks.
The rest of the paper is organized as follows. In the next section, we introduce some basic notation and review the definition and properties of stochastic orders and elliptical distributions. Sections 3 and 4 establish the usual stochastic orders and increasing convex orders between Gini indexes for multivariate elliptical risks. We study the tail probability of Gini index for multivariate elliptical risks and a large deviation result for the Gini indexes of multivariate normal risks in Section 5. Section 6 provides concluding remarks of the paper.
2 Preliminaries
In this section we fix the notation that will be used in the sequel and we recall some well known results about stochastic orders of random variables and elliptical distributions. Throughout the paper, we use bold letters to denote vectors or matrices. For example, is a row vector and is an matrix. In particular, the symbol denotes the -dimensional column vector with all entries equal to 0, denotes the -dimensional column vector with all components equal to 1, and denotes the matrix with all entries equal to 1. Denote as the matrix having all components equal to 0 and denotes the identity matrix. For symmetric matrices and of the same size, the notion or means that is positive semidefinite.
In order to compare Gini indexes, we recall definitions of some stochastic orders, see, Denuit et al. (2005) and Shaked and Shanthikumar (2007). Let and be two random variables, is said to be smaller than in usual stochastic order, denoted as , if for all real numbers . Random vector is said to be smaller than random vector in increasing convex order (written ), if for all increasing convex functions such that the expectations exist. A function is said to be supermodular if for any it holds that
[TABLE]
where the operators and denote coordinatewise minimum and maximum respectively. Supermodular functions are also called quasimonotone or -superadditive. Note that if is twice differentiable, then is supermodular if and only if
[TABLE]
for all and . If is supermodular, then is called submodular. Random vector is said to be smaller than random vector in the supermodular order, denoted as , if for any supermodular function such that the expectations exist.
We next state some basics about elliptical distributions. Elliptical distributions have been used widely in insurance, finance and multicriteria decision theory; see, for example, Owen and Rabinovitch (1983), Landsman and Valdez (2003), Hamada and Valdez (2008), Landsman, Makov and Shushi (2018), Sha et al. (2019) and Kim and Kim (2019). We follow the notation of Cambanis, Huang and Simons (1981) and Fang, Kotz and Ng (1990). Let be a class of functions such that function is an -dimensional characteristic function. It is clear that
[TABLE]
Denote by the set of characteristic generators that generate an -dimensional elliptical distribution for arbitrary . That is
An random vector is said to have an elliptically symmetric distribution if its characteristic function has the form for all , where is called the characteristic generator satisfying , (-dimensional vector) is its location parameter and ( matrix with ) is its dispersion matrix (or scale matrix). The mean vector (if it exists) coincides with the location vector and the covariance matrix Cov (if it exists), being . We shall write . It is well known that admits the stochastic representation
[TABLE]
where is a square matrix such that , is uniformly distributed on the unit sphere , is the random variable with in called the generating variate and is called the generating distribution function, and are independent. In general, an elliptically distributed random vector does not necessarily possess a density. However, if density of exists it must be of the form
[TABLE]
for some non-negative function satisfying the condition
[TABLE]
and a normalizing constant given by
[TABLE]
The function is called the density generator. One sometimes writes for the -dimensional elliptical distributions generated from the function . In this case in (1.1) has the pdf given by
[TABLE]
Theorem 2.21 in Fang, Kotz and Ng (1990) shows that if and only if is a mixture of normal distributions. Some such elliptical distributions are the multivariate normal distribution, the multivariate -distribution, the multivariate Cauchy distribution and the exponential power distribution with . Some elliptical distributions like logistic distribution and Kotz type distribution are not mixture of normal distributions. A comprehensive review of the properties and characterizations of elliptical distributions can be found in Cambanis et al. (1981) and Fang et al. (1990).
3 Usual stochastic order of Gini indexes
In this section, we extended the results of multivariate normal risks and scale mixture multivariate normal risks in Samanthi et al. (2016) and Kim and Kim (2019) to scale mixture multivariate elliptical risks. To compare the usual stochastic orders between Gini indexes for multivariate elliptical risks, we use the following result due to Fefferman, Jodeit, and Perlman (1972); see Eaton and Erlman (1991) for a different proof. In the case of Gaussian distribution, this was proved by Anderson (1955). Let denote the class of all convex, centrally symmetric (i.e., ) subsets of .
Lemma 3.1**.**
Suppose that and . If , then for every ,
[TABLE]
The following result generalized Proposition 2 in Kim and Kim (2019) in which they only considered a special class of multivariate elliptical risks with zero mean vector, i.e., scale mixture of multivariate normal risks with zero mean vector.
Proposition 3.1**.**
Let and . If , then
[TABLE]
In particular, if , or , then
[TABLE]
Proof. If and , then and , and thus (3.1) follows from Lemma 3.1 by taking for as in Kim and Kim (2019). It is easy to check that the Gini index is invariant under drift , i.e., for all -dimensional random vector . Therefore, (3.2) follows.
We will extend the result of Proposition 3.1 to the scale mixture of multivariate elliptical risks.
Definition 2.1 A -dimensional random variable is said to have a scale mixture of elliptical distributions with the parameters and , if
[TABLE]
where , is a nonnegative, scalar-valued random variable with the distribution , and are independent, , with , and is the square root of . Here is an vector of zeros, and is an identity matrix. We will use the notation .
Note that when we get the multivariate normal variance mixture distribution (see, e.g., McNeil et al., 2005); When we have the variance mixture of the Kotz-type distribution introduced by Arslan (2009).
Proposition 3.1 can be generalized to scale mixture of multivariate elliptical risks.
Proposition 3.2**.**
Let and . If , then
[TABLE]
In particular, if , or , then
[TABLE]
Proof. It can be easily seen that for any , and . Since , one has . By Proposition 3.1, given , we get
[TABLE]
Or, equivalently,
[TABLE]
Therefore, for all ,
[TABLE]
which is (3.4). In particular, if , or , then and , and (3.5) follows.
An important property of elliptical distributions is that linear transformations of elliptical vectors are also ellipticals, with the same characteristic generator. Specifically
Lemma 3.2**.**
Suppose that , is an matrix of rank , and is an vector, then .
We will give a weaker sufficient condition for stochastic ordering of Gini indexes for multivariate elliptical risks.
Proposition 3.3**.**
Let and . If , where is an matrix defined as
[TABLE]
Then
[TABLE]
In particular, if , or , then
[TABLE]
Proof. If and , by Lemma 3.2 we get
[TABLE]
and
[TABLE]
According to Proposition 3.1, if , then
[TABLE]
In particular, if , or , then . Thus
[TABLE]
and (3.7) follows since and .
The following proposition generalized the result of Proposition 4.4 in Samanthi et al. (2016) in which only the multivariate normal risks with zero mean vectors were considered. Moreover, we provide a short proof.
Proposition 3.4**.**
Let and . If there exists such that , then
[TABLE]
where is defined in Proposition 3.3. In particular, if , or , then
[TABLE]
Proof. By Proposition 3.3, it suffices to show that . In fact,
[TABLE]
since as desired.
Remark 3.1**.**
If , then for any , then . But conversely is not true in general.
Proposition 3.5 can be generalized to scale mixture of multivariate elliptical risks. The proof is very similar to that used in extending Propositions 3.1 to 3.2 and hence is omitted.
Proposition 3.5**.**
Let and . If , where is defined in Proposition 3.3, then
[TABLE]
In particular, if , or , then
[TABLE]
Proposition 3.6**.**
Let and . If there exists such that , then
[TABLE]
In particular, if , or , then
[TABLE]
where is defined in Proposition 3.3.
Proof. It is an immediate consequence of Proposition 3.5 since the condition implies as shown in the proof of Proposition 3.4.
The condition on the components of dispersion matrix of multivariate normal risk or scale mixture of multivariate normal risk for the monotonicity of the Gini index in the usual stochastic order proposed by Kim and Kim (2019) also suitable for general multivariate elliptical risk or scale mixture of multivariate elliptical risk, as shown below.
The following result generalized Propositions 3 and 4 in Kim and Kim (2019) in which they only considered a special class of multivariate elliptical risks with zero mean vector, i.e., the multivariate normal risks and scale mixture of multivariate normal risks with zero mean vector.
Proposition 3.7**.**
Let and with and . Let , if , and for other , . Then
[TABLE]
In particular, if , or , then
[TABLE]
Proof. According to Kim and Kim (2019), under the assumed condition, we know that,
[TABLE]
from which we get
[TABLE]
We conclude that the latter matrix is positive semidefinite. In fact, for any ,
[TABLE]
Therefore, Then Proposition 3.6 implies the desired results.
The following result generalized Propositions 5 and 6 in Kim and Kim (2019) in which they only considered the multivariate normal risks and scale mixture of multivariate normal risks with zero mean vector.
Proposition 3.8**.**
Let and with and . Let , if
[TABLE]
then
[TABLE]
In particular, if , or , then
[TABLE]
Proof. Under the assumed condition, Kim and Kim (2019) found that,
[TABLE]
from which we get
[TABLE]
where is defined in Proposition 3.3. We find that . Therefore, Proposition 3.5 provides the desired result.
4 Increasing convex order of Gini indexes
Samanthi, Wei and Brazauskas (2016) established a sufficient and necessary condition for the supermodular order between two scale mixture of multivariate normal risks. Based on this result they found a sufficient condition for the increasing convex order of Gini indexes for two 3-dimensional elliptical random variables. In addition, they remarked that ordering in the increasing convex order for higher dimensional risk is still an open problem. However, Proposition 3.5 in Samanthi, Wei and Brazauskas (2016) is not true due to their Lemma 3.2 is not true. For example, is not a supermodular function. The following theorem solves this open problem.
Theorem 4.1**.**
Let and with , and . If and have the same marginals and for all , then .
Proof. According to Lemma 3.1 in Samanthi, Wei and Brazauskas (2016) is supermodular and componentwise monotone. For any convex and increasing function , a result of Topkis (1968) shows that the composition is componentwise monotone and supermodular; See also Marshall, Olkin and Arnold (2011). This, together with Proposition 3.4 in Samanthi, Wei and Brazauskas (2016) yields the desired result.
The following result is a direct consequence of Theorem 4.1, which generalizes Proposition 3.6 in Samanthi, Wei and Brazauskas (2016) for the case of zero mean vector.
Corollary 4.1**.**
Let and with , and . If and have the same marginals and for all , then given the expectations exist.
5 Tail asymptotic results for Gini indexes
In this section, we will discuss the asymptotic result for the tail probability of Gini index when . The following lemma can be found in Fang and Liang (1989), see also Tong (1990).
Lemma 5.1**.**
For all real vectors and all given real numbers , we have
[TABLE]
where is the rearranged order of and is the vectors of permutations of .
We first give a counterexample demonstrating that the large deviation result for Gini indexes in Kim and Kim (2019) is not true.
Example 4.1. Let and , where
[TABLE]
Then with
[TABLE]
Recall that
[TABLE]
and note that the Pearson correlation coefficient of and is , so we get and hence . It follows that for ,
[TABLE]
We have
[TABLE]
due to the fact that . Therefore,
[TABLE]
This, together with (5.1) implies that
[TABLE]
However, in the case of and , the limit in Theorem 5 of Kim and Kim (2019) is
To discuss the asymptotic result for the tail probability of Gini index , we first recall some results for elliptical distributions. Consider the linear transformation , where
[TABLE]
where and is an matrix such that is a permutation of . If , then by Lemma 3.2, . This, together with Lemma 5.1 implies that
[TABLE]
This result in the multinormal case can be found in Tong (1990). In particular, taking leads to the following result.
Proposition 5.1**.**
Assume that , then for any ,
[TABLE]
Or, equivalently,
[TABLE]
where with , and
[TABLE]
Here is a permutation of , .
The next theorem establishes a large deviation result for the Gini indexes of multivariate normal risks which corrected the result of Theorem 5 in Kim and Kim (2019).
Theorem 5.1**.**
Let , the we have
[TABLE]
where are diagonal elements of matrix in Proposition 4.1 with .
Proof By Proposition 5.1, if , then . In particular, where ’s are the elements of vector and ’s are diagonal elements of matrix . Using the well known fact
[TABLE]
we obtain, for all ,
[TABLE]
or, equivalently,
[TABLE]
Using (5.4) we get
[TABLE]
Without loss of generality we assume that . Then (5.6) can be rewritten as
[TABLE]
where
[TABLE]
One easily obtains
[TABLE]
It follows from (5.5), (5.7) and (5.8) that
[TABLE]
as desired.
6 Concluding remarks
In this paper, we have considered usual stochastic order and increasing convex order problems about the Gini indexes for multivariate elliptical random variables. The related issues for multivariate normal risks and scale mixture of multivariate normal risks have been studied by Samanthi et al. (2016) and Kim and Kim (2019). Here, we have investigated the issues for multivariate elliptical risks and scale mixture of multivariate elliptical risks. This paper also answered the following open problems proposed in the Concluding Remarks in Samanthi et al. (2016): To what extent can Gini indexes of multivariate elliptical risks be ordered in the sense of usual stochastic order? Does the conclusion still hold for high dimensional risks with general elliptical distribution? We also solve another open problem in Samanthi et al. (2016) about the increasing convex order of Gini indexes for higher dimensional risks. In addition, we found the tail probability of Gini index when . Especially, a large deviation result for the Gini indexes of multivariate normal risks was established which revised the corresponding result in Kim and Kim (2019).
Acknowledgements The author is grateful to Managing Editor Rob Kaas and the referees for their suggestions which helped to improve this article. The research was supported by the National Natural Science Foundation of China (No. 11171179, 11571198, 11701319).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Anderson, T. W., 1955. The integral of a symmetric unimodal function over a symmetric convex set and some probability inequalities. Proc. Amer. Math. Soc. 6, 170-176.
- 2[2] Anderson, T.W., 1996. Some inequalities for symmetric convex sets with applications. Ann. Statist. 24 (2), 753-762.
- 3[3] Arslan, O., 2009. Variance-mean mixture of Kotz-Type distributions. Communications in Statistics- Theory and Methods 38(2), 272-284.
- 4[4] Brazauskas, V., Jones, B.L., Puri, M.L., Zitikis, R., 2007. Nested L-statistics and their use in comparing the riskiness of portfolios. Scand. Actuar. J. 3, 162-179.
- 5[5] Cambanis, S., Huang, S., Simons, G., 1981. On the theory of elliptically contoured distributions. Journal of Multivariate Analysis 11, 365-385.
- 6[6] Das Gupta, S., Eaton, M.L., Olkin, I., Perlman, M.D., Savage, L.J., Sobel, M., 1972. Inequalities on the probability content of convex regions for elliptically contoured distributions. In: Sixth Berkeley Symposium on Probability and Statistics, II, pp. 241-265.
- 7[7] Denuit, M., Dhaene, J., Goovaerts, M., Kaas, R., 2005. Actuarial Theory for Dependent Risks: Measures, Orders and Models, Wiley, New York.
- 8[8] Eaton, M.L., Perlman, M.D., 1991. Multivariate probability inequalities: convolution theorems, composition theorems, and concentration inequalities. In: Lecture Notes-Monograph Series, vol. 19, Stochastic Orders and Decision under Risk, pp. 104-122.
