A note on conditional covariance matrices for elliptical distributions
Piotr Jaworski, Marcin Pitera

TL;DR
This paper derives an analytical formula for conditional covariance matrices of elliptically distributed vectors, simplifying calculations and exploring properties of quantile-based sets under multivariate normality.
Contribution
It introduces a univariate invariant that simplifies the calculation of conditional covariance matrices for elliptical distributions and characterizes quantile-based sets with equal conditional covariances.
Findings
Derived an explicit formula for conditional covariance matrices.
Introduced a univariate invariant for simplification.
Characterized quantile-based sets with equal conditional covariances.
Abstract
In this short note we provide an analytical formula for the conditional covariance matrices of the elliptically distributed random vectors, when the conditioning is based on the values of any linear combination of the marginal random variables. We show that one could introduce the univariate invariant depending solely on the conditioning set, which greatly simplifies the calculations. As an application, we show that one could define uniquely defined quantile-based sets on which conditional covariance matrices must be equal to each other if only the vector is multivariate normal. The similar results are obtained for conditional correlation matrices of the general elliptic case.
| k | partition ratio | ||||||
|---|---|---|---|---|---|---|---|
| 1 | 0.500 | - | - | - | - | - | 50/50 |
| 2 | 0.198 | 0.802 | - | - | - | - | 20/60/20 |
| 3 | 0.075 | 0.500 | 0.925 | - | - | - | 7/43/43/7 |
| 4 | 0.027 | 0.270 | 0.730 | 0.973 | - | - | 3/24/46/24/3 |
| 5 | 0.010 | 0.133 | 0.500 | 0.867 | 0.990 | - | 1/12/37/37/12/1 |
| 6 | 0.004 | 0.062 | 0.307 | 0.693 | 0.938 | 0.996 | 0.5/5/25/39/25/5/0.5 |
| k | v | partition ratio | |||
|---|---|---|---|---|---|
| 3 | 0.045 | 0.955 | - | 4.5/91.0/4.5 | |
| 5 | 0.123 | 0.877 | - | 12.3/75.4/12.3 | |
| 7 | 0.150 | 0.850 | - | 15.0/75.0/15.0 | |
| 10 | 0.166 | 0.834 | - | 16.6/66.8/16.6 | |
| 2 | 12 | 0.173 | 0.827 | - | 17.3/65.4/17.3 |
| 15 | 0.179 | 0.821 | - | 17.9/64.3/17.9 | |
| 25 | 0.187 | 0.813 | - | 18.7/62.7/18.7 | |
| 50 | 0.193 | 0.807 | - | 19.3/61.4/19.3 | |
| 100 | 0.196 | 0.804 | - | 19.6/60.8/19.6 | |
| 0.198 | 0.802 | - | 19.8/60.4/19.8 | ||
| 3 | 0.002 | 0.500 | 0.998 | 0.2/49.8/49.8/0.2 | |
| 5 | 0.016 | 0.500 | 0.984 | 1.6/48.4/48.4/1.6 | |
| 7 | 0.032 | 0.500 | 0.968 | 3.2/46.8/46.8/3.2 | |
| 10 | 0.045 | 0.500 | 0.955 | 4.5/45.5/45.5/4.5 | |
| 3 | 12 | 0.051 | 0.500 | 0.949 | 5.1/44.9/44.9/5.1 |
| 15 | 0.055 | 0.500 | 0.945 | 5.5/44.5/44.5/5.5 | |
| 25 | 0.063 | 0.500 | 0.937 | 6.3/43.7/43.7/6.3 | |
| 50 | 0.069 | 0.500 | 0.931 | 6.9/43.1/43.1/6.9 | |
| 100 | 0.072 | 0.500 | 0.928 | 7.2/42.8/42.8/7.2 | |
| 0.075 | 0.500 | 0.925 | 7.5/42.5/42.5/7.5 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Bayesian Modeling and Causal Inference
A note on conditional covariance matrices for elliptical distributions
Piotr Jaworski111Institute of Mathematics, University of Warsaw, Warsaw, Poland. , Marcin Pitera222Institute of Mathematics, Jagiellonian University, Cracow, Poland.
Abstract
In this short note we provide an analytical formula for the conditional covariance matrices of the elliptically distributed random vectors, when the conditioning is based on the values of any linear combination of the marginal random variables. We show that one could introduce the univariate invariant depending solely on the conditioning set, which greatly simplifies the calculations. As an application, we show that one could define uniquely defined quantile-based sets on which conditional covariance matrices must be equal to each other if only the vector is multivariate normal. The similar results are obtained for conditional correlation matrices of the general elliptic case.
Keywords: elliptical distribution, conditional covariance, conditional correlation, tail covariance matrix, tail conditional variance.
MSC 2010: 62H05, 60E05.
1 Introduction
Consider an -dimensional elliptically distributed random vector with finite covariance matrix . Let denote any non-trivial linear combination of margins of . In this short note we provide the analytical formula for conditional covariance matrix , where is any Borel measurable set for which is of positive measure. We show that this conditional matrix could be expressed as
[TABLE]
where is the vector of regression coefficients from -orthogonal projection of margins of onto , and is an invariant depending only on the characteristic generator of , the probability distribution of , and set . This substantially reduces the calculation of as we do not need to consider the conditional -dimensional vector . In particular, we show that for the multivariate normal random vector one only need to consider the conditional variance of on set as the value of invariant does not depend on the choice of .
Next, we use this result to construct a set of quantile-based conditional sets, for which the corresponding covariance (or correlation) matrices must coincide, if only is from the particular distribution. This approach could be used to construct statistical test which checks if the sample is from the given multivariate distribution. As a direct application, we show that if we (approximately) split the probability space into three subsets:
[TABLE]
[TABLE]
where , then for any multivariate normal vector the three conditional covariance matrices , , and must be equal to each other. Moreover, this (approximate) 20/60/20 division ratio is a unique ratio with this property. We present similar results for more conditioning sets and quantile-based divisions.
When financial applications are considered, the change of dependance structure in the quantile based conditioning areas is of particular interest, at it might correspond to spatial contagion between assets (see [2, 1] and references therein) or increase of risk in the tails when so called tail conditional variance is considered in the risk measure framework (see [3, 10, 9] and references therein). In the financial context might correspond to a financial portfolio, while in decision sciences it might be associated with the (benchmark) decision criterion.
This note is organised as follows: In Section 2 we provide a set of some underlying concepts that will be used throughout the paper. In Section 3 we introduce the notion of -invariant which is the main tool used for study of conditional covariance and correlation matrices. Section 4 is devoted to the study of conditional variance/covariance matrices. Here, we present the main result of this note – Theorem 4.1, which provides analytic formula for the conditional variance matrices.In Section 5 we consider the special case of multivariate normal vector. Theorem 5.2 is the second main result of this note, and it shows that one might construct unique division partitions on which conditional covariance matrices are equal to each other. Section 6 provides a discussion about the correlation matrices. Proposition 6.2 might be seen as analog of Theorem 5.2 for the general elliptic case.
2 Preliminaries
Let be a probability space and let us fix . Throughout this paper we assume that which means that is an -dimensional elliptically distributed random vector with location parameter , scale matrix , and characteristic generator for which the equality
[TABLE]
is satisfied for any , where (see [8, 4] for details). We also assume that admits the canonical (stochastic) representation with a continuous distribution function given by
[TABLE]
where
is a positive random variable such that ; 2. -
is a invertible square matrix, such that , where is a positively defined (non-degenerate and finite) covariance matrix; 3. -
is a random vector independent of and uniformly distributed on the unit sphere of .
The vector is singular as and the margins of are uncorrelated as . Also, for any orthogonal matrix we get . For technical reasons we write in a vertical form (as the matrix). It is also useful to note that
[TABLE]
and consequently
[TABLE]
In other words, the scale matrix is in fact the covariance matrix of and
[TABLE]
Many known families of elliptical distributions are associated with particular radial distributions of R. For example, if is a multivariate Gaussian vector, then it could be shown that is distributed as a -distribution with degrees of freedom. In this particular case, given and , we use the standard notation .
Next, for any random vectors and , and an -measurable set , such that , we set
[TABLE]
to denote conditional expectation, conditional variance matrix, and conditional covariance matrix on set , respectively. Moreover, throughout this paper we fix
[TABLE]
and consider random variable
[TABLE]
that is a linear combination of coordinates of . We refer to as a benchmark for . Note that when we -orthogonally project ’s onto the line spanned by then the ”Least Square Error Models” are given by
[TABLE]
where the vector of regression coefficients is equal to
[TABLE]
3 -invariant
In this section we introduce two invariants which will be later crucial to the study of the conditional covariance and correlation matrices.
Definition 3.1**.**
Given a characteristic generator and a Borel subset of positive Lebesgue measure, the -invariant and -invariant are given by
[TABLE]
where and denotes the distribution function of .
The -invariant is simply a conditional variance of a standardized margin of an elliptical random vector conditioned on an (standardized) uncorrelated margin. It will be a key tool used in the conditional covariance matrix computation. On the other hand, the -invariant might be seen as a standardized version of -invariant, which will be used for conditional correlation matrices analysis. For future reference please note that given , any -dimensional margin of is also elliptical with the same generator and so is the random vector ; see [4] for details. Also, note that for a normal random vector uncorrelation implies independence, so in that special case -invariant does not depend on and is always equal to one.
It is useful to note, that -invariant could be expressed in terms of so called tail density (cf. [7, Definition 2]) of the corresponding elliptical density generator (assuming it exists).
Proposition 3.2**.**
Let us assume that admits a density. Then, using notation from Definition 3.1, we get
[TABLE]
where , is a random variable with density
[TABLE]
the function is the density generator for and is the corresponding normalizing constant.444Please see [7] for the definition of and .
Proof.
For , we denote by the generator of the density of ; see [7]. In particular, and are the densities of and , respectively. We get
[TABLE]
which concludes the proof. ∎
Representation (3.1) shows that could be considered as a form of probability distortion function; see [7]. Moreover, one could show that the function defined in (3.2) is a density of a spherical random variable. We refer to [7] for more details and examples of tail densities for many know elliptical families.
4 Conditional variance/covariance matrices
We are now ready to present the main result of this note – Theorem 4.1. It states that for the benchmark random variable and a conditioning set , where and , we can easily compute the conditional covariance matrix of on set . To do so, we only need to take into account (univariate) conditional random variable and the value of the corresponding -invariant. This potentially simplifies the calculations, as we do not need to consider the conditional -dimensional vector .
Theorem 4.1**.**
Let for an elliptical random vector with . Let be such that and . Then,
[TABLE]
*where and . *
Proof.
The proof is based on a stochastic representation (2.1). We set
[TABLE]
where and is an orthogonal matrix such that
[TABLE]
for . Then,
[TABLE]
and
[TABLE]
Hence is a spherical vector with a version of as the first coordinate; see [4]. Furthermore
[TABLE]
and
[TABLE]
Moreover, we know that
[TABLE]
which implies that conditioning by coincides with conditioning by . For any Borel function we have
[TABLE]
Since is a spherical random vector, the random vectors
[TABLE]
have the same probability distribution as . The same is valid for conditional distributions
[TABLE]
Consequently, for we have
[TABLE]
and
[TABLE]
Furthermore,
[TABLE]
Hence, recalling (4.2) and noting that as well as , we get
[TABLE]
which concludes the proof of Formula (4.1).
∎
It is useful to note that the -invariant from Theorem 4.1 could be expressed using the stochastic representation (2.1). Indeed, using notation from Theorem 4.1 it can be shown that
[TABLE]
It is also interesting to note that the conditional covariance matrix of and on set could be explicitly expressed in terms of and the vector of regression coefficients . This is a statement of Proposition 4.2.
Proposition 4.2**.**
Under assumptions and notations from Theorem 4.1 we get
[TABLE]
Proof.
Let . To prove Proposition 4.2 it is enough to note that . Using notation from Theorem 4.1 we immediately get
[TABLE]
∎
Equality (4.4) has interesting linear modelling consequences. Let be a horizontal vector from . If a random variable is uncorrelated with , then the same remains valid under conditioning on set . Indeed,
[TABLE]
Hence, the ”Least Square Error Models” introduced in (2.2) remain invariant under conditioning of . In other words, for we get
[TABLE]
and the -orthogonal projection commutes with conditioning of . Note that for the case where is a half-line, the conditional covariance matrix was considered in [10]; see [10, Theorem 1] and [10, Theorem 3] for details.
5 Multivariate Normal
In this section we discuss in details a special case when . We recall that for the normal characteristic generator the -invariant is independent of and always equal to one. Combining this with Theorem 4.1 we get the following corollary (cf. [6, Lemma 3.2]).
Corollary 5.1**.**
Let and be such that and . Then,
[TABLE]
where
[TABLE]
From Corollary 5.1 we see that the conditional covariance matrix depend on only through the variance of the benchmark on set , i.e. value . Consequently, if we have any number of Borel sets such that
[TABLE]
then we immediately get equality of corresponding covariance matrices
[TABLE]
Moreover, given , it is easy to show that for any one could find (a unique) partition of into Borel subsets
[TABLE]
where are such that
[TABLE]
While the sequence depend on the choice of it could be expressed as
[TABLE]
where and is a sequence of numbers independent of the choice of . In other words, assuming that and we get that there exists a unique sequence for which the chain of equalities
[TABLE]
is satisfied. Moreover, the sequence is independent of and . For the idea of the proof (and the exact proof for the case ) we refer to [6, Lemma 3.3]. Combining Corollary 5.1 and (5.2) we get the following result, which could be seen as a generalisation of [6, Theorem 3.1].
Theorem 5.2**.**
Let , , and . Then,
[TABLE]
for a unique sequence , where . Moreover, the sequence is fixed and independent of and .
We know that if the conditional covariance matrices defined in (5.3) do not coincide (for any and the corresponding unique sequence), then is not normally distributed. Consequently, this approach could be used to construct statistical test, which checks if the sample comes from a multivariate normal distribution. When financial applications are considered, the change of dependance structure in the quantile based conditioning areas is of particular interest, at it might correspond to spatial contagion between assets or increase of risk in the tails. Note that in this particular case might be treated as a financial portfolio and lower quantile sets might be associated with periods with low return rates. Consequently, we believe that Theorem 5.2 might be interesting from a practical perspective.
Note that in a simplified form, one could simply take any linear combination of univariate coordinates of , do the conditioning based on (pre-specified) quantiles and compare the variances. Alternatively, one can focus on the dependance structure and compare correlation matrices instead of covariance matrices.
Let us now have a brief discussion about the unique sequence given in Theorem 5.2. For a fixed , due to symmetry of , we immediately get
[TABLE]
The approximate values of sequences for are presented in Table 1. In particular, setting we get the partition of approximate ratio 20%, 60% and 20%. This might be associated with so called 20-60-20 Rule; see [6] for a more detailed discussion.
6 Conditional correlation matrices
In Section 5 we have shown that Theorem 4.1 might be used to test if is (not) normally distributed; see Theorem 5.2 for details. The choice of the quantile-based conditioning sets in (5.3) was robust in the sense that it was independent of the choice of , , and . Unfortunately, this result is not true for a general elliptical random vector as equality of conditional variances for two subsets does not necessarily imply equality of the corresponding -invariants. Nevertheless, it is easy to note that using the -invariant we can rewrite Equation (4.1) as
[TABLE]
where and . From Equality (6.1) we see that if we have two subsets and such that
[TABLE]
then the conditional matrices and are proportional with proportion ratio equal to
[TABLE]
This immediately implies equality of the corresponding conditional correlation matrices. This is in fact the statement of Proposition 6.1, which we present without the proof.
Proposition 6.1**.**
Let . For let
* be an elliptical random vector;* 2. 2)
* be the corresponding benchmark for some fixed ;* 3. 3)
* be the corresponding conditioning set for being the set of positive Lebesgue measure.*
Moreover, let us assume that covariance matrices and weight vectors are proportional, i.e. there exists such that and . Then, the equality
[TABLE]
implies the equality of the conditional correlation matrices and .
In particular, Proposition 6.1 implies the following variation of Theorem 5.2 for correlation matrices:
Proposition 6.2**.**
Let and let be a characteristic generator. Let the partition of the unit interval be such that
[TABLE]
Then, for any elliptical random vector and (where ) we get
[TABLE]
for , .
For many elliptical distributions the unit interval partition mentioned in Proposition 6.2 exists and can be easily approximated. The exemplary (approximate) results for t-student distribution are presented in Table 2.
Acknowledgements
The first author acknowledges the support from National Science Centre, Poland, via project 2015/17/B/HS4/00911.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Durante, E. Foscolo, P. Jaworski, and H. Wang, A spatial contagion measure for financial time series , Expert Systems with Applications 41 (2014), no. 8, 4023–4034.
- 2[2] F. Durante and P. Jaworski, Spatial contagion between financial markets: a copula-based approach , Appl. Stoch. Models Bus. Ind. 26 (2010), 551–564.
- 3[3] E. Furman and Z. Landsman, Tail variance premium with applications for elliptical portfolio of risks , Astin Bulletin 36 (2006), no. 02, 433–462.
- 4[4] E. Gómez, M. A Gómez-Villegas, and J. M. Marín, A survey on continuous elliptical vector distributions , Revista matemática complutense 16 (2003), no. 1, 345–361.
- 5[5] E. Hashorva and P.Jaworski, Gaussian Approximation of Conditional Elliptical Copulas, J. Multivariate Analysis 111 (2012) 397–407.
- 6[6] P. Jaworski and M. Pitera, The 20-60-20 rule , Discrete and continuous dynamical systems series B 21 (2016) 1149-1166.
- 7[7] Z. M. Landsman and E. A. Valdez, Tail conditional expectations for elliptical distributions , North American Actuarial Journal 7 (2003), no. 4, 55–71.
- 8[8] S. Cambanis, S. Huang, and G. Simons. On the theory of elliptically contoured distributions. J. Multivariate Analysis 11 (1981), 3, 368–385.
