Dissecting the multivariate extremal index and tail dependence
Helena Ferreira, Marta Ferreira

TL;DR
This paper investigates the multivariate extremal index, a key parameter in extreme value theory, providing new decompositions, bounds, and insights into its role in tail dependence and maxima of stationary sequences.
Contribution
It introduces two novel decompositions of the multivariate extremal index, establishes improved bounds, and analyzes its influence on tail dependence in stationary sequences.
Findings
New decompositions of the multivariate extremal index
Improved bounds for the extremal index based on marginal dependence
Analysis of the extremal index's role in the limiting distribution of maxima
Abstract
A central issue in the theory of extreme values focuses on suitable conditions such that the well-known results for the limiting distributions of the maximum of i.i.d. sequences can be applied to stationary ones. In this context, the extremal index appears as a key parameter to capture the effect of temporal dependence on the limiting distribution of the maxima. The multivariate extremal index corresponds to a generalization of this concept to a multivariate context and affects the tail dependence structure within the marginal sequences and between them. As it is a function, the inference becomes more difficult, and it is therefore important to obtain characterizations, namely bounds based on the marginal dependence that are easier to estimate. In this work we present two decompositions that emphasize different types of information contained in the multivariate extremal index, an upper…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarket Dynamics and Volatility · Monetary Policy and Economic Impact
Dissecting the multivariate extremal index and tail dependence
Helena Ferreira
Universidade da Beira Interior, Centro de Matemática e Aplicações (CMA-UBI), Avenida Marquês d’Avila e Bolama, 6200-001 Covilhã, Portugal
Marta Ferreira
Center of Mathematics of Minho University
Center for Computational and Stochastic Mathematics of University of Lisbon
Center of Statistics and Applications of University of Lisbon, Portugal
Abstract
A central issue in the theory of extreme values focuses on suitable conditions such that the well-known results for the limiting distributions of the maximum of i.i.d. sequences can be applied to stationary ones. In this context, the extremal index appears as a key parameter to capture the effect of temporal dependence on the limiting distribution of the maxima. The multivariate extremal index corresponds to a generalization of this concept to a multivariate context and affects the tail dependence structure within the marginal sequences and between them. As it is a function, the inference becomes more difficult, and it is therefore important to obtain characterizations, namely bounds based on the marginal dependence that are easier to estimate. In this work we present two decompositions that emphasize different types of information contained in the multivariate extremal index, an upper limit better than those found in the literature and we analyze its role in dependence on the limiting model of the componentwise maxima of a stationary sequence. We will illustrate the results with examples of recognized interest in applications.
keywords: multivariate extreme values, multivariate extremal index, tail dependence, extremal coefficients, madogram
AMS 2000 Subject Classification: 60G70
1 Introduction
Let be a multivariate distribution function (df), with continuous marginal dfs, in the domain of attraction of a multivariate extreme values (MEV) df having unit Fréchet marginals. Thus
[TABLE]
and , .
Consider a stationary sequence such that and let be a componentwise maxima sequence generated from and therefore , . If
[TABLE]
for some MEV df , we can relate and through the so called multivariate extremal index of . This is possible, even if the marginals are not unit Fréchet distributed, as considered for simplicity and without loss of generality. Indeed, to have (1) or mutatis mutandis (2), it is sufficient that, as , the sequence of copulas , with , converges to , as well as, , , which can be reduced to the case of convergence to the Fréchet without affecting the convergence of .
We recall the definition of multivariate extremal index of and its role in the relation between and (Nandagopalan [14] 1994). The sequence has multivariate extremal index , , when for each there is a sequence of real levels satisfying
[TABLE]
where , , , and is a sequence of independent vectors such that .
Observe that
[TABLE]
with
[TABLE]
where
[TABLE]
and, in particular, , . So, to say that exists is equivalent to say that exists and we have
[TABLE]
If has multivariate extremal index then any sequence of subvectors with indexes in has multivariate extremal index , with
[TABLE]
In particular, for each , has extremal index .
If , , exists for we have
[TABLE]
and , .
From inequalities
[TABLE]
we obtain
[TABLE]
Besides the relation between and , also informs about the existence of clustering of events “at least some exceedance of by , for some ", since
[TABLE]
for sequences and provided that satisfies condition strong-mixing.
The multivariate extremal index thus preserves, with the natural adaptations, the characteristics that made famous the univariate extremal index. Additionally to these similar characteristics to the univariate extremal index, it plays an unavoidable role in the tail dependence characterization of . If the tail dependence coefficients applied to remain unchanged when applied to (Li [12] 2009), we can not guarantee the same for , as will be seen in Section 3. The presence of serial dependence within each marginal sequence and between marginal sequences, makes it impossible to approximate the dependence coefficients in the tail of to those of .
The dependence modeling between the marginals of has received considerably more attention in literature than the dependence between the marginals of , which differs from for being affected by . The need to characterize this dependence appears, for instance, when we have a random field and we consider random vectors corresponding to locations at time instant . The sequence presents in general a multivariate extremal index encompassing information about dependence in the space of locations and when the time varies (Ferreira et al. [4] 2016). Relation (24) applied to MEV distributions and functions compatible with the properties of a multivariate extremal index, provide a means of constructing MEV distributions (Martins and Ferreira [13] 2005).
Notwithstanding all these challenges posed by and for the multivariate extremal index, the literature proves that it remained on the theoretical shelves of the study of extreme values.
The main difficulty of applying the multivariate extremal index lies in the fact that it is a function, unlike what happens with the marginal univariate extremal indexes, for which we have several estimation methods in the literature (see, e.g., Gomes et al. [7] 2008, Northrop [16] 2015, Ferreira and Ferreira [6] 2016 and references therein).
Since it remains present the need to estimate the propensity for clustering in a context of multivariate sequences, we propose in this work: (a) decompose it, highlighting different types of information contained in it; (b) bound it in order to obtain a better upper limit than those available in the literature; (c) enhance its role in the dependence of the tail of ; (d) apply it to models of recognized interest in applications.
2 Co-movements point processes
Based on (30) the multivariate extremal index can be seen as the number of the limiting mean dimension of clustering of events counted by the point process
[TABLE]
We are going to consider two point processes of more restricted events, corresponding to joint exceedances for various marginals of and enhance the contribution of the extremal indexes of these events in the value of .
Let, for each ,
[TABLE]
and
[TABLE]
where notations and stand for minimum and maximum, respectively.
We denote the respective limiting mean number of occurrences by
[TABLE]
and
[TABLE]
Observe that
[TABLE]
Thus
[TABLE]
with an increasing function in and homogeneous of order . Therefore, we have
[TABLE]
for all , a relation that will be fundamental for the independence of from .
In case , we will omit the index in notation.
For each of these processes, we can define an index of clustering of occurrences, which we will also call extremal indexes, and , being the latter a constant independent of , as we will see.
Let us assume that sequence satisfies the strong-mixing condition (Leadbetter et al. [9] 1983) and, as consequence, we have, as ,
[TABLE]
[TABLE]
and
[TABLE]
for any integers sequence , such that, , and , as , where and are the sequences of the strong-mixing condition. Thus
[TABLE]
[TABLE]
and
[TABLE]
with
[TABLE]
[TABLE]
[TABLE]
and
[TABLE]
In the following we present relations between , and , which will allow us a detailed interpretation of the information contained in and an upper bound better than the one in (28). But first, we start by proving that , i.e., these extremal indexes are independent of , which is already known for (Leadbetter et al. [9] 1983), , since . Indeed the proof runs along the same lines.
Proposition 2.1**.**
For stationary sequences satisfying the strong-mixing condition, if there exists the limit (32) for some , then it exists for any and we have
[TABLE]
with constant.
Proof.
From the strong-mixing condition, we have
[TABLE]
Thus, if there exists , we have , and so is a strictly positive function.
We also have that function would have to satisfy , for all and , since, representing by and applying (32), it holds
[TABLE]
and thus we would have
[TABLE]
On the other hand, would have to be a non increasing function because if
[TABLE]
and is increasing in , then from some order,
[TABLE]
and thus
[TABLE]
and . If is a strictly positive function, non increasing and such that , then , with a non negative constant. Since , it also comes . For the lower limit, we can make the same reasoning to obtain the result. ∎
Let us start by emphasizing that, to , we have the contribution of the clustering of the joint exceedances of all levels by the respective marginals, including the particular case of the clustering of exceedances of the largest level by the lower marginal, as well as, the clustering of exceedances of one or more levels by the respective marginals without joint exceedances of all levels.
Proposition 2.2**.**
Let be a stationary sequence satisfying the strong-mixing condition and a sequence of normalized real levels for which there exists . Then
[TABLE]
where
[TABLE]
and
[TABLE]
Proof.
We have
[TABLE]
In what concerns the last term, observe that
[TABLE]
and since , we have the result. ∎
Observe that reduces from the joint exceedances of accounted for . We can say that in the last term of representation of we are accounting the clustering propensity concerning one or more marginals, without joint exceedances of all the marginals.
We illustrate the previous result with a bivariate sequence with unit Fréchet marginals and such that the joint tail is regularly varying at with index measuring a penultimate tail dependence, as the (sub)model presented in Ledford and Tawn ([10, 11] 1996,1997).
Example 2.1**.**
Suppose that and is a strong-mixing stationary sequence, with unit Fréchet marginals and such that
[TABLE]
as , where and is a slowly varying function, i.e., , . Then
[TABLE]
[TABLE]
Therefore, regardless of additional conditions on the serial dependence, the validity of (59) implies
[TABLE]
and . Since we can thus write in this model
[TABLE]
We now consider several particular situations.
(a) In the case of independent vectors , , we have
[TABLE]
*It will then come , that is, and are also asymptotically independent.
(b) Suppose that , satisfies condition defined by
[TABLE]
which extends of Davis ([2] 1982), satisfied by i.i.d. sequences. Then
[TABLE]
*where the last part reflects the cross dependence.
*(c) If we assume an analogous hypothesis of (59) for with different , we will also obtain asymptotic independence between and , since the last term has null limit. We have .
(d) If , , then and, from (60),
[TABLE]
which implies that this limit is null and thus .
We present below a relation between and the extremal indexes and , , which discriminates different informations contained in function and provides an upper bound for better than the one in (28). In Example 2.2 we show that the proposed upper bound for the M4 processes, can be better than the one presented in Ehlert and Schlather ([3] 2008). The new upper bound has also the advantage of depending only on constant extremal indexes which can be estimated by known methods of literature.
Proposition 2.3**.**
Let be a stationary sequence satisfying the strong-mixing condition and a sequence of normalized real levels for which there exists . Then
- (a)
[TABLE]
where and , provided that the limiting constants exist. 2. (b)
Proof.
We have
[TABLE]
Regarding the second term, we can also say that
[TABLE]
Therefore,
[TABLE]
since , as . ∎
The above result means that, for each , the values only contribute to if it is not asymptotically almost surely the local occurrence of some joint exceedances of the largest level , , among the joint exceedances of these levels. Otherwise, the joint exceedances clustering is considered only through the clustering of the joint exceedances of the largest level , , and measured by , disappearing the third term. Therefore, the second and third terms together account for the clustering of two situations of joint exceedances. The fourth term measures the clustering of exceedances of and of one or more , , in the absence of joint exceedances of levels , , not accounted within the second and third terms. All these clustering situations were accounted by excess in the first term.
The function is homogeneous of order zero and thus , . The constant has been used as a dependence coefficient of the marginals of (see, e.g., Martins and Ferreira [13] 2005, Ehlert and Schlather [3] 2008, Ferreira and Ferreira [5] 2015, and references therein).
We are going to analyze the consequences of the decompositions presented for in the calculation of .
If , then , , and .
The first decomposition
[TABLE]
separates once again the contribution of the clustering of exceedances across all marginals from the contribution of the clustering of exceedances of one or more marginals without exceedances of all marginals.
In the next section, we will give an important utility to the boundary of . It will serve to delimitate the difference between the tail dependence coefficients of and .
The second decomposition allow us to obtain an upper bound for , which can be better than the one presented in (28). From the previous result, we have
[TABLE]
From the proof of Proposition 2.3 we found that, instead of following the order to decompose initially the event in a reunion of disjoint events , and , we can consider any other permutation from and repeat the process. Therefore the previous upper limit can be improved in the following sense:
[TABLE]
where denotes the set of all permutations of .
Example 2.2**.**
Consider the M4 process,
[TABLE]
with , where , , is an array of independent unit Fréchet random variables. We have , and . Since is -independent, representing by and , we have that
[TABLE]
Therefore, Proposition 2.3 indicates that . The upper limit in this type of processes has no great interest since we have the theoretical expression for . However, this example serves to show that our upper bound can be better than the one presented in Ehlert and Schlather ([3] 2008) for M4 processes. Indeed, by applying their Corollary 3, we obtain
[TABLE]
In the cases where the number of signatures of an M4 process exceeds the number of marginals, examples are easily constructed in which the Ehlert and Schlather upper limit is reduced to , being in these cases the lower limit of (103) below this. Our upper bound still has the advantage of being applied to processes outside the max-stable class.
3 Effect of the extremal index in the tail of a bivariate extreme values distribution
For each pair , belonging to , consider the bivariate (upper) tail dependence coefficient for random pair with df , discussed in Sibuya ([17] 1960) and Joe ([8] 1997), defined by
[TABLE]
and coefficient of Coles et al. ([1] 1999), defined by
[TABLE]
We can say that corresponds to the probability of one variable being high given that the other is high too. The case means asymptotic dependence between and and whenever the variables are said to be asymptotically independent. Assuming within asymptotically independent data may carry to an over-estimation of probabilities of extreme joint events (see, e.g., Ledford and Tawn [10, 11] 1996, 1997). Asymptotically independent models, i.e., having , may exhibit a residual tail dependence rendering different degrees of dependence at finite levels. Coefficient is a suitable tail measure within this class. Thus the pair is a useful tool in characterizing the extremal dependence: under asymptotic dependence we have and quantifies the strength of dependence between the variables and, within the class of asymptotic independence, we have and measures the strength of dependence of the random pair.
Observe that, both measures can be calculated from the copula , with
[TABLE]
and
[TABLE]
If belongs to the domain of attraction of , then and . This results from the uniform convergence of to and from . We will then have
[TABLE]
which guarantees the constancy of and , as .
The presence of dependence among the variables of expressed by a function with values less than one, may affect the limiting behavior of but not the limiting behavior of , where denotes de df of .
Proposition 3.1**.**
For stationary sequences , with multivariate extremal index , , for any choice in , we have, .
Proof.
Based on the spectral representation of MEV copulas and relation
[TABLE]
we have
[TABLE]
where is the spectral measure of . On the other hand
[TABLE]
Therefore,
[TABLE]
with
[TABLE]
with and . ∎
Proposition 3.2**.**
For stationary sequences , with multivariate extremal index , , we have, for any choice in ,
- (a)
; 2. (b)
.
Proof.
Using the spectral representation of MEV copulas and relation (119), we have
[TABLE]
where is the spectral measure of . ∎
The previous proposition can be rewritten in terms of the extremal coefficients and , such that, and , since these satisfy the relations and . From (a) we conclude that . Consequently, for the measure of asymptotic independence called madogram (Naveau et al. [15] 2009), defined by
[TABLE]
and satisfying
[TABLE]
we have
- (a)
; 2. (b)
Therefore, for large , the madogram of can not be taken by the madogram of .
From relation (b) in Proposition 2.3, we conclude that
[TABLE]
and we can establish the following consequence about the value of the difference between and .
Corollary 3.3**.**
For stationary sequences satisfying the strong-mixing condition, with multivariate extremal index , , we have, for any choice in ,
- (a)
, implies ; 2. (b)
.
Proof.
- (a)
If is constant equal to , then and, since is homogeneous of order 1, from (b) of Proposition 3.2, we have ; 2. (b)
The inequality follows from (b) of Proposition 3.2 and from (140).
∎
We emphasize that the quantity that we find in (140) and in (b) of the previous proposition reflects a clustering propensity of through the extremal index and
[TABLE]
From this discussion we conclude that:
- (i)
The tail dependencies of and of , for large , evaluated through coefficient , can be considered equal when the multivariate extremal index is constant, otherwise they differ in at least , where the previous quantities can be estimated from the existing methods in literature. 2. (ii)
If we estimate the dependence on the tail of , we do not obtain the dependence on the tail of , unless we correct the result with an estimate of .
In cases where has totally dependent marginals () or has independent marginals (), the previous lower limit loses interest by triviality. We underline the expression of in these two cases in the next result, which is derived from (a) of Proposition 3.2.
Corollary 3.4**.**
For stationary sequences , with multivariate extremal index , , we have, for any choice in ,
- (a)
If has independent marginals, then ; 2. (b)
If has totally dependent marginals, then .
Now we construct some examples that illustrate the cases and .
Example 3.1**.**
We first consider the following bivariate M4 process with one moving pattern,
[TABLE]
where , . We have in this case
[TABLE]
and
[TABLE]
Otherwise
[TABLE]
Therefore, and .
Example 3.2**.**
Now consider a modification in the above example through the introduction of one more pattern,
[TABLE]
We have the same and as in the previous example, but here
[TABLE]
and therefore,
[TABLE]
Then .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Coles, S., Heffernan, J., Tawn, J. (1999). Dependence Measures for Extreme Value Analyses. Extremes 2(4), 339-365
- 2[2] Davis, R.A. (1982). Limit laws for the maximum and minimum of stationary sequences. Z. Wahrsch. verw. Gebiete. 61, 31-42
- 3[3] Ehlert, A., Schlather, M. (2008). Capturing the multivariate extremal index: Bounds and interconnections. Extremes 11(4), 353-377
- 4[4] Ferreira, H., Pereira, L., Martins, A.P. (2016). Clustering of high values in random fields. Accepted for publication in Extremes
- 5[5] Ferreira, H., Ferreira, M. (2015). Extremes of scale mixtures of multivariate time series. Journal of Multivariate Analysis 137, 82-99
- 6[6] Ferreira, H., Ferreira, M. (2016). Estimating the extremal index through local dependence. Accepted for publication in Annales de l’Institut Henri Poincaré
- 7[7] Gomes, M.I., Hall, A., Miranda, M.C. (2008). Subsampling techniques and the Jackknife methodology in the estimation of the extremal index. Computational Statistics & Data Analysis 52(4), 2022-2041
- 8[8] Joe, H. (1997). Multivariate Models and Dependence Concepts. Chapman and Hall, London.
