Tail dependence and smoothness
Helena Ferreira, Marta Ferreira

TL;DR
This paper introduces a smoothness coefficient to assess oscillations in stochastic processes, linking it to tail dependence, and proposes a new estimator for tail dependence with evaluation through simulations.
Contribution
It proposes a novel smoothness coefficient for processes, providing an intuitive measure of oscillations, and establishes its connection to tail dependence, leading to a new estimator.
Findings
The smoothness coefficient coincides with tail dependence in stationary processes.
A new estimator for tail dependence is developed and tested.
Simulation results demonstrate the estimator's performance.
Abstract
The risk of catastrophes is related to the possibility of occurring extreme values. Several statistical methodologies have been developed in order to evaluate the propensity of a process for the occurrence of high values and the permanence of these in time. The extremal index (Leadbetter 1983) allows to infer the tendency for clustering of high values, but does not allow to evaluate the greater or less amount of oscillations in a cluster. The estimation of entails the validation of local dependence conditions regulating the distance between high levels oscillations of the process, which is difficult to implement in practice. In this work, we propose a smoothness coefficient to evaluate the degree of smoothness/oscillation in the trajectory of a process, with an intuitive reading and simple estimation. Application in some examples will be provided. We will see that, in…
| abias | rmse | abias | rmse | abias | rmse | |||
|---|---|---|---|---|---|---|---|---|
| MAR(1) | c=0.25 | 0.0559 | 0.0723 | 0.0579 | 0.0745 | 0.0566 | 0.0724 | |
| c=0.50 | 0.0556 | 0.0695 | 0.0557 | 0.0680 | 0.0561 | 0.0700 | ||
| c=0.75 | 0.0457 | 0.0550 | 0.0489 | 0.0594 | 0.0456 | 0.0551 | ||
| MMA(1) | c=0.25 | 0.0163 | 0.022 | 0.0198 | 0.0257 | 0.0277 | 0.0354 | |
| c=0.50 | 0.0453 | 0.0581 | 0.0430 | 0.0533 | 0.0461 | 0.0587 | ||
| c=0.75 | 0.0439 | 0.0523 | 0.0348 | 0.044 | 0.0440 | 0.0527 | ||
| YARP(1) | p=0.25 | 0.0520 | 0.0678 | 0.0531 | 0.0695 | 0.0524 | 0.0678 | |
| p=0.50 | 0.0576 | 0.0695 | 0.0503 | 0.0623 | 0.0577 | 0.0699 | ||
| p=0.75 | 0.0469 | 0.0604 | 0.0485 | 0.0633 | 0.0471 | 0.0604 | ||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Risk and Volatility Modeling · Market Dynamics and Volatility · Statistical Methods and Inference
Tail dependence and smoothness
Helena Ferreira
Universidade da Beira Interior, Centro de Matemática e Aplicações (CMA-UBI), Avenida Marquês d’Avila e Bolama, 6200-001 Covilhã, Portugal
Marta Ferreira
Center of Mathematics of Minho University
Center for Computational and Stochastic Mathematics of University of Lisbon
Center of Statistics and Applications of University of Lisbon, Portugal
Abstract
The risk of catastrophes is related to the possibility of occurring extreme values. Several statistical methodologies have been developed in order to evaluate the propensity of a process for the occurrence of high values and the permanence of these in time. The extremal index (Leadbetter [16] 1983) allows to infer the tendency for clustering of high values, but does not allow to evaluate the greater or less amount of oscillations in a cluster. The estimation of entails the validation of local dependence conditions regulating the distance between high levels oscillations of the process, which is difficult to implement in practice. In this work, we propose a smoothness coefficient to evaluate the degree of smoothness/oscillation in the trajectory of a process, with an intuitive reading and simple estimation. Application in some examples will be provided. We will see that, in a stationary process, it coincides with the tail dependence coefficient (Sibuya [19] 1960, Joe [15] 1997), providing a new interpretation of the latter. This relationship will inspire a new estimator for and its performance will be evaluated based on a simulation study.
keywords: extreme values, smoothness coefficient, tail dependence coefficient
AMS 2000 Subject Classification: 60G70
1 Introduction
The occurrence of high values in a stochastic process can mean a natural, social or economic catastrophe, which has motivated the development of statistical models and techniques for extremes of random variables (see, e.g., Gomes and Guillou, [12] 2014 and their references). The unpredictability we would like to dominate is based on the propensity of the process for high values and the mean time permanency of these, usually measured by the arithmetic inverse of the extremal index (Leadbetter [16] 1983; Hsing et al. [14] 1988). Clustering of high values can be predicted in models that verify local dependency conditions D, which regulate the distance between oscillations of the process relative to high levels (Chernick et al. [2] 1991). Under the validity of such conditions we can obtain expressions for the mean size of a cluster of high values. Not only the validation of local dependence conditions is difficult in practice, but also the estimation of does not give us information about the greater or less amount of oscillations in a cluster. In this work, we propose a measure to distinguish between processes with more oscillating trajectories from processes with smoother ones, which has an intuitive reading and is easy to estimate. The smoothness coefficient of a block of variables that we propose takes values in and grows with the degree of concordance of the variables. We will applied it in theoretical examples. We will also verify that, in a stationary process, it coincides with the tail dependence coefficient (Sibuya [19] 1960, Joe [15] 1997), which gives us a new reading for this well-known coefficient in the literature of extremes. The new representation for the tail dependence coefficient inspires an estimation procedure that will be analysed through a simulation study.
This paper is organized as follows: in Section 2 we introduce the smoothness coefficient and present some properties and examples. In Section 3 we consider a new estimator for and analyse its performance through simulation.
2 The smoothness coefficient
Consider a sequence of real random variables (r.v.’s) and denote the distribution function (d.f.) of , . A natural way to evaluate the propensity for oscillations within a process is to compare the expected number of oscillations in instant ,
[TABLE]
relative to real high levels , with the expected number of exceedances of ,
[TABLE]
around the instant . Existing, at least, one exceedance between instants and (), the expected total of oscillations will be closer of the expected total of exceedances, for , in processes with more oscillating trajectories. We then propose as a summary measure of the result of this comparison between exceedances and oscillations, a coefficient with values in , which increases with the concordance of the variables.
Definition 2.1**.**
The smoothness coefficient of is defined by
[TABLE]
where , provided the limit exists.
The proposed smoothness coefficient can naturally be expressed as a function of tail dependence coefficients
[TABLE]
These summarize the behavior of the bivariate tails of a sequence and have been extensively studied and applied in the literature of extremes (see, e.g., Schmidt and Stadtmüller [18] 2006, Li [17] 2009, Ferreira and Ferreira [7] 2014, and references therein).
Proposition 2.1**.**
The smoothness coefficient of satisfies
[TABLE]
provided exists for all and .
Proof.
Observe that
[TABLE]
∎
This result points to the reading of , , in a stationary process, as the smoothness coefficient for any block of variables .
Corollary 2.2**.**
If is a stationary sequence with tail dependence coefficient , then
[TABLE]
Tail dependence increases with the concordance of the variables (Li, [17] 2009). We can therefore deduce the following properties from (7).
Proposition 2.3**.**
Let process have smoothness coefficient . Then
- (i)
;
- (ii)
If are more concordant than , then .
Proof.
Assertion (i) results from the coefficient definition and for (ii), observe that, if are more concordant than , then , . ∎
In the bounds of the concordance relation, we have the independent and totally dependent variables. If all random pairs , , , are independent we have , whereas if they are totally dependent then .
In the context of max-stable processes, the independence or total bivariate dependence of the variables in is equivalent to the independence or total dependence of all variables. Thus, if is max-stable, then , we will have if and only if are independent and if and only are dependent. For the context of max-stability, we also have the possibility of relating with the extreme coefficients (Tiago de Oliveira [21] 1962/1963, Smith [20] 1990), which allows the estimation of the coefficients , by estimating expected values (Ferreira, [10] 2013).
Example 2.1**.**
Consider the -factor model (Einmahl et al. [4], 2012)
[TABLE]
where factors , , are independent and Fréchet() distributed r.v.’s, , and are non-negative constants such that . Variables in are not identically distributed since each one of the factors contribute to the value of with weights updated over time . Specifically we have
[TABLE]
We have
[TABLE]
Observe that
[TABLE]
Thus, for the dependence on the tail of and , we have
[TABLE]
Denoting , , , we have
[TABLE]
In the particular case of , , we have a constant sequence and , . Thus we obtain and . If , then is a sequence of totally dependent variables and we have and . Under the special case of equally weighted factors, that is, , , , and , we have
[TABLE]
and therefore
[TABLE]
Example 2.2**.**
(Temporary Failures Model or "Stopped Clock") Let be a sequence of independent and identically distributed (i.i.d.) variables and independent of the sequence of Bernoulli variables . Consider notations , , and
[TABLE]
. We denominate by temporary failures model, a sequence defined as follows:
[TABLE]
Such designation relies on the interpretation of as a sequence of states corresponding to the registration or non-registration of values of . Thus, if, for example, , we will have, almost surely, . Zero sequences at the values of determine replicates of the last recorded value of . If is the time, the zeros of mean a stop of the register in time, keeping the last record. Let us consider a short-failures model to illustrate the smoothness coefficient calculation. In the short-failures model, we assume that , i.e., it is almost impossible to lose two or more consecutive records of . We start by deriving the common d.f. of :
[TABLE]
Suppose, without loss of generality, that , . For and , we have
[TABLE]
Therefore,
[TABLE]
and we obtain the smoothness coefficient given by
[TABLE]
We can see that increases with the tendency to stop in the initial sequence records, as expected. With some more time-consuming calculations, we can extend the result to models with longer lasting failures. We note that in this short-failures model, the estimation of allows us to estimate . The estimation of can be done from the natural estimation of , since, in general, and are unobservable sequences.
3 A new estimator for under stationarity
The usual linear Pearson’s correlation coefficient does not give us enough insight about the amount of dependence in the tails (Embrechts et al. [5] 2002). Extreme values theory is the natural framework to address this topic. The tail dependence coefficient is perhaps the most common measure of extremal dependency. Many other coefficients have been presented in the literature, most of them related to (see, e.g., Schmidt and Stadtmüller [18] 2006, Li [17] 2009, Ferreira and Ferreira [8] 2018, and references therein). The smoothness coefficient introduced here is another measure of tail dependence and from Corollary 2.2 it coincides with under stationarity. Inference based on the definition in (1) is quite straightforward by taking the respective empirical counterparts. Thus, we can state a new estimator for based on , which we denote . More precisely, considering a stationary sequence with marginal d.f. and and , respectively the number of upcrossings and the number of exceedances of a high level of , we have
[TABLE]
In the following we address a simulation study in order to analyse the performance of in (52). We also consider two estimators of well-known and commonly used in literature, motivated by the second equality in (19):
[TABLE]
where
[TABLE]
and corresponds to the empirical d.f. of . See Frahm et al. ([11] 2005) and references therein.
The simulations correspond to replicas of samples with size from the following models for :
- •
First-order max-autoregressive (Davis and Resnick [3] 1989) denoted MAR(1):
, , with a sequence of i.i.d. r.v.’s with unit Fréchet d.f., as well as and thus , . We have (see, e.g., Ferreira and Ferreira [6] 2012);
- •
First order moving-maximum (Davis and Resnick [3] 1989) denoted MMA(1):
, , with a sequence of i.i.d. r.v.’s with unit Fréchet d.f., as well as and thus , . We have (see, e.g., Heffernan et al. [13] 2007);
- •
First order autoregressive Yeh-Arnold-Robertson Pareto(III) (Arnold [1] 2001), denoted YARP(1):
, where , is a sequence of i.i.d. r.v.’s coming from a Pareto(III)(0,,), i.e., , and sequence of i.i.d. r.v.’s coming from Bernoulli, , independent of , . We consider , Pareto(III)(0,,) and thus , . We have (Ferreira [9] 2012).
The absolute bias (abias) and the root mean squared error (rmse) derived from simulations are in Table 1, where we considered the high level given by the 95% sample quantile. Quantiles 90% and 99% were also used but do not improve the results and are not reported. The values in bold correspond to the least absolute bias and the least root mean squared error obtained in each model. We can see that the three estimators have very similar performances. The estimator proposed here, being of very simple application, thus constitutes a possible alternative.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Arnold, B.C. (2001). Pareto Processes. In: Handbook of Statistics (D.N. Shanbhag and C.R. Rao, eds.) , Vol. 19, Elsevier Science B.V.
- 2[2] Chernick M.R., Hsing T., Mc Cormick W.P. (1991). Calculating the extremal index for a class of stationary sequences. Advances in Applied Probability 23, 835–850.
- 3[3] Davis, R., Resnick, S. Basic properties and prediction of max-ARMA processes. Adv. Appl. Probab. 21, 781–803, 1989.
- 4[4] Einmahl, J. H. J., Krajina, A., Segers, J. (2012). An M-estimator for tail dependence in arbitrary dimensions. The Annals of Statistics, 40(3), 1764–1793.
- 5[5] Embrechts, P., Mc Neil, A., Straumann, D. (2002). Correlation and dependence in risk management: properties and pitfalls. In "Risk Management: Value at Risk and Beyond" (M.A.H. Dempster, Eds.), Cambridge University Press, Cambridge, 176–223.
- 6[6] Ferreira, M., Ferreira, H. (2012). On extremal dependence: some contributions. TEST 21(3), 566–583.
- 7[7] Ferreira, H., Ferreira, M.Extremal behavior of p MAX processes. Statistics and Probability Letters , 93, 46–57, 2014.
- 8[8] Ferreira, H., Ferreira, M. Multidimensional extremal dependence coefficients. Statistics & Probability Letters 133, 1–8, 2018.
