Tail models and the statistical limit of accuracy in risk assessment

Ingo Hoffmann; Christoph J. B\"orner

arXiv:1904.12113·q-fin.RM·July 15, 2020

Tail models and the statistical limit of accuracy in risk assessment

Ingo Hoffmann, Christoph J. B\"orner

PDF

TL;DR

This paper investigates the statistical limits of accurately estimating high quantiles in risk management, focusing on tail models like the generalized Pareto distribution and analyzing the bias and variance of quantile estimators in finite samples.

Contribution

It derives the finite sample distribution, bias, and variance of tail quantile estimators, providing insights into their accuracy limits in risk assessment scenarios.

Findings

01

Finite sample distribution of quantile estimators derived

02

Bias and variance of estimators quantified

03

Impact analysis on unknown distribution quantiles conducted

Abstract

In risk management, tail risks are of crucial importance. The assessment of risks should be carried out in accordance with the regulatory authority's requirement at high quantiles. In general, the underlying distribution function is unknown, the database is sparse, and therefore special tail models are used. Very often, the generalized Pareto distribution is employed as a basic model, and its parameters are determined with data from the tail area. With the determined tail model, statisticians then calculate the required high quantiles. In this context, we consider the possible accuracy of the calculation of the quantiles and determine the finite sample distribution function of the quantile estimator, depending on the confidence level and the parameters of the tail model, and then calculate the finite sample bias and the finite sample variance of the quantile estimator. Finally, we…

Equations40

F (x)

F (x)

f (x)

f (x)

q_{α}

q_{α}

\hat{Q}_{α}

\hat{Q}_{α}

\overset{q}{^}_{α}

\overset{q}{^}_{α}

(1 - α)^{- \hat{ξ}}

(1 - α)^{- \hat{ξ}}

\hat{Q}_{α}

\hat{Q}_{α}

f_{q} (z; n, α, σ, ξ) =

f_{q} (z; n, α, σ, ξ) =

\frac{1}{2 π} \frac{n}{σ} \frac{1}{1 + 4 ξ + 5 ξ ^{2} + 2 ξ ^{3}} \int_{- \infty}^{+ \infty} d u ψ (u) \times

\displaystyle\exp\left\{-\frac{n}{1+2\xi}\left[\frac{\big{(}u-\xi\big{)}^{2}}{1+\xi}+\frac{\big{(}u-\xi\big{)}\big{(}z\psi(u)-\sigma\big{)}}{(1+\xi)\sigma}+\frac{\big{(}z\psi(u)-\sigma\big{)}^{2}}{2\sigma^{2}}\right]\right\}

B (n, α, σ, ξ) = E [\overset{q}{^}_{α} - q_{α}] .

B (n, α, σ, ξ) = E [\overset{q}{^}_{α} - q_{α}] .

B (n, ξ)

B (n, ξ)

a_{1} = - 1.00733 a_{2} = + 3.49572 a_{3} = + 1.49397

a_{1} = - 1.00733 a_{2} = + 3.49572 a_{3} = + 1.49397

B (n, ξ)

B (n, ξ)

M^{- 1}

M^{- 1}

μ

μ

\displaystyle\hat{q}_{\alpha}=\frac{\hat{\sigma}}{\hat{\xi}}\Big{[}\big{(}1-\alpha\big{)}^{-\hat{\xi}}-1\Big{]}.

\displaystyle\hat{q}_{\alpha}=\frac{\hat{\sigma}}{\hat{\xi}}\Big{[}\big{(}1-\alpha\big{)}^{-\hat{\xi}}-1\Big{]}.

Φ (\overset{q}{^}_{α})

Φ (\overset{q}{^}_{α})

Φ (\overset{q}{^}_{α})

Φ (\overset{q}{^}_{α})

f_{Y} (u, v) = δ (u - ξ) δ (v - σ) for n \to \infty,

f_{Y} (u, v) = δ (u - ξ) δ (v - σ) for n \to \infty,

f_{q} (z)

f_{q} (z)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Tail models and the statistical limit of accuracy

in risk assessment

Ingo Hoffmann

[email protected]

Christoph J. Börner

[email protected]

Financial Services, Faculty of Business Administration and Economics,

Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany

Abstract

In risk management, tail risks are of crucial importance. The assessment of risks should be carried out in accordance with the regulatory authority’s requirement at high quantiles. In general, the underlying distribution function is unknown, the database is sparse, and therefore special tail models are used. Very often, the generalized Pareto distribution is employed as a basic model, and its parameters are determined with data from the tail area. With the determined tail model, statisticians then calculate the required high quantiles. In this context, we consider the possible accuracy of the calculation of the quantiles and determine the finite sample distribution function of the quantile estimator, depending on the confidence level and the parameters of the tail model, and then calculate the finite sample bias and the finite sample variance of the quantile estimator. Finally, we present an impact analysis on the quantiles of an unknown distribution function.

keywords:

Exceedances , Extreme Value Theory , Generalized Pareto Distribution , Quantile Estimation , Risk Assessment , Tail Models.

JEL Classification: C13 , C16 , C46 , C51.

1 Introduction

In many disciplines, there is often a need to adapt a statistical model to existing data to be able to make statements regarding uncertain future outcomes. In particular, when assessing risks, an estimate of major losses must be based on events that, despite having a low probability of occurrence, have a high impact. Since the actual distribution of data – the parent distribution – is generally unknown, statisticians begin their modeling with a guess regarding the underlying statistical model. In a first step, they try to fit one or more parametric distribution functions as a model to the data to evaluate the rare events in the next step. These models generally do not perfectly reflect the data. However, specific statistical tests can be applied to assess how well or how poorly a model fits the data as a whole. Nevertheless, especially in the case of rare events and high damage, small uncertainties in the assumption of a model lead to a faulty description of these extremes and will call into question the value of the information obtained by this approach. Therefore, any uncertainties regarding the underlying model and the resulting misjudgments must be assumed to negatively affect the quality of the models for rare events. In particular, model uncertainties pose problems in the finance and insurance industries, especially when rare events need to be evaluated, for example, by calculating high quantiles.

To make more precise statements regarding rare events and their severity, statisticians can describe the tail of the parent distribution function using a separate model and calculate the corresponding quantiles more precisely to improve the quality of the results. In the case of financial institutions, the respective regulatory frameworks provide statisticians and risk managers with the confidence levels of the parent distribution quantiles (Basel Commitee,, 2004; Directive,, 2009, 2013; Regulation,, 2013). Depending on the purpose, the confidence level is frequently given as 99.9%; however, the available data often do not cover this area at all. The calculation of the capital that is regulatorily required to take a risk is based on the value-at-risk (VaR) or conditional value-at-risk (CVaR), which are calculated from high quantiles. In addition to these regulatory risk measures, additional measures exist for internal management decisions, such as the return on risk-adjusted capital (RORAC) and the risk-adjusted return on capital (RAROC), which are also calculated from high quantiles of the parent distribution and are important for the risk assessment of a company. Given this framework, standalone modeling of the tails of the underlying distribution is suggested for a more accurate calculation of the risk values.

For a very large class of parent distribution functions, the generalized Pareto distribution (GPD) can be used as a model for the tail, cf., e.g., Embrechts et al., (2003) and the large number of references on this topic contained therein. This class of distributions includes all common parent distributions that play a role in the financial sector such that almost no uncertainty exists regarding the model selection for the tail of the unknown parent distribution. The required quantiles can then be determined to high confidence levels with a sufficient certainty.

In practice, when modeling risks, further constraints need to be considered (e.g., a small database). Moreover, the question arises: which accuracy can be achieved at all in the risk assessment? Specifically, if in the ideal case only the statistical errors remain, what accuracy can be expected within the risk assessment? This question, which is central to financial companies and audit authorities, will be addressed in this study.

The question of the accuracy of a tail model is discussed in detail in the literature regarding the parameter estimators of the GPD (Smith,, 1984; Choulakian and Stephens,, 2001; Embrechts et al.,, 2003). The focus is often on the statistical properties of the estimators and less on the statistical properties of the quantiles. Hosking and Wallis, (1987) have compared different methods of parameter estimation for the GPD. Thereby, on the basis of Monte Carlo simulations and asymptotic functions for the variance of the quantile estimator, they could already make statements about the properties of the quantile estimator of the GPD. In the field of climatology, it is known that quantile estimators are biased, but it seems that more detailed causal research has not been carried out. The research in the field of climatology focuses primarily on practical methods to reduce the bias (Hoffmann et al.,, 2018; Fang et al.,, 2015). In this study, we go beyond the state of the art and aim to examine the statistical properties of the quantiles and consider the reason for the bias of the quantiles.

After a brief description of risk modeling and quantile estimation (Sec. 2), we derive a finite sample distribution of the quantile estimator of the GPD (Sec. 3) and further explore the distribution in the context of limited datasets (Sec. 4). As a result, we find that the quantile estimator is positively biased for a finite number of data and grows larger with smaller amounts of data following a power law. The same behavior is found for the variance of the quantile estimator. The results also indicate and quantify that these inaccuracies increase for risks whose underlying unknown distribution function has a fat tail. Of particular interest could be the positive bias: quantiles and thus risks are systematically overestimated for small databases. For practical applications, we provide a correction formula to mitigate the impact of this overestimation (Sec. 5). Overall, these inaccuracies are expected to affect all derived risk measures, based on the quantile estimator. The results of this study therefore illustrate the accuracy limit that can be achieved during risk assessment in practice, especially if the underlying parent distribution is unknown, the database is small, and the regulatory framework requires risk assessment with high quantiles.

Note that in the following, peculiarities of censorized or winzorized data or contaminations of these due to superimposed distribution functions are not taken into account. Instead, in our further considerations, we exclude censoring and contamination of the data and assume that the dataset consists of independently and identically distributed realizations. Therefore, in the following, we focus clearly on financial data with these properties. In this way, we consider the best of all imaginable cases and further examine the achievable accuracy for finite data series.

2 Risk assessment at high quantiles

Especially when high quantiles are considered in risk assessment in practice, a separate modeling of the tail of the generally unknown parent distribution often takes place. Predominantly, the GPD is used as a tail model in practice (Basel Commitee,, 2009). In the following, the main features of the GPD are briefly considered, as well as their use in tail modeling. The estimation of high quantiles of an unknown underlying distribution function is attributed to the estimation of high quantiles with the GPD as a model. Furthermore, the databases occurring in practice are described as a framework condition.

2.1 Model of the tail of a distribution

A theorem in extreme value theory, which goes back to Gnedenko, (1943), Balkema and de Haan, (1974) and Pickands, (1975), states that for a broad class of distributions, the distribution of the excesses over a threshold $u$ converges to a GPD if the threshold is sufficiently large.

The GPD is usually expressed as a two-parameter distribution and has the following distribution function:

[TABLE]

where $\sigma$ is a positive scale parameter and $\xi$ is a shape parameter (sometimes called the tail parameter). The density function is

[TABLE]

with support $0\leq x<\infty$ for $\xi\geq 0$ and $0\leq x\leq-\frac{\sigma}{\xi}$ when $\xi<0$ . The quantile function of the GPD depending on the confidence level $\alpha$ is:

[TABLE]

The mean and variance are ${\text{E}}[x]=\frac{\sigma}{1-\xi}$ and ${\text{Var}}[x]=\frac{\sigma^{2}}{(1-\xi)^{2}(1-2\xi)}$ , respectively; thus, the mean and variance of the GPD are positive and finite only for $\xi<1$ and $\xi<0.5$ , respectively. For special values of $\xi$ , the GPD leads to various other distributions. When $\xi=0$ and $-1$ , the GPD becomes an exponential and a uniform distribution, respectively. For $\xi>0$ , the GPD has a long tail to the right and is a reparameterized version of the usual Pareto distribution. Several areas of applied statistics have used the latter range of $\xi$ to model datasets that exhibit this form of a long tail.

Since the GPD was introduced by Pickands, (1975), numerous theoretical advancements and applications have followed (Davison,, 1984; Smith,, 1984, 1985; van Montfort and Witter,, 1985; Hosking and Wallis,, 1987; Davison and Smith,, 1990; Choulakian and Stephens,, 2001). The applications of the GPD include use in the analysis of extreme events in hydrology, as a failure-time distribution in reliability studies and in the modeling of large insurance claims. Numerous examples of applications can be found in Embrechts et al., (2003) and the studies listed therein. The GPD is also increasingly used in the financial and banking sectors. Especially in the assessment of risks based on high quantiles, the GPD is one of the proposed distributions for modeling the tail of an unknown parent distribution (Basel Commitee,, 2009).

The preferred method in the literature for estimating the parameters of the GPD is the well-studied maximum likelihood method (Davison,, 1984; Smith,, 1984, 1985; Hosking and Wallis,, 1987). Choulakian and Stephens, (2001) stated that it is theoretically possible to have datasets for which no solution to the likelihood equations exists, and they concluded that, in practice, this is extremely rare. In many practical applications, the estimated shape parameter $\hat{\xi}$ ranges between -0.5 and 0.5, and a solution to the likelihood equations exists (Hosking and Wallis,, 1987; Choulakian and Stephens,, 2001). For practical and theoretical reasons, these authors limited their attention to this range of values. Hoffmann and Börner, (2018) adapted the GPD as a model for the tail of different parent distributions applicable to finance and banking and also found that the maximum likelihood estimator $\hat{\xi}$ falls within this range. Furthermore, in many applications with real data from the finance sector, it was found that $\hat{\xi}$ was only slightly smaller than zero and was limited upward by $\hat{\xi}<0.5$ (Hoffmann and Börner,, 2018). Thus, the range of interest of the tail index in practice can be given by the interval $0.0\lesssim\xi<0.5$ , indicating that the (unknown) underlying parent distribution functions may have longer upper tails.

2.2 Determination of the model parameters

Let $X_{1},X_{2},\ldots,X_{N}$ be a sample of random variables with common unknown continuous distribution function $F(x)$ and density function $f(x)$ . Let further $x_{(1)}\geq x_{(2)}\geq\ldots\geq x_{(N)}$ be the sample values (in ascending order) obtained by ordering each realization $x_{1},x_{2},\ldots,x_{N}$ of the random variables $X_{1},X_{2},\ldots,X_{N}$ .

A specific value $x_{n+1}$ for $n=1,\ldots,N-1$ determined from the ordered sample then separates the data into two parts. The first part of the data belongs to the body of the unknown parent distribution function, and the second part belongs to the tail. The value $x_{n+1}$ is referred to as the threshold $u$ at which the tail begins. The threshold can be estimated, for example, by the method of Hoffmann and Börner, (2018): $\hat{u}=x_{\hat{n}+1}$ , where $\hat{n}+1$ is an estimate of the index of the data point that marks the threshold.

The parameters $\xi$ and $\sigma$ of the GPD, Eq. (1), as the tail model are then determined via maximum likelihood estimation for the ordered subset of the data: $x_{(1)}\geq x_{(2)}\geq\ldots\geq x_{(\hat{n})}$ . This leads to the finite sample estimates $\hat{\xi}_{\hat{n}}$ and $\hat{\sigma}_{\hat{n}}$ . To simplify the notation, we omit the index $\hat{n}$ below.

2.3 Estimation of high quantiles.

The estimation function for the quantile $Q_{\alpha}$ of the unknown parent distribution with a confidence level $\alpha$ can be noted as follows (Embrechts et al.,, 2003):

[TABLE]

where $\hat{\xi}$ and $\hat{\sigma}$ are the maximum likelihood estimates of the parameters of the GPD. Furthermore, $\hat{u}$ again denotes the estimator of the threshold after a data point $x_{\hat{n}}$ with estimated index $\hat{n}$ . A quantile estimator $\hat{q}_{\alpha}$ is defined by substituting estimators $\hat{\xi}$ and $\hat{\sigma}$ for the parameters in Eq. (3), cf., e.g., Hosking and Wallis, (1987):

[TABLE]

Then, we have

[TABLE]

With this equation, we rewrite Eq. (4), and it follows that a relation between the estimated quantile $\hat{q}_{\alpha}$ of the GPD and the estimated quantile $\hat{Q}_{\alpha}$ of the unknown parent distribution is:

[TABLE]

This show that the accuracy of the quantile estimator $\hat{Q}_{\alpha}$ is influenced by many factors. The estimator $\hat{n}$ and thus $\hat{u}$ can be set very well in a sample with the method described for example by Hoffmann and Börner, (2018). The estimators $\hat{\xi}$ and $\hat{\sigma}$ are maximum likelihood estimators with the properties of being consistent and asymptotically efficient (Embrechts et al.,, 2003), so that they converge in the limit of large samples against the true values. The question is: which statistical properties does $\hat{q}_{\alpha}$ have? This will be examined below. Knowing the statistical properties, it is possible to clarify how the inaccuracies in the quantile $\hat{q}_{\alpha}$ affect the quantile $\hat{Q}_{\alpha}$ .

2.4 Practical considerations – restrictions due to the database.

The regulatory requirements stipulate that risks in the financial sector should be calculated at high confidence levels and thus high quantiles, whereby a certain holding period is assumed (weekly, fortnightly, monthly, quarterly or annually), cf., e.g., Basel Commitee, (2004, 2009); Directive, (2009, 2013). The quantiles are determined on the basis of datasets for each risk category (e.g., assets) individually or on the basis of a bundled risk (e.g., a portfolio). To estimate the amount of usable data, let us consider an example: if the risks of assets are valued within the period since the introduction of the euro, less than $N\approx 5000$ data points per asset are available for the analysis based on daily closing prices. The latter is the ideal case. For an asset launched later, the database will be reduced accordingly. If the GPD is used to determine the high quantiles, the estimation of the parameters of the tail model is based on a much smaller part of the database. Previous reports have already carried out studies on the favorable choice of the amount of data used for tail modeling in the finance and insurance fields (McNeil and Saladin,, 1997; Moscadelli,, 2004; Dutta and Perry,, 2007), revealing that the preferred tail length $n$ for the data series analyzed in those fields comprises approximately 10% to 15% of the total amount of data available. Hence, the database for estimating $q_{\alpha}$ is limited in the majority of cases upwards by $n\approx 750$ . If further restrictions are added – for example, if there are only weekly data available – the database is reduced again. On the other hand, for $n<50$ data points, the influence of statistical errors increases enormously, so that for smaller data bases an evaluation of high quantiles becomes numerically difficult; see also Sec. 4.

In summary, in most cases, the number of data points used in tail modeling practice is somewhere in the interval between $n=50\ldots 1000$ data points. This amount of data is another restriction in tail modeling. This limitation follows from practice and will be considered in the further analysis.

3 Density of the finite sample distribution of the quantile $q_{\alpha}$

Based on the results of Smith, (1987) and Embrechts et al., (2003), we derive by a straightforward calculation a very good approximation of the density for the finite sample distribution of the quantile estimator $\hat{q}_{\alpha}$ . The technical details of the calculation can be found in A; here, we only note the final result:

[TABLE]

with $\psi(u)=\frac{u}{(1-\alpha)^{-u}-1}$ and $n$ being the sample size; $\alpha$ is the confidence level, and $\sigma$ and $\xi$ are the actual parameters of the GPD.

As an example with an arbitrary parameter configuration, Fig. 1 shows a comparison of the theoretical density, Eq. (3), with the corresponding empirical density.

In addition, the graph shows the location of the actual quantile $q_{0.999}$ and the expected value $\textrm{E}[\hat{q}_{0.999}]$ and the standard deviation of the quantile estimator when considering a sample of $n=100$ data points. Note that the expected value is above the actual value. This observation means that the quantile estimator $\hat{q}_{\alpha}$ is positively biased, with $B=2.475$ .

For various parameter configurations, the performed goodness-of-fit tests do not reject the hypothesis that the theoretical density, Eq. (3), describes the empirical density. However, for very small sample lengths $n<50$ and as $\xi$ becomes significantly less than zero, this observation changes. The reason for this is that with this parameter configuration, the assumption of the asymptotic normal distribution (Smith,, 1987), used by us, for the maximum likelihood estimators $\hat{\xi}$ and $\hat{\sigma}$ is clearly not justified; see also A. However, this assumption is used to calculate the density $f_{q}$ and works well for a sample length of $n>50$ and the parameter range $\xi=0.0,\ldots,0.5$ . The latter range covers all applications occurring in the financial sector where the measured data belong to a distribution function with a possibly fat tail. In addition, with this parameter interval, the mappable value range of the data is not limited by the tail model upwards; see also Sec. 2.1. Therefore, in the further investigations, we concentrate on this parameter range. We also assume that data scaling is possible in practice, so in the following sections we focus on the parameter setting $\sigma=1$ . Furthermore, we focus on the area of high quantiles, which is important for regulators and auditors, at a confidence level of $\alpha=0.999$ . However, the following calculation can also be performed with a different set of parameters.

4 Finite sample bias and variance of the quantile estimator

In general, the bias $B$ is calculated as the expected value:

[TABLE]

The actual quantile $q_{\alpha}$ and the estimator of the quantile $\hat{q}_{\alpha}$ depend on the confidence level $\alpha$ , the scale parameter $\sigma$ and the shape parameter $\xi$ . Furthermore, the estimator of the quantile also depends on the sample length $n$ .

With $\sigma=1$ and $\alpha=0.999$ , the expectation value in Eq. (9) is calculated with the density $f_{q}$ of Eq. (3) for a sample length of $n=50,\ldots,1000$ and shape parameters $\xi=0.0,\ldots,0.5$ . The finite sample bias $B(n,\xi)=B(n,0.999,1,\xi)$ calculated in this case is shown graphically in a log-log plot in Fig. 2.

As can be seen, the bias decreases with increasing sample length. This indicates that the quantile estimator converges to the actual quantile and has the statistical property of asymptotic consistency. We will examine this further in B. There is also a clear dependence on the shape parameter. The larger the parameter $\xi$ , the greater the bias is. This observation means that in practice, the fatter the tail, the greater the expected deviation of the quantile estimator is from the unknown actual quantile.

In the same way, with the density $f_{q}$ , the variance of the quantile estimator can also be calculated.

The log-log plot in Fig. 3 shows the dependence of the variance on the modifiable parameters $n$ and $\xi$ . It can be seen that the variance also converges to zero as the amount of data increases. In addition, as with the bias, it can be seen that there is a clear dependence on the shape parameter. The larger the parameter $\xi$ , the greater the variance is. This observation means that in practice, the fatter the tail, the greater the variance of the quantile estimator is.

The results obtained here coincide with the observations of Hosking and Wallis, (1987). For certain parameter configurations, our results can be compared with their results; Table 4 (bias) and Table 5 (variance) in Hosking and Wallis, (1987). Although the methods are very different, numerically the results show a good agreement.

In practice, therefore, the estimate of the quantile should be subject to great uncertainties, especially for small datasets. In addition, the uncertainty increases when the whole dataset is based on an unknown distribution function that has a fat tail. The bias and the variance, which can be determined with the density $f_{q}$ , are likely to influence every modeling approach and thus represent a limit of the modeling accuracy. If risk models are adjusted as best as possible, then this accuracy – described by the calculated bias and the calculated variance – can be achieved. The calculated values – bias and variance – can be used as a benchmark for the evaluation of a specific modeling approach that aims to assess the risk at high quantiles.

5 Correction of the bias of the quantile estimator

In Sec. 4, it was found that the quantile estimator $\hat{q}$ is systematically above the true value $q$ and thus has a certain positive bias $B$ . To improve the risk assessment in practice, a method that corrects the bias of the quantile estimator would be useful. The conception of such a procedure is derived in this section from the previous results. The correction method provides a way to mitigate in practice the influence of the bias.

5.1 Quantile bias correction – state of the art

In the field of climatology, the necessity of bias corrections, especially corrections of the quantile estimator, has been discussed for a long time. Therefore, there are already a great number of methods in climatology to correct the bias of different estimators such as the estimator of a certain quantile, cf., e.g., Schmidli et al., (2006); Sun et al., (2011); Themeßl et al., (2011); Fang et al., (2015); Jeon et al., (2016); Hoffmann et al., (2018) and the related overwhelming references cited therein. Additionally, in the financial industry, with the tightening of regulatory requirements, such topics could become even more present. The question is whether the simple transfer of existing methods of climatology into the financial sector is fruitful or whether another approach is appropriate.

Recently, two methods (Hoffmann et al.,, 2018) have prevailed in climatology from the original five methods (Fang et al.,, 2015) and are often used to correct the bias of the quantile.

Local intensity scaling described in (Schmidli et al.,, 2006; Hoffmann et al.,, 2018). The basic idea is an adjustment of the mean value of a simulated (or measured) finite sample and a reproduction of an ”adequate” mean value, i.e., a comparison value from reference data. From the comparison of the two mean values, a correction factor is calculated, and the individual measured or simulated values are corrected accordingly. Implicitly, the quantile is also corrected. 2. 2.

Analytical quantile mapping described in (Sun et al.,, 2011; Themeßl et al.,, 2011; Hoffmann et al.,, 2018). The basic idea is that for simulated (or measured) data and for reference data, the GPD is adjusted in the tail area as an analytical distribution function. From the deviation of the two tail models, an analytical transfer function is determined. With the transfer function, the individual measured or simulated values are corrected accordingly, and as before, implicitly, the quantile is also corrected.

The methods described have been developed for specific purposes under certain conditions that are not necessarily found in the financial sector. The methods are therefore unlikely to be transferred to problems in the financial sector. For example, there are usually no reference data available in the financial area, and if so, other reference data or other reference periods will provide different corrections of the bias of the quantile estimator. In addition, the ”manipulation” of the measured data should also be viewed very critically by the statistician and the regulatory authorities, because other values determined from the dataset also change or the connection between different statistical quantities is disturbed. The latter problems are similarly discussed in climatology (Hoffmann et al.,, 2018). However, an essential point is not taken into account in the described methods and has not yet been considered to the best of the authors’ knowledge: the finite number of data points itself already causes a bias in the quantile estimator. This bias can be determined analytically (see Sec. 4) and should be taken into account in further risk assessments, especially in finance. This will be considered below. From Eq. (7), it can be seen that the bias of $\hat{q}_{\alpha}$ has an effect on $\hat{Q}_{\alpha}$ . If the bias is determined for $\hat{q}_{\alpha}$ , an impact analysis on the estimator $\hat{Q}_{\alpha}$ can be carried out in practice in individual cases.

5.2 Quantile bias correction – a formula for practice

In Fig. 2, the bias is shown as a function of the sample size $n$ and the pre-given shape parameter $\xi$ . The figure shows only a small number of parallel graphs. In fact, $\xi$ was varied in smaller steps between 0.0 and 0.5. With the calculated data points $B$ as a function of the sample size $n$ and the shape parameter $\xi$ , we performed nonlinear regression. The formula for the basic model can be read from the graphic and has the following form:

[TABLE]

Nonlinear regression yielded the following parameters:

[TABLE]

For reasons of simplification, the parameters are rounded. Then, the following formula for the bias of the quantile estimator can be used for a practical application:

[TABLE]

Hence, an estimated quantile $\hat{q}_{0.999}$ can be shifted by $\tilde{q}_{0.999}=\hat{q}_{0.999}-B(n,\hat{\xi})$ . This should in practice lead to a new estimator $\tilde{q}_{0.999}$ of the quantile closer to the actual, generally unknown value of the quantile ${q}_{0.999}$ .

It should be noted that Eq. (11) applies to a confidence level of $\alpha=0.999$ and a scale parameter of $\sigma=1$ . Other configurations can be similarly calculated using the finite sample density of the quantile estimator Eq. (3). With $B(n,\alpha,\sigma,\xi)=\textrm{E}[\hat{q}_{\alpha}]-q_{\alpha}$ and $\tilde{q}_{\alpha}=\hat{q}_{\alpha}-B(n,\alpha,\sigma,\hat{\xi})$ , it follows that $\textrm{E}[\tilde{q}_{\alpha}]=q_{\alpha}$ .

Finally, an important property can be observed from Eq. (11): As $n\rightarrow\infty$ , the bias becomes 0 – independent of $\xi$ – indicating again that the quantile estimator $\hat{q}_{\alpha}$ is asymptotically consistent and converges to the true value of the quantile ${q}_{\alpha}$ . In fact, it can be shown theoretically in a simple way that the estimator $\hat{q}_{\alpha}$ of the quantile of the GPD is asymptotically consistent. Technical details of the calculation are shown in B.

6 Discussion and Conclusion

In financial practice, there is a regulatory need to estimate quantiles at high confidence levels from measured data (iid) with an unknown distribution function. In this case, a GPD is usually adapted as a tail model for the unknown distribution function for a subset of the measured data. The required quantile $\hat{Q}_{\alpha}$ is then calculated with the inverse function of the estimated GPD, Eq. (4), or likewise with the estimated quantile $\hat{q}_{\alpha}$ , Eq. (7).

In this article, we considered the statistical property of the quantile estimator $\hat{q}_{\alpha}$ . The investigations focused on the parameter range $\xi=0.0,\ldots,0.5$ , which is important for risk assessment. The GPD then has the support $\mathbb{R}^{\geq 0}$ and can be used as a model for unknown distribution functions that have a fat tail.

The starting point of our analyses was the density of the finite sample distribution of the quantile estimator for the quantiles of the GPD. Thus, the finite sample variance and the finite sample bias of the quantile estimator could be determined. Further, we showed that the quantile estimator is asymptotically consistent and converges to the true value for large datasets. For practical applications, an approximate correction formula, Eq. (11), was derived, which can mitigate the negative influence of the bias on the estimated quantile of an unknown distribution function.

Generally, the results show that in practice, for finite, small datasets, the quantile $\hat{q}_{\alpha}$ determined is positive biased ( $B>0$ ) and has considerable uncertainty ( $\textrm{Var}[\hat{q}_{\alpha}]\gg 0$ as $n<50$ ). The calculation shown is universal, so that for other confidence levels or scale parameters the bias and the variance of the quantile estimator can be calculated. This may allow new perspectives for the review of risk assessment procedures and should also be of interest to audit authorities and regulators.

The results presented represent a lower limit of the accuracy in quantile estimation and can be used in other works as an absolute benchmark for the quality of the quantile estimate, e.g., in automated procedures for threshold detection and tail modeling, cf., e.g., Hoffmann and Börner, (2018) and the references cited therein. Future theoretical work will address the complete statistical properties of the quantile estimator $\hat{Q}_{\alpha}$ , Eq. (7). In another branch, the examination can be extended to the parameter area where the GPD has compact support, i.e., $\xi=-0.5,\ldots,0.0$ .

Appendix A Derivation of the density $f_{q}(z;n,\alpha,\sigma,\xi)$

The goal is to obtain an approximation of the finite sample density for the estimator of the quantile $\hat{q}_{\alpha}$ of the GPD, Eq. (1). As a starting point, we take the results of Smith, (1987) and Embrechts et al., (2003) in the notation of the latter author. Let $\hat{\sigma}$ and $\hat{\xi}$ be the finite sample maximum likelihood estimators for the actual scale parameter $\sigma$ and the actual shape parameter $\xi>-0.5$ . Both estimators are determined for a sample of length $n$ . Then, the distribution of the random vector ${\bf X}=\sqrt{n}(\hat{\xi}-\xi,\frac{\hat{\sigma}}{\sigma}-1)^{\prime}$ converges to a centered normal distribution with covariance matrix:

[TABLE]

This means that ${\bf X}\stackrel{{\scriptstyle d}}{{\rightarrow}}{\cal N}({\bf 0},{\bf M}^{-1})$ if $n\rightarrow\infty$ .

Defining vector ${\bf a}=-\sqrt{n}(\xi,1)^{\prime}$ and matrix ${\bf B}=\sqrt{n}\,\textrm{diag}(1,\frac{1}{\sigma})$ , the vector ${\bf Y}=(\hat{\xi},\hat{\sigma})^{\prime}$ is given by the affine map: ${\bf Y}={\bf B}^{-1}({\bf X}-{\bf a})$ . Then, the distribution of the random vector ${\bf Y}$ converges to a normal distribution with parameters:

[TABLE]

Thus, ${\bf Y}\stackrel{{\scriptstyle d}}{{\rightarrow}}{\cal N}(\boldsymbol{\mu},{\bf C})$ if $n\rightarrow\infty$ . Hence, the probability density function of the vector ${\bf Y}$ for finite $n$ is approximately given by the density $f_{\bf Y}(u,v)$ of a bivariate normal distribution with the parameters given in Eq. 13. Note that $u$ and $v$ represent the integration variables used later for $\hat{\xi}$ and $\hat{\sigma}$ .

Standard methods, cf., e.g., Kendall and Stuart, (1976), then lead us by a straightforward calculation to the distribution function $\Phi(\hat{q}_{\alpha})$ of the estimator of the quantile. By performing a double integration of $f_{\bf Y}(u,v)$ within a region $\cal D$ , the sought density for the quantile estimator can be obtained from the resulting formula.

The integration domain $\cal D$ is determined from the estimator of the quantile:

[TABLE]

Thus, the double integration is carried out as follows:

[TABLE]

with $\psi(u)=\frac{u}{(1-\alpha)^{-u}-1}$ . After a suitable substitution – introducing a new integration variable $z$ – and an exchange of the order of integration,

[TABLE]

Simple algebraic transformations lead to the representation of the desired density $f_{q}(z)$ shown in Eq. (3).

Appendix B Asymptotic consistence of the quantile estimator

The above results in A make it easy to show that the estimator $\hat{q}_{\alpha}$ of the quantile of the GPD is asymptotically consistent.

First, we notice that with the renormalized matrix ${\tilde{\bf C}}=n{\bf C}$ of the covariance matrix, Eq. (13), and its Cholesky decomposition ${\tilde{\bf C}}^{-1}=\big{(}{\tilde{\bf C}}^{-\frac{1}{2}}\big{)}^{\prime}\big{(}{\tilde{\bf C}}^{-\frac{1}{2}}\big{)}$ , the density $f_{\bf Y}(u,v)$ of the bivariate normal distribution becomes a bivariate Dirac delta function for $n\rightarrow\infty$ . Using the algebraic properties of the delta function, cf., e.g., Oldham et al., (2009), one quickly obtains:

[TABLE]

where $\xi$ and $\sigma$ are the actual parameters of the GPD. Substituting Eq. (17) into Eq. (16) leads to:

[TABLE]

This means that the distributions of the estimate $\hat{q}_{\alpha}$ of the quantile become more and more concentrated near the true value of the quantile $q_{\alpha}$ being estimated, if $n\rightarrow\infty$ , so that the probability of the estimator being arbitrarily close to $q_{\alpha}$ converges to one.

References

Abramowitz and Stegun, (2014) Abramowitz, M., Stegun, I.A. (editors), 2014. Handbook of mathematical functions with formulas, graphs, and mathematical tables, National Bureau of Standards (Washington DC), Martino Publishing, reprint of 1964 Edition, Mansfield Centre.
Aggarwal, (1955) Aggarwal, O.P., 1955. Some minimax invariant procedures for estimating a cumulative distribution function. The Annals of Mathematical Statistics 26, 450-463.
Balkema and de Haan, (1974) Balkema, A.A., de Haan, L., 1974. Residual life time at great age. The Annals of Probability 2, 792-804.
Basel Commitee, (2004) Basel Commitee on Banking Supervision, 2004. International convergence of capital measurement and capital standards – a revised framework. Bank for International Settlements, Basel, Switzerland.
Basel Commitee, (2009) Basel Commitee on Banking Supervision, 2009. Observed range of practice in key elements of advanced measurement approaches (AMA). Bank for International Settlements, Basel, Switzerland.
Choulakian and Stephens, (2001) Choulakian, V., Stephens, M.A., 2001. Goodness-of-Fit tests for the generalized pareto distribution. Technometrics 43, 478-484.
Davison, (1984) Davison, A.C., 1984. Modelling excess over high thresholds, with an application. Statistical Extremes and Applications. Ed. J. Tiago de Oliviera, D. Reidel Publishing Company, Dordrecht, Netherlands, 461-482.
Davison and Smith, (1990) Davison, A.C., Smith, R.L., 1990. Models for exceedances over high thresholds (with comments). Journal of the Royal Statistical Society, Series B (Methodological) 52, 393-442.
Directive, (2009) Directive 2009/138/EC of the European Parliament and of the Council of 25 November, 2009. On the taking-up and pursuit of the business of insurance and reinsurance (Solvency II). Official Journal of the European Union 52, L355.
Directive, (2013) Directive 2013/36/EU of the European Parliament and of the Council of 26 June, 2013. On access to the activity of credit institutions and the prudential supervision of credit institutions and investment firms, amending Directive 2002/87/EC and repealing Directives 2006/48/EC and 2006/49/EC. Official Journal of the European Union 56, L176.
Dutta and Perry, (2007) Dutta, K., Perry, J., 2007. A tale of tails: An empirical analysis of loss distribution models for estimating operational risk capital. Federal Reserve Bank of Boston, Working Papers 6/13, Boston.
Embrechts et al., (2003) Embrechts, P., Klüppelberg, C., Mikosch, T., 2003. Modelling extremal events: for insurance and finance (stochastic modelling and applied probability). Springer, corrected 4th printing, Berlin, Germany.
Fang et al., (2015) Fang, G.H., Yang, J., Chen, Y.N., Zammit, C., 2015. Comparing bias correction methods in downscaling meteorological variables for a hydrologic impact study in an arid area in China. Hydrology and Earth System Sciences 19, 2547-2559.
Gnedenko, (1943) Gnedenko, B.V., 1943. Sur la distribution limite du terme maximum d’une série aléatoire. Annals of Mathematics 44, 423-453.
Hoffmann et al., (2018) Hoffmann, P., Menz, Chr., Spekat, A., 2018. Bias adjustment for threshold-based climate indicators. Advances in Science & Research 15, 107-116.
Hoffmann and Börner, (2018) Hoffmann, I., Börner, Chr.J., 2018. Body and Tail - Separating the distribution function by an efficient tail-detecting procedure in risk management. Preprint 04/2018. ArXiv:1805.10040.
Hosking and Wallis, (1987) Hosking, J.R.M., Wallis, J.R., 1987. Parameter and quantile estimation for the generalized Pareto distribution. Technometrics 29, 339-349.
Jeon et al., (2016) Jeon, S., Paciorek, Chr.J., Wehner, M.F., 2016. Quantile-based bias correction and uncertainty quantification of extreme event attribution statements. Weather and Climate Extremes 12, 24-32.
Kendall and Stuart, (1976) Kendall, M.G., Stuart, A., 1976. The advanced theory of statistics. Charles Griffin & Company Limited, 4th Edition, London & High Wycombe, England.
McNeil and Saladin, (1997) McNeil, A.J., Saladin, T., 1997. The peak over threshold method for estimating high quantiles of loss distributions. Proceedings of XXVIIth International Astin Colloquium, Cairns, Australia, 23–43.
van Montfort and Witter, (1985) van Montfort, M.A.J., Witter, J.V., 1985. Testing exponentiality against generalized Pareto distribution. Journal of Hydrology 78, 305-315.
Moscadelli, (2004) Moscadelli, M., 2004. The modelling of operational risk: experience with the analysis of the data collected by the Basel Committee. Banca D’Italia, Temi di discussione 517, Italy.
Oldham et al., (2009) Oldham, K., Myland, J., Spanier, J., 2009. An atlas of functions. Springer Science + Business Media, 2th Edition, New York.
Pickands, (1975) Pickands III, J., 1975. Statistical inference using extreme order statistics. The Annals of Statistics 3, 119-131.
Regulation, (2013) Regulation (EU) No 575/2013 of the European Parliament and of the Council of 26 June, 2013. On prudential requirements for credit institutions and investment firms and amending Regulation (EU) No 648/2012. Official Journal of the European Union 56, L176.
Schmidli et al., (2006) Schmidli, J., Frei, Chr., Vidale, P.L., 2006. Downscaling from GCM precipitation: a benchmark for dynamical and statistical downscaling methods. International Journal of Climatology 26, 679-689.
Smith, (1984) Smith, R.L., 1984. Threshold methods for sample extremes. Statistical Extremes and Applications. Ed. J. Tiago de Oliviera, D. Reidel Publishing Company, Dordrecht, Netherlands, 621-638.
Smith, (1985) Smith, R.L., 1985. Maximum likelihood estimation in a class of nonregular cases. Biometrica 72, 67-90.
Smith, (1987) Smith, R.L., 1987. Estimating tails of probability distributions. The Annals of Statistics 15, 1174-1207.
Sun et al., (2011) Sun, F., Roderic, M.L., Lim, W.H., Farquhar, G.D., 2011. Hydroclimatic projections for the Murray-Darling Basin based on an ensemble derived from Intergovernmental Panel on Climate Change AR4 climate models. Water Resources Research 47, https://doi.org/10.1029/2010WR009829.
Themeßl et al., (2011) Themeßl, M.J., Gobiet, A., Heinrich, G., 2011. Empirical-statistical downscaling and error correction of regional climate models and its impact on the climate change signal. Climatic Change 112, 449–468.

Bibliography31

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Abramowitz and Stegun, (2014) Abramowitz, M., Stegun, I.A. (editors), 2014. Handbook of mathematical functions with formulas, graphs, and mathematical tables, National Bureau of Standards (Washington DC), Martino Publishing, reprint of 1964 Edition, Mansfield Centre.
2Aggarwal, (1955) Aggarwal, O.P., 1955. Some minimax invariant procedures for estimating a cumulative distribution function. The Annals of Mathematical Statistics 26, 450-463.
3Balkema and de Haan, (1974) Balkema, A.A., de Haan, L., 1974. Residual life time at great age. The Annals of Probability 2, 792-804.
4Basel Commitee, (2004) Basel Commitee on Banking Supervision, 2004. International convergence of capital measurement and capital standards – a revised framework. Bank for International Settlements, Basel, Switzerland.
5Basel Commitee, (2009) Basel Commitee on Banking Supervision, 2009. Observed range of practice in key elements of advanced measurement approaches (AMA). Bank for International Settlements, Basel, Switzerland.
6Choulakian and Stephens, (2001) Choulakian, V., Stephens, M.A., 2001. Goodness-of-Fit tests for the generalized pareto distribution. Technometrics 43, 478-484.
7Davison, (1984) Davison, A.C., 1984. Modelling excess over high thresholds, with an application. Statistical Extremes and Applications. Ed. J. Tiago de Oliviera, D. Reidel Publishing Company, Dordrecht, Netherlands, 461-482.
8Davison and Smith, (1990) Davison, A.C., Smith, R.L., 1990. Models for exceedances over high thresholds (with comments). Journal of the Royal Statistical Society, Series B (Methodological) 52, 393-442.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Tail models and the statistical limit of accuracy

Abstract

keywords:

1 Introduction

2 Risk assessment at high quantiles

2.1 Model of the tail of a distribution

2.2 Determination of the model parameters

2.3 Estimation of high quantiles.

2.4 Practical considerations – restrictions due to the database.

3 Density of the finite sample distribution of the quantile qαq_{\alpha}qα​

4 Finite sample bias and variance of the quantile estimator

5 Correction of the bias of the quantile estimator

5.1 Quantile bias correction – state of the art

5.2 Quantile bias correction – a formula for practice

6 Discussion and Conclusion

Appendix A Derivation of the density fq(z;n,α,σ,ξ)f_{q}(z;n,\alpha,\sigma,\xi)fq​(z;n,α,σ,ξ)

Appendix B Asymptotic consistence of the quantile estimator

References

3 Density of the finite sample distribution of the quantile $q_{\alpha}$

Appendix A Derivation of the density $f_{q}(z;n,\alpha,\sigma,\xi)$