
TL;DR
This paper determines the number of standard deviations needed for stress shocks to surpass historical observations considering kurtosis, validating stress test models and improving bounds over classical inequalities.
Contribution
It introduces a method to calculate stress shock sizes accounting for kurtosis, enhancing the accuracy of stress testing and bounds on deviations.
Findings
Stress shocks exceeding historical maxima can be quantified using kurtosis.
Validation of the Brace-Lauer-Rado stress test model with new bounds.
Tighter bounds than classical Chebyshev's Inequality extensions.
Abstract
Stress shocks are often calculated as multiples of the standard deviation of a history set. This paper investigates how many standard deviations are required to guarantee that this shock exceeds any observation within the history set, given the additional constraint of kurtosis. The results of this analysis are then used to validate the shocks produced by some stress test models, in particular that of Brace-Lauer-Rado. A secondary application of our results is to investigate three known extensions of Chebyshev's Inequality where the kurtosis is known. It is found that our results give a tighter bound than the well-known inequalities.
| a = (Max-mean)/StDev | |||||
|---|---|---|---|---|---|
| N-1 | sqrt(N-1) | kurtosis = 7 | kurtosis = 10 | kurtosis = 13 | kurtosis = 16 |
| 250 | 15.811 | 6.296 | 6.952 | 7.46 | 7.881 |
| 500 | 22.361 | 7.464 | 8.247 | 8.853 | 9.355 |
| 1,000 | 31.623 | 8.855 | 9.789 | 10.511 | 11.109 |
| 10,000 | 100.000 | 15.682 | 17.349 | 18.638 | 19.705 |
| 100,000 | 316.22 | 27.849 | 30.817 | 33.113 | 35.011 |
| 1,000,000 | 1,000.000 | 49.502 | 54.781 | 58.865 | 62.241 |
| 833,208 | 912.802 | 47.296 | 52.339 | 56.241 | 59.467 |
| Tail Factor | ||||||
| Deg. freedom | 3 | 4 | 5 | 6 | ||
| N/A | N/A | 6 | 3 | |||
| 250 | 6.322 | 4.908 | 4.262 | 3.898 | 6.023 | 4.828 |
| 500 | 8.053 | 5.951 | 5.030 | 4.524 | 7.138 | 5.709 |
| 1,000 | 10.215 | 7.173 | 5.893 | 5.208 | 8.466 | 6.760 |
| 10,000 | 22.204 | 13.034 | 9.678 | 8.025 | 14.987 | 11.934 |
| 100,000 | 47.928 | 23.332 | 15.547 | 12.032 | 26.610 | 21.171 |
| 1,000,000 | 103.299 | 41.578 | 24.771 | 17.83 | 47.298 | 37.619 |
| Tail Factor | ||||
|---|---|---|---|---|
| kurtosis = 7 | kurtosis = 10 | kurtosis = 13 | kurtosis = 16 | |
| 1m | 13.648 | 17.485 | 20.445 | 22.873 |
| 2m | 13.397 | 17.148 | 20.041 | 22.412 |
| 3m | 13.278 | 16.986 | 19.846 | 22.190 |
| 4m | 13.204 | 16.886 | 19.726 | 22.053 |
| 5m | 13.153 | 16.817 | 19.642 | 21.958 |
| 6m | 13.115 | 16.765 | 19.579 | 21.886 |
| N-1 | kurtosis = 7 | kurtosis = 10 | kurtosis = 13 | kurtosis = 16 |
|---|---|---|---|---|
| 250 | 6.468 | 7.071 | 7.550 | 7.953 |
| 500 | 7.692 | 8.409 | 8.979 | 9.457 |
| 1,000 | 9.147 | 10.000 | 10.678 | 11.247 |
| 10,000 | 16.266 | 17.783 | 18.988 | 20.000 |
| 100,000 | 28.925 | 31.623 | 33.766 | 35.566 |
| 1,000,000 | 51.437 | 56.234 | 60.046 | 63.246 |
| kurtosis = 7 | kurtosis = 10 | kurtosis = 13 | kurtosis = 16 | |
|---|---|---|---|---|
| 2 | 1.137 | 2.044 | 2.044 | 2.044 |
| 3 | 0.704 | 0.973 | 1.302 | 1.302 |
| 4 | 0.405 | 0.486 | 0.680 | 0.794 |
| 5 | 0.219 | 0.297 | 0.358 | 0.428 |
| 6 | 0.125 | 0.166 | 0.205 | 0.238 |
| 7 | 0.068 | 0.091 | 0.116 | 0.137 |
| 8 | 0.039 | 0.052 | 0.064 | 0.076 |
| 9 | 0.021 | 0.029 | 0.036 | 0.043 |
| 10 | 0.012 | 0.016 | 0.020 | 0.024 |
| kurtosis = 7 | kurtosis = 10 | kurtosis = 13 | kurtosis = 16 | |
| 250 | 0.00398 | 0.00400 | 0.00396 | 0.00400 |
| 500 | 0.00199 | 0.00200 | 0.00200 | 0.00199 |
| 1000 | 0.00100 | 0.00100 | 0.00100 | 0.00100 |
| 10000 | 0.00010 | 0.00001 | 0.00010 | 0.00010 |
| 100000 | 0.00001 | 0.00001 | 0.00001 | 0.00001 |
| 1000000 | 0 | 0 | 0 | 0 |
| kurtosis = 7 | kurtosis = 10 | kurtosis = 13 | kurtosis = 16 | |
|---|---|---|---|---|
| 250 | 251 | 250 | 253 | 250 |
| 500 | 503 | 501 | 500 | 502 |
| 1000 | 1000 | 1000 | 1002 | 1000 |
| 10000 | 10001 | 10001 | 10001 | 10001 |
| 100000 | 100000 | 100000 | 100000 | 100000 |
| 1000000 | 1000002 | 1000000 | 1000000 | 1000001 |
| kurtosis = 7 | kurtosis = 10 | kurtosis = 13 | kurtosis = 16 | |
|---|---|---|---|---|
| 10,000 | 0.003453 | 0.002974 | 0.002646 | 0.002407 |
| 100,000 | 0.001099 | 0.000945 | 0.00084 | 0.000764 |
| 1,000,000 | 0.000349 | 0.000299 | 0.000266 | 0.000242 |
| 10,000,000 | 0.00011 | 0.000095 | 0.000084 | 0.000076 |
| 100,000,000 | 0.000035 | 0.00003 | 0.000027 | 0.000024 |
| a = (Max-mean)/StDev | |||||
| N-1 | sqrt(N-1) | Bi-modal | Tri-modal | Two-thirds | Uniform |
| 500 | 22.38 | 9.35 | 9.30 | 9.30 | 9.26 |
| 1000 | 31.62 | 11.10 | 11.03 | 11.04 | 10.99 |
| 2000 | 44.72 | 13.19 | 13.10 | 13.10 | 13.04 |
| 3000 | 54.77 | 14.59 | 14.49 | 14.49 | 14.42 |
| 4000 | 63.24 | 15.68 | 15.56 | 15.56 | 15.49 |
| 5000 | 70.71 | 16.57 | 16.45 | 16.45 | 16.37 |
| 10000 | 100 | 19.70 | 19.55 | 19.55 | 19.45 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Fatigue and fracture mechanics · Mathematical Approximation and Integration
How Big Should a Stress Shock Be?
David G. [email protected]
1 Introduction
A common way to determine stress shock sizes is as a multiple, , of the daily standard deviation, , of the risk factor. is typically calibrated on history set of daily rate changes of a suitable length, . The multiple, , is sometimes referred to as a tail factor. This can be computed from the inverse of the cumulative distribution function. For example, if the returns were modelled on the normal distribution, then the tail factor for a 1-in-1,000,000 day shock would be 4.75.
Stress models are rarely published as 1) they are proprietary, and 2) there is no regulatory requirement to do so. As a result, to our knowledge there is no literature on the performance or validation of these models. Indeed, stress models cannot be effectively backtested as they are designed to produce shocks large enough so as not to have backtesting exceptions.
The following stress models have been communicated to the author:
- •
Several banks assume that returns follow a Student-t distribution with low degrees of freedom (c.f. [5]). For example, 3 degrees of freedom (the lowest degree for which the distribution has a finite variance) produces a tail factor of . The length of the history set varies between banks.
- •
One Australian bank uses a tail factor of 7, with an additional add-on for less liquid assets. The history set used for calibration is the VaR history set of 2 years.
- •
One Canadian Bank and one Australian bank use the stress model of Brace-Lauer-Rado [3], which is primarily parameterised by a given value of kurtosis to produce a tail factor. The length of the history set varies between each of these banks.
This paper investigates the following question: Given a stress shock of size , can we be assured that this shock exceeds any observed shock in the historical data? After all, if a shock was found in the last year or two to be larger than the derived stress shock, then this effectively invalidates the stress model.
The approach of this paper to this problem is to extend the results of Samuelson [8] (see also [4] and the references therein). Samuelson shows that no single value can lie more than deviations from the mean by examining the endpoint example where one observation is equal to 1, with the remaining observations set to zero. The main result of this paper is to impose conditions on the kurtosis of this type of endpoint distribution (section 2), which is then applied to validate the stress test models presented above, in particular that of Brace-Lauer-Rado [3] (section 3). This is a novel approach, which could be considered the first to analyse stress models with a falsifiable test.
Following the outline of Samuelson [8], we then use this endpoint distribution to examine three extensions of Chebyshev’s Inequality where the kurtosis is known. These are a version of Chebyshev’s Inequality with higher moments of even order, Zelen’s Inequality [11], and Bhattacharyya’s Inequality [2]. Similarly to Samuelson’s results, we give a tighter bound for three known extensions of Chebyshev’s inequality in the finite case.
2 Incorporating Kurtosis into Samuelson’s distribution
In [8], Samuelson shows that no single value can lie more than deviations from the mean. This is achieved by constructing a finite point distribution where one observation is equal to 1, with the remaining observations set to zero. This distribution is unrealistic as a model for extreme moves of asset prices. It would be equivalent to the market never ever moving except for a large jump one day in 10,000 (or even more), and that our estimate of the volatility is driven entirely by the size of that jump. The reality is that asset prices do move, with varying and clustered volatility, and with more frequent smaller jumps too.
To incorporate these features into Samuelson’s distribution, we will place a restriction on the kurtosis. Focussing on kurtosis is ideal for the validation of a stress model as it does not impose any other assumptions about the distribution, only a general metric of how heavy-tailed the distribution is.
Thus, let us consider distributions which have low kurtosis, plus one outlier. The bi-modal distribution with points at , which has a kurtosis of 1, is the obvious candidate. This distribution, for a fixed value of kurtosis, enables the value of to be large.
To rigorously prove that a distribution, which for a fixed value of kurtosis, maximises the value of , is a complex optimisation problem, analogous to that posed in [8]. To give some certainty this distribution cannot be improved upon, it was compared to three other distributions in the Appendix: neither produced a larger value for than the bi-modal. But moreover, did not differ very much for any choice of distribution, giving credibility to our choice of kurtosis being a good restriction to make, as it allows for asset prices to move - and move in different ways - with some impact on the volatility aside from the one large shock.
So to paraphrase [8]: How deviant can one be when the distribution has a restriction of kurtosis imposed? Consider the bi-modal distribution with one extreme point, which we call :
[TABLE]
with , and are normalised such that , , and . Thus, is the maximum possible number of standard deviations away from the mean. We prove the following:
Proposition 1**.**
Suppose is odd. The value is given by
[TABLE]
where
[TABLE]
*Furthermore, for large .
Proof: Suppose that the mean of the non-normalised distribution is . To ease computation, assume that is odd. Subtracting from every observation, except , gives
[TABLE]
From the second moment:
[TABLE]
Since and , the kurtosis is simply :
[TABLE]
Substituting :
[TABLE]
Rearranging, and multiplying by gives the following quadratic in :
[TABLE]
where
[TABLE]
The value of as a function of and is given by the quadratic formula, then taking the square root of the positive case. This gives this first statement of the proposition.
For the second statement, observe that letting become large yields
[TABLE]
for large .
Remark: (the number of standard deviations above the mean) grows with a leading term of in Samuelson’s distribution, but our distribution grows much slowly in . The fact that also grows proportionately to for large will be important regarding Chebyshev’s Inequality in section 4.
Values of for a given , and various values of kurtosis, are tabulated below:
Thus, if a bank were to target a AA rating, it would need to consider its survivability over a 833,209 day ( year) history set. For the unconstrained case, a tail factor of approximately 913 standard deviations would be required. For a distribution with the choices of kurtosis presented here, a tail factor of 47 to 60 standard deviations would be required. This result in a stress shock of a large magnitude, we would consider it far more reasonable.
There may be other considerations for the size of the history set considered - not least the fact that a 3,333 year history set of daily prices is impossible to obtain! For example, the Basel Committee on Banking Supervision has recently recommended as part of its “Fundamental review of the trading book” (see [1]) that the Internal Model be calibrated on a history set of 10 years. This choice of history set length results in a tail factor of 11 to 14 standard deviations with our above choices of kurtosis presented here. This is still a reasonably sized stress shock, though obviously not as large.
In fact, the above results can be used to construct a stress shocks, without any distributional assumptions beyond kurtosis. For example, is best for targeting a AA rating, is suitable for the consideration of once-in-generation shocks, and for shocks that occur in a typical business cycle (say).
In the next section a validation of the shocks produced by a stress model that emphasises kurtosis is performed by comparing the shocks given to the results above.
3 Validation of Stress Models
In this section the above result are applied to validate the stress shock that are produced by the three stress models outlined in the Introduction: 1) Stress shocks based on the Student-t distribution, 2) a tail factor of 7, calibrated on a history set of 2 years, and 3) the stress model of Brace-Lauer-Rado [3].
3.1 Student-t Distribution
The Student-t distribution has a finite standard deviation only when the degrees of freedom are above 2, and finite kurtosis only when the degrees of freedom are above 4. Tabulated below are the tail factors for the Student-t distribution with degrees of freedom 3, 4, 5 and 6. Against these are the values of when 6 and 3, corresponding to the degrees of freedom cases 5 and 6.
Only when the degree of freedom is 3 can it be assured that the stress shock will exceed the maximum value when the constraint of kurtosis is in place. But as this distribution does not have a finite value of kurtosis, the comparison cannot be made.
3.2 Tail factor of 7
The tail factor of 7 used by one Australian bank would appear to be quite inadequate. According to the values in Table 1, a risk factor with a kurtosis of 7 could produce a value that exceeds the stress shock when calibrated on a two year history set.
3.3 Brace-Lauer-Rado Model
In [3], the authors present a model for determining stress test shocks for a risk factor , based on the stochastic volatility models of Scott [9] and Wiggins [10]:
[TABLE]
[TABLE]
The model has three parameters: , the skewness, , which controls the rate of mean-reversion in volatility back to the Normal over time, and , which is connected to the kurtosis. This model has a number of desirable features for stress tests. For example, it possesses fat tails, but it is relatively straightforward to calculate quantiles for liquidity holding periods.
The instantaneous kurtosis of is given by:
[TABLE]
As values of lead to unique values of for a given , the model can be re-cast in terms of , , and . The authors then present the tail factors targeting a AA rating, ie, a survival probability of 0.9997 various values of (ie, the expected time of mean reversion), and . In fact, the authors present these tail factors for various liquidity holding periods, but here we just consider the 1 day holding period:
Comparing these tail factors to the values of in Table 1, the tail factors given by the model for the case of kurtosis = 7 and could be violated in a history set of 5,000 observations - that is, 20 years of data (assuming 250 business days to a year). For the remaining cases of kurtosis = 10, 13 and 16, this is 9,000, 12,250, and 15,250, respectively. That is, 36, 49, and 61 years of data, respectively.
It could be said that these values are a little low, given that our earlier results showed that for targeting the survivability of a AA rated institution. However, only in few cases can a daily history set be obtained that is of this length, so the existence of an observation in the history set that exceeds the stress shock will be very rare.
4 Chebyshev’s Inequality with Higher moments
As noted in Samuelson [8], the construction of these distributions is closely connected to Chebyshev’s inequality. In this section we show that our endpoint distribution gives a tighter bound than three extensions of Chebyshev’s Inequality where the kurtosis is known - to our knowledge, these are the only three that consider the use of kurtosis. These are a version of Chebyshev’s Inequality with higher moments of even order, Zelen’s Inequality [11], and Bhattacharyya’s Inequality [2]. It is found that Bhattacharyya’s Inequality is not sharp.
Chebyshev’s inequality is given by:
[TABLE]
In a finite world of observations, setting gives the endpoint:
[TABLE]
which Samuelson’s example marginally betters to
[TABLE]
To apply our results concerning , observe the Bi-modal distribution with one extreme point (normalised such that and and ) yields the inequality
[TABLE]
so if an extended Chebyshev’s inequality gives a threshold greater than , or a probability less than , then our finite point distribution has been shown to give a tighter bound. We now consider the three extensions Chebyshev’s inequality mentioned above.
4.1 Higher moments of even order
For any integer
[TABLE]
which is a straightforward consequence of Markov’s inequality. See [6], for example. Setting and gives the endpoint:
[TABLE]
Values of are tabulated below. Note that these are all greater than the values of given in Table 1 in Section 2, and greater than the limit of . That is, in a finite world of observations, (our distribution conditioned on kurtosis) improves on this instance of Chebyshev’s inequality.
4.2 Zelen’s Inequality
For the case of where the third and fourth moments are known, formulae have been given by Zelen [11]:
[TABLE]
where
[TABLE]
Since the distribution normalised and , then . Furthermore, the kurtosis is specified for the distribution. may be expressed in terms of and , the latter being a function of and :
Proposition 2**.**
[TABLE]
where is given by Proposition 1:
[TABLE]
*Furthermore, for large .
Proof:
[TABLE]
For large, substituting gives:
[TABLE]
Values of are tabulated below against for various values of kurtosis. Interestingly, convergence to the limit of is not monotone, and requires a larger value of to converge than for :
Setting , values of Zelen’s probability are given below:
The inverse of the above probabilities are inverted to give “1-in-” values. Thus, it can be seen that our distribution gives a slightly tighter bound than Zelen, but only in the finite point case.
4.3 Bhattacharyya’s Inequality
Another extension was given by Bhattacharyya [2]:
[TABLE]
where
[TABLE]
Analogous to Zelen’s Inequality, set and . To satisfy , must be suitably large, over 10,000 for our distribution . Our distribution gives a much more tighter bound than Bhattacharyya’s inequality than it did for Zelen, but again, only in the finite point case. For example, when the probabilities are approximately 1-in-10,000.
5 Conclusion
This paper considers how big a stress shock should be, based on how many multiples of the standard deviations are required to guarantee that the shock exceeds any observation within the history set. By imposing the additional constraint of kurtosis, this lends itself to a more realistic description of asset price movements than that given by Samuelson’s inequality [8].
A distribution was constructed that maximises the number of standard deviations above the mean that a single point can be for a fixed value of kurtosis. This was then used to validate several stress models, in particular that of Brace-Lauer-Rado [3], which has a primary parameter of kurtosis. Furthermore, the constructed distribution can itself be used to derive stress shocks.
Following the outline of Samuelson’s paper [8], we use our distribution to examine three extensions of Chebyshev’s Inequality where the kurtosis is known. It is found that our results give a tighter bound than the well-known inequalities, particularly that of Bhattacharyya’s Inequality [2] is not, but only in the finite point case.
6 Declarations of Interest
The authors report no conflicts of interest. The authors alone are responsible for the content and writing of the paper.
7 Appendix
In Section 2, this paper uses the distribution of a bi-modal set of points, plus one outlier. The intention is to have a distribution, which for a fixed value of kurtosis, enables the value of to be large. As the bi-modal set of points has the lowest possible kurtosis, this should give the largest value of for any given kurtosis. Indeed, at first glance Samuelson’s example could be a candidate distribution that satisfies these requirements since, but the kurtosis here is .
To rigorously construct a distribution, which for a fixed value of kurtosis, maximises the value of , is a complex optimisation problem:
[TABLE]
To give some certainty this distribution cannot be improved upon, three other distributions were considered to test if the above construction could be bettered:
A tri-modal distribution, with an equal number of points are (kurtosis of 1.5),
The distribution with two-thirds of the points at 0, and the remaining third at (kurtosis of 1.5),
The Uniform distribution between and (kurtosis of 1.8).
Since the statistic and the kurtosis are invariant under dilation and translation, the distributions are then re-scaled, and the outlier found using a simple search routine such that the kurtosis of the distribution is as desired.
Although each produced a value for close to the bi-modal, it was less in all cases:
Furthermore, as remarked in Section 2, does not differ very much for any choice of distribution. This is desirable, and gives credibility to our choice of kurtosis being a good restriction to make, as it allows for asset prices to move - and move in different ways - with some impact on the volatility, not just from the one large shock.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Basel Committee on Banking Supervision (2018) Revisions to the minimum capital requirements for market risk, https://www.bis.org/bcbs/publ/d 436.htm
- 2[2] Bhattacharyya, B. B. (1987). One-sided chebyshev inequality when the first four moments are known, Communications in Statistics – Theory and Methods. 16 (9):2789 – – – 2791.
- 3[3] Brace, A., Lauer, M., and Rado, M. (2006) A Stylised Model for Extreme Shocks: Four Moments of the Apocalypse, UTS Research Paper, https://www.researchgate.net/publication/23697087_A_Stylised_Model_for_Extreme_Shocks_Four_Moments_of_the_Apocalypse
- 4[4] Jensen, S. T. (1999) The Laguerre – – – Samuelson Inequality with Extensions and Applications in Statistics and Matrix Theory, M Sc Thesis, Department of Mathematics and Statistics, Mc Gill University.
- 5[5] Maher, D. (2011) On the use of t-copulas for economic capital calculations, Journal of Risk Model Validation, 5 (3), 21-36.
- 6[6] Mitzenmacher, M. and Upfal, E (2005). Probability and Computing: Randomized Algorithms and Probabilistic Analysis. Cambridge Univ. Press.
- 7[7] Platen, E., and Renata, R. (2008) Empirical Evidence on Student − - t Log-Returns of Diversified World Stock Indices, Journal of Statistical Theory and Practice, 2 (2), 233-251.
- 8[8] Samuelson, P. A. (1968) How deviant can you be?, J. Amer. Statist. Assoc., 63 , 1522-1525.
