How big should a Stress Shock be?

David G Maher

arXiv:1905.10164·q-fin.RM·May 27, 2019

How big should a Stress Shock be?

David G Maher

PDF

Open Access

TL;DR

This paper determines the number of standard deviations needed for stress shocks to surpass historical observations considering kurtosis, validating stress test models and improving bounds over classical inequalities.

Contribution

It introduces a method to calculate stress shock sizes accounting for kurtosis, enhancing the accuracy of stress testing and bounds on deviations.

Findings

01

Stress shocks exceeding historical maxima can be quantified using kurtosis.

02

Validation of the Brace-Lauer-Rado stress test model with new bounds.

03

Tighter bounds than classical Chebyshev's Inequality extensions.

Abstract

Stress shocks are often calculated as multiples of the standard deviation of a history set. This paper investigates how many standard deviations are required to guarantee that this shock exceeds any observation within the history set, given the additional constraint of kurtosis. The results of this analysis are then used to validate the shocks produced by some stress test models, in particular that of Brace-Lauer-Rado. A secondary application of our results is to investigate three known extensions of Chebyshev's Inequality where the kurtosis is known. It is found that our results give a tighter bound than the well-known inequalities.

Tables9

Table 1. Table 1: a = ( M a x − m e a n ) / S t D e v 𝑎 𝑀 𝑎 𝑥 𝑚 𝑒 𝑎 𝑛 𝑆 𝑡 𝐷 𝑒 𝑣 a=(Max-mean)/StDev for given N 𝑁 N and kurtosis.

		a = (Max-mean)/StDev
N-1	sqrt(N-1)	kurtosis = 7	kurtosis = 10	kurtosis = 13	kurtosis = 16
250	15.811	6.296	6.952	7.46	7.881
500	22.361	7.464	8.247	8.853	9.355
1,000	31.623	8.855	9.789	10.511	11.109
10,000	100.000	15.682	17.349	18.638	19.705
100,000	316.22	27.849	30.817	33.113	35.011
1,000,000	1,000.000	49.502	54.781	58.865	62.241
833,208	912.802	47.296	52.339	56.241	59.467

Table 2. Table 2: Tail Factors for the Student-t distribution, versus a ( N , κ ) 𝑎 𝑁 𝜅 a(N,\kappa) .

	Tail Factor
Deg. freedom	3	4	5	6
$N \ κ$	N/A	N/A	6	3	$a (N, κ = 6)$	$a (N, κ = 3)$
250	6.322	4.908	4.262	3.898	6.023	4.828
500	8.053	5.951	5.030	4.524	7.138	5.709
1,000	10.215	7.173	5.893	5.208	8.466	6.760
10,000	22.204	13.034	9.678	8.025	14.987	11.934
100,000	47.928	23.332	15.547	12.032	26.610	21.171
1,000,000	103.299	41.578	24.771	17.83	47.298	37.619

Table 3. Table 3: Tail Factors of the Brace-Lauer-Rado Stress Model.

	Tail Factor
$g^{- 1}$	kurtosis = 7	kurtosis = 10	kurtosis = 13	kurtosis = 16
1m	13.648	17.485	20.445	22.873
2m	13.397	17.148	20.041	22.412
3m	13.278	16.986	19.846	22.190
4m	13.204	16.886	19.726	22.053
5m	13.153	16.817	19.642	21.958
6m	13.115	16.765	19.579	21.886

Table 4. Table 4: Values of ( κ N ) 1 / 4 superscript 𝜅 𝑁 1 4 (\kappa N)^{1/4} .

	${(κ N)}^{1 / 4}$
N-1	kurtosis = 7	kurtosis = 10	kurtosis = 13	kurtosis = 16
250	6.468	7.071	7.550	7.953
500	7.692	8.409	8.979	9.457
1,000	9.147	10.000	10.678	11.247
10,000	16.266	17.783	18.988	20.000
100,000	28.925	31.623	33.766	35.566
1,000,000	51.437	56.234	60.046	63.246

Table 5. Table 5: Values of 𝔼 ( X 3 ) 𝔼 superscript 𝑋 3 \mathbb{E}(X^{3}) .

	$𝔼 (X^{3})$
${Log}_{10} (N)$	kurtosis = 7	kurtosis = 10	kurtosis = 13	kurtosis = 16
2	1.137	2.044	2.044	2.044
3	0.704	0.973	1.302	1.302
4	0.405	0.486	0.680	0.794
5	0.219	0.297	0.358	0.428
6	0.125	0.166	0.205	0.238
7	0.068	0.091	0.116	0.137
8	0.039	0.052	0.064	0.076
9	0.021	0.029	0.036	0.043
10	0.012	0.016	0.020	0.024

Table 6. Table 6: Values of Zelen’s probability, [ 1 + a 2 + ( a 2 − a θ 3 − 1 ) 2 θ 4 − θ 3 2 − 1 ] − 1 superscript delimited-[] 1 superscript 𝑎 2 superscript superscript 𝑎 2 𝑎 subscript 𝜃 3 1 2 subscript 𝜃 4 subscript superscript 𝜃 2 3 1 1 \biggl{[}1+a^{2}+\frac{(a^{2}-a\theta_{3}-1)^{2}}{\theta_{4}-\theta^{2}_{3}-1}\biggr{]}^{-1} for the Bi-modal distribution with one extreme point.

	${[1 + a^{2} + \frac{{(a^{2} - a θ_{3} - 1)}^{2}}{θ_{4} - θ_{3}^{2} - 1}]}^{- 1}$
$N - 1$	kurtosis = 7	kurtosis = 10	kurtosis = 13	kurtosis = 16
250	0.00398	0.00400	0.00396	0.00400
500	0.00199	0.00200	0.00200	0.00199
1000	0.00100	0.00100	0.00100	0.00100
10000	0.00010	0.00001	0.00010	0.00010
100000	0.00001	0.00001	0.00001	0.00001
1000000	0	0	0	0

Table 7. Table 7: Values of Zelen’s probability for the Bi-modal distribution with one extreme point, given as “1-in-” values.

	$1 + a^{2} + \frac{{(a^{2} - a θ_{3} - 1)}^{2}}{θ_{4} - θ_{3}^{2} - 1}$
$N - 1$	kurtosis = 7	kurtosis = 10	kurtosis = 13	kurtosis = 16
250	251	250	253	250
500	503	501	500	502
1000	1000	1000	1002	1000
10000	10001	10001	10001	10001
100000	100000	100000	100000	100000
1000000	1000002	1000000	1000000	1000001

Table 8. Table 8: Values of Bhattacharyya’s probability, κ − θ 3 2 − 1 ( κ − θ 3 2 − 1 ) ( 1 + a 2 ) + ( a 2 − a θ 3 − 1 ) 𝜅 superscript subscript 𝜃 3 2 1 𝜅 superscript subscript 𝜃 3 2 1 1 superscript 𝑎 2 superscript 𝑎 2 𝑎 subscript 𝜃 3 1 \frac{\kappa-\theta_{3}^{2}-1}{(\kappa-\theta_{3}^{2}-1)(1+a^{2})+(a^{2}-a\theta_{3}-1)} for the Bi-modal distribution with one extreme point.

	$\frac{κ - θ_{3}^{2} - 1}{(κ - θ_{3}^{2} - 1) (1 + a^{2}) + (a^{2} - a θ_{3} - 1)}$
$N - 1$	kurtosis = 7	kurtosis = 10	kurtosis = 13	kurtosis = 16
10,000	0.003453	0.002974	0.002646	0.002407
100,000	0.001099	0.000945	0.00084	0.000764
1,000,000	0.000349	0.000299	0.000266	0.000242
10,000,000	0.00011	0.000095	0.000084	0.000076
100,000,000	0.000035	0.00003	0.000027	0.000024

Table 9. Table 9: Extreme point values for the considered example distributions.

		a = (Max-mean)/StDev
N-1	sqrt(N-1)	Bi-modal	Tri-modal	Two-thirds	Uniform
500	22.38	9.35	9.30	9.30	9.26
1000	31.62	11.10	11.03	11.04	10.99
2000	44.72	13.19	13.10	13.10	13.04
3000	54.77	14.59	14.49	14.49	14.42
4000	63.24	15.68	15.56	15.56	15.49
5000	70.71	16.57	16.45	16.45	16.37
10000	100	19.70	19.55	19.55	19.45

Equations72

Φ_{N} = (X_{1}, X_{2}, X_{3}, X_{4}, X_{5}, \dots, X_{N}) = (a, b, - b, \dots, b, - b)

Φ_{N} = (X_{1}, X_{2}, X_{3}, X_{4}, X_{5}, \dots, X_{N}) = (a, b, - b, \dots, b, - b)

a=a(N,\kappa)=\sqrt{-\frac{N-1}{N+1}+\sqrt{\biggl{(}\frac{N-1}{N+1}\biggr{)}^{2}-G(N,\kappa)}}

a=a(N,\kappa)=\sqrt{-\frac{N-1}{N+1}+\sqrt{\biggl{(}\frac{N-1}{N+1}\biggr{)}^{2}-G(N,\kappa)}}

G (N, κ) = \frac{N ( N - 1 ) ^{2} - ( N - 1 ) ^{3} κ}{( N + 1 ) ( N - 3 )}

G (N, κ) = \frac{N ( N - 1 ) ^{2} - ( N - 1 ) ^{3} κ}{( N + 1 ) ( N - 3 )}

E (X) = \frac{1}{N} i = 1 \sum N X_{i}

E (X) = \frac{1}{N} i = 1 \sum N X_{i}

= \frac{a}{N} - \frac{N - 1}{N} . \frac{a}{y}

\Rightarrow y = N - 1

E (X^{2}) = \frac{1}{N} i = 1 \sum N X_{i}^{2}

E (X^{2}) = \frac{1}{N} i = 1 \sum N X_{i}^{2}

= \frac{a ^{2}}{( N - 1 )} + \frac{N - 1}{N} b^{2}

\Rightarrow b^{2} = \frac{N}{N - 1} - a^{2} \frac{N}{( N - 1 ) ^{2}}

κ = E (X^{4}) = \frac{1}{N} i = 1 \sum N X_{i}^{4}

κ = E (X^{4}) = \frac{1}{N} i = 1 \sum N X_{i}^{4}

= \frac{a ^{4}}{N} + \frac{N - 1}{2 N} (2 b^{4} + 12 \frac{b ^{2} a ^{2}}{( N - 1 ) ^{2}} + 2 \frac{a ^{4}}{( N - 1 ) ^{4}})

\displaystyle\kappa=\frac{a^{4}}{N}+\frac{N-1}{N}\biggl{(}\Bigl{(}\frac{N}{N-1}-a^{2}\frac{N}{(N-1)^{2}}\Bigr{)}^{2}+6\frac{\Bigl{(}\frac{N}{N-1}-a^{2}\frac{N}{(N-1)^{2}}\Bigr{)}a^{2}}{(N-1)^{2}}+\frac{a^{4}}{(N-1)^{4}}\biggr{)}

\displaystyle\kappa=\frac{a^{4}}{N}+\frac{N-1}{N}\biggl{(}\Bigl{(}\frac{N}{N-1}-a^{2}\frac{N}{(N-1)^{2}}\Bigr{)}^{2}+6\frac{\Bigl{(}\frac{N}{N-1}-a^{2}\frac{N}{(N-1)^{2}}\Bigr{)}a^{2}}{(N-1)^{2}}+\frac{a^{4}}{(N-1)^{4}}\biggr{)}

\displaystyle=\frac{a^{4}}{N}+\frac{N-1}{N}\biggl{(}\Bigl{(}\frac{N^{2}}{(N-1)^{2}}-\frac{2N^{2}a^{2}}{(N-1)^{3}}+\frac{N^{2}a^{4}}{(N-1)^{4}}\Bigr{)}+\frac{6Na^{2}}{(N-1)^{3}}-\frac{6Na^{4}}{(N-1)^{4}}+\frac{a^{4}}{(N-1)^{4}}\biggr{)}

= \frac{a ^{4}}{N} + \frac{N ( N - 1 )}{( N - 1 ) ^{2}} + \frac{( 6 - 2 N ) a ^{2}}{( N - 1 ) ^{2}} + \frac{( N ^{2} - 6 N + 1 ) a ^{4}}{N ( N - 1 ) ^{3}}

= \frac{N ( N - 1 )}{( N - 1 ) ^{2}} + \frac{6 - 2 N}{( N - 1 ) ^{2}} a^{2} + \frac{N ^{2} - 6 N + 1 + ( N - 1 ) ^{3}}{N ( N - 1 ) ^{3}} a^{4}

= \frac{N ( N - 1 )}{( N - 1 ) ^{2}} + \frac{- 2 ( N - 3 )}{( N - 1 ) ^{2}} a^{2} + \frac{( N + 1 ) ( N - 3 )}{( N - 1 ) ^{3}} a^{4}

0 = (a^{2})^{2} - 2 \frac{N - 1}{N + 1} (a^{2}) + G (N, κ)

0 = (a^{2})^{2} - 2 \frac{N - 1}{N + 1} (a^{2}) + G (N, κ)

G (N, κ) = \frac{N ( N - 1 ) ^{2} - ( N - 1 ) ^{3} κ}{( N + 1 ) ( N - 3 )}

G (N, κ) = \frac{N ( N - 1 ) ^{2} - ( N - 1 ) ^{3} κ}{( N + 1 ) ( N - 3 )}

a = a (N, κ) = - 1 + 1 + N (κ - 1) \sim [N (κ - 1)]^{1/4}

a = a (N, κ) = - 1 + 1 + N (κ - 1) \sim [N (κ - 1)]^{1/4}

X_{t} = exp Y_{t} ab c d e d Y_{t} = exp \frac{1}{2} V_{t} d W_{t}^{(1)}

X_{t} = exp Y_{t} ab c d e d Y_{t} = exp \frac{1}{2} V_{t} d W_{t}^{(1)}

d V_{t} = - g V_{t} d t + h d W_{t}^{(2)} ab c d e ⟨ d W_{t}^{(1)}, d W_{t}^{(2)} ⟩ = ρ d t

d V_{t} = - g V_{t} d t + h d W_{t}^{(2)} ab c d e ⟨ d W_{t}^{(1)}, d W_{t}^{(2)} ⟩ = ρ d t

κ := \frac{E [ d Y _{t}^{4} ]}{E [ d Y _{t}^{2} ] ^{2}} = 3 exp \frac{h ^{2}}{2 g}

κ := \frac{E [ d Y _{t}^{4} ]}{E [ d Y _{t}^{2} ] ^{2}} = 3 exp \frac{h ^{2}}{2 g}

P (∣ X - \overset{ˉ}{X} ∣ \geq t σ) \leq \frac{1}{t ^{2}}

P (∣ X - \overset{ˉ}{X} ∣ \geq t σ) \leq \frac{1}{t ^{2}}

P (∣ X - \overset{ˉ}{X} ∣ \geq N σ) \leq \frac{1}{N}

P (∣ X - \overset{ˉ}{X} ∣ \geq N σ) \leq \frac{1}{N}

P (∣ X - \overset{ˉ}{X} ∣ \geq N - 1 σ) \leq \frac{1}{N}

P (∣ X - \overset{ˉ}{X} ∣ \geq N - 1 σ) \leq \frac{1}{N}

P (∣ X - \overset{ˉ}{X} ∣ \geq a) \leq \frac{1}{N}

P (∣ X - \overset{ˉ}{X} ∣ \geq a) \leq \frac{1}{N}

P\Bigl{(}|X-\bar{X}|\geq t\mathbb{E}[(X-\mathbb{E}[X])^{2k}]^{1/2k}\Bigr{)}\leq\frac{1}{t^{2k}}

P\Bigl{(}|X-\bar{X}|\geq t\mathbb{E}[(X-\mathbb{E}[X])^{2k}]^{1/2k}\Bigr{)}\leq\frac{1}{t^{2k}}

P\Bigl{(}|X-\bar{X}|\geq(\kappa N)^{1/4}\Bigr{)}\leq\frac{1}{N}

P\Bigl{(}|X-\bar{X}|\geq(\kappa N)^{1/4}\Bigr{)}\leq\frac{1}{N}

P(|X-\bar{X}|\geq t\sigma)\leq\biggl{[}1+t^{2}+\frac{(t^{2}-t\theta_{3}-1)^{2}}{\theta_{4}-\theta^{2}_{3}-1}\biggr{]}^{-1}

P(|X-\bar{X}|\geq t\sigma)\leq\biggl{[}1+t^{2}+\frac{(t^{2}-t\theta_{3}-1)^{2}}{\theta_{4}-\theta^{2}_{3}-1}\biggr{]}^{-1}

t \geq \frac{θ _{3} + θ _{3}^{2} + 4}{2} ab c d e and ab c d e θ_{j} = \frac{E [ X ^{j} ]}{σ ^{j}}

t \geq \frac{θ _{3} + θ _{3}^{2} + 4}{2} ab c d e and ab c d e θ_{j} = \frac{E [ X ^{j} ]}{σ ^{j}}

E (X^{3}) = \frac{- 3}{N - 1} a + \frac{( N + 1 )}{( N - 1 ) ^{2}} a^{3}

E (X^{3}) = \frac{- 3}{N - 1} a + \frac{( N + 1 )}{( N - 1 ) ^{2}} a^{3}

a=a(N,\kappa)=\sqrt{-\frac{N-1}{N+1}+\sqrt{\biggl{(}\frac{N-1}{N+1}\biggr{)}^{2}-G(N,\kappa)}}

a=a(N,\kappa)=\sqrt{-\frac{N-1}{N+1}+\sqrt{\biggl{(}\frac{N-1}{N+1}\biggr{)}^{2}-G(N,\kappa)}}

E (X^{3}) = i = 1 \sum N X_{i}^{3}

E (X^{3}) = i = 1 \sum N X_{i}^{3}

= \frac{a ^{3}}{N} - \frac{3 b ^{2} a}{N} - \frac{a ^{3}}{N ( N - 1 ) ^{2}}

\displaystyle=\frac{a^{3}(N-1)^{2}-3a\bigl{(}N(N-1)-a^{2}N\bigr{)}-a^{3}}{N(N-1)^{2}}

= \frac{- 3}{N - 1} a + \frac{( N - 1 ) ^{2} + 3 N - 1}{N ( N - 1 ) ^{2}} a^{3}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsProbabilistic and Robust Engineering Design · Fatigue and fracture mechanics · Mathematical Approximation and Integration

Full text

How Big Should a Stress Shock Be?

David G. [email protected]

1 Introduction

A common way to determine stress shock sizes is as a multiple, $k$ , of the daily standard deviation, $\sigma$ , of the risk factor. $\sigma$ is typically calibrated on history set of daily rate changes of a suitable length, $N$ . The multiple, $k$ , is sometimes referred to as a tail factor. This can be computed from the inverse of the cumulative distribution function. For example, if the returns were modelled on the normal distribution, then the tail factor for a 1-in-1,000,000 day shock would be 4.75.

Stress models are rarely published as 1) they are proprietary, and 2) there is no regulatory requirement to do so. As a result, to our knowledge there is no literature on the performance or validation of these models. Indeed, stress models cannot be effectively backtested as they are designed to produce shocks large enough so as not to have backtesting exceptions.

The following stress models have been communicated to the author:

•

Several banks assume that returns follow a Student-t distribution with low degrees of freedom (c.f. [5]). For example, 3 degrees of freedom (the lowest degree for which the distribution has a finite variance) produces a tail factor of $103.3$ . The length of the history set varies between banks.

•

One Australian bank uses a tail factor of 7, with an additional add-on for less liquid assets. The history set used for calibration is the VaR history set of 2 years.

•

One Canadian Bank and one Australian bank use the stress model of Brace-Lauer-Rado [3], which is primarily parameterised by a given value of kurtosis to produce a tail factor. The length of the history set varies between each of these banks.

This paper investigates the following question: Given a stress shock of size $k\sigma$ , can we be assured that this shock exceeds any observed shock in the historical data? After all, if a shock was found in the last year or two to be larger than the derived stress shock, then this effectively invalidates the stress model.

The approach of this paper to this problem is to extend the results of Samuelson [8] (see also [4] and the references therein). Samuelson shows that no single value can lie more than $\sqrt{N-1}$ deviations from the mean by examining the endpoint example where one observation is equal to 1, with the remaining $N-1$ observations set to zero. The main result of this paper is to impose conditions on the kurtosis of this type of endpoint distribution (section 2), which is then applied to validate the stress test models presented above, in particular that of Brace-Lauer-Rado [3] (section 3). This is a novel approach, which could be considered the first to analyse stress models with a falsifiable test.

Following the outline of Samuelson [8], we then use this endpoint distribution to examine three extensions of Chebyshev’s Inequality where the kurtosis is known. These are a version of Chebyshev’s Inequality with higher moments of even order, Zelen’s Inequality [11], and Bhattacharyya’s Inequality [2]. Similarly to Samuelson’s results, we give a tighter bound for three known extensions of Chebyshev’s inequality in the finite case.

2 Incorporating Kurtosis into Samuelson’s distribution

In [8], Samuelson shows that no single value can lie more than $\sqrt{N-1}$ deviations from the mean. This is achieved by constructing a finite point distribution where one observation is equal to 1, with the remaining $N-1$ observations set to zero. This distribution is unrealistic as a model for extreme moves of asset prices. It would be equivalent to the market never ever moving except for a large jump one day in 10,000 (or even more), and that our estimate of the volatility is driven entirely by the size of that jump. The reality is that asset prices do move, with varying and clustered volatility, and with more frequent smaller jumps too.

To incorporate these features into Samuelson’s distribution, we will place a restriction on the kurtosis. Focussing on kurtosis is ideal for the validation of a stress model as it does not impose any other assumptions about the distribution, only a general metric of how heavy-tailed the distribution is.

Thus, let us consider distributions which have low kurtosis, plus one outlier. The bi-modal distribution with points at $\pm 1$ , which has a kurtosis of 1, is the obvious candidate. This distribution, for a fixed value of kurtosis, enables the value of $(X_{1}-\bar{X})/\sigma$ to be large.

To rigorously prove that a distribution, which for a fixed value of kurtosis, maximises the value of $(X_{1}-\bar{X})/\sigma$ , is a complex optimisation problem, analogous to that posed in [8]. To give some certainty this distribution cannot be improved upon, it was compared to three other distributions in the Appendix: neither produced a larger value for $(X_{1}-\bar{X})/\sigma$ than the bi-modal. But moreover, $(X_{1}-\bar{X})/\sigma$ did not differ very much for any choice of distribution, giving credibility to our choice of kurtosis being a good restriction to make, as it allows for asset prices to move - and move in different ways - with some impact on the volatility aside from the one large shock.

So to paraphrase [8]: How deviant can one be when the distribution has a restriction of kurtosis imposed? Consider the bi-modal distribution with one extreme point, which we call $\Phi_{N}$ :

[TABLE]

with $a$ , $b$ and $c$ are normalised such that $\mathbb{E}(X)=0$ , $\mathbb{E}(X^{2})=1$ , and $\mathbb{E}(X^{4})=\kappa$ . Thus, $a$ is the maximum possible number of standard deviations away from the mean. We prove the following:

Proposition 1.

Suppose $N\geq 5$ is odd. The value $a=a(N,\kappa)$ is given by

[TABLE]

where

[TABLE]

*Furthermore, $a\sim[N(\kappa-1)]^{1/4}$ for large $N$ .

Proof: Suppose that the mean of the non-normalised distribution is $\frac{a}{y}$ . To ease computation, assume that $N$ is odd. Subtracting $\frac{a}{y}$ from every observation, except $X_{1}$ , gives

[TABLE]

From the second moment:

[TABLE]

Since $\mathbb{E}(X)=0$ and $\mathbb{E}(X^{2})=1$ , the kurtosis $\kappa$ is simply $\mathbb{E}(X^{4})$ :

[TABLE]

Substituting $b^{2}=\frac{N}{N-1}-a^{2}\frac{N}{(N-1)^{2}}$ :

[TABLE]

Rearranging, and multiplying by $\frac{(N-1)^{3}}{(N+1)(N-3)}$ gives the following quadratic in $a^{2}$ :

[TABLE]

where

[TABLE]

The value of $a$ as a function of $\kappa$ and $N$ is given by the quadratic formula, then taking the square root of the positive case. This gives this first statement of the proposition.

For the second statement, observe that letting $N$ become large yields

[TABLE]

for large $N$ . $\phantom{abcde}\square$

Remark: $a$ (the number of standard deviations above the mean) grows with a leading term of $N^{1/2}$ in Samuelson’s distribution, but our distribution grows much slowly in $N^{1/4}$ . The fact that $a$ also grows proportionately to $[N(\kappa-1)]^{1/4}$ for large $N$ will be important regarding Chebyshev’s Inequality in section 4.

Values of $a$ for a given $N$ , and various values of kurtosis, are tabulated below:

Thus, if a bank were to target a AA rating, it would need to consider its survivability over a 833,209 day ( $\approx 3333$ year) history set. For the unconstrained case, a tail factor of approximately 913 standard deviations would be required. For a distribution with the choices of kurtosis presented here, a tail factor of 47 to 60 standard deviations would be required. This result in a stress shock of a large magnitude, we would consider it far more reasonable.

There may be other considerations for the size of the history set considered - not least the fact that a 3,333 year history set of daily prices is impossible to obtain! For example, the Basel Committee on Banking Supervision has recently recommended as part of its “Fundamental review of the trading book” (see [1]) that the Internal Model be calibrated on a history set of 10 years. This choice of history set length results in a tail factor of 11 to 14 standard deviations with our above choices of kurtosis presented here. This is still a reasonably sized stress shock, though obviously not as large.

In fact, the above results can be used to construct a stress shocks, without any distributional assumptions beyond kurtosis. For example, $N=833,209$ is best for targeting a AA rating, $N=10,000$ is suitable for the consideration of once-in-generation shocks, and $N=3,000$ for shocks that occur in a typical business cycle (say).

In the next section a validation of the shocks produced by a stress model that emphasises kurtosis is performed by comparing the shocks given to the results above.

3 Validation of Stress Models

In this section the above result are applied to validate the stress shock that are produced by the three stress models outlined in the Introduction: 1) Stress shocks based on the Student-t distribution, 2) a tail factor of 7, calibrated on a history set of 2 years, and 3) the stress model of Brace-Lauer-Rado [3].

3.1 Student-t Distribution

The Student-t distribution has a finite standard deviation only when the degrees of freedom are above 2, and finite kurtosis only when the degrees of freedom are above 4. Tabulated below are the tail factors for the Student-t distribution with degrees of freedom 3, 4, 5 and 6. Against these are the values of $a(N,\kappa)$ when $\kappa=$ 6 and 3, corresponding to the degrees of freedom cases 5 and 6.

Only when the degree of freedom is 3 can it be assured that the stress shock will exceed the maximum value when the constraint of kurtosis is in place. But as this distribution does not have a finite value of kurtosis, the comparison cannot be made.

3.2 Tail factor of 7

The tail factor of 7 used by one Australian bank would appear to be quite inadequate. According to the values in Table 1, a risk factor with a kurtosis of 7 could produce a value that exceeds the stress shock when calibrated on a two year history set.

3.3 Brace-Lauer-Rado Model

In [3], the authors present a model for determining stress test shocks for a risk factor $X_{t}$ , based on the stochastic volatility models of Scott [9] and Wiggins [10]:

[TABLE]

The model has three parameters: $\rho$ , the skewness, $g$ , which controls the rate of mean-reversion in volatility back to the Normal over time, and $h$ , which is connected to the kurtosis. This model has a number of desirable features for stress tests. For example, it possesses fat tails, but it is relatively straightforward to calculate quantiles for liquidity holding periods.

The instantaneous kurtosis of $Y_{t}$ is given by:

[TABLE]

As values of $h$ lead to unique values of $\kappa$ for a given $g$ , the model can be re-cast in terms of $\rho$ , $g$ , and $\kappa$ . The authors then present the tail factors targeting a AA rating, ie, a survival probability of 0.9997 various values of $g^{-1}$ (ie, the expected time of mean reversion), and $\rho=0.5$ . In fact, the authors present these tail factors for various liquidity holding periods, but here we just consider the 1 day holding period:

Comparing these tail factors to the values of $a=(Max-mean)/StDev$ in Table 1, the tail factors given by the model for the case of kurtosis = 7 and $g^{-1}=6m$ could be violated in a history set of 5,000 observations - that is, 20 years of data (assuming 250 business days to a year). For the remaining cases of kurtosis = 10, 13 and 16, this is 9,000, 12,250, and 15,250, respectively. That is, 36, 49, and 61 years of data, respectively.

It could be said that these values are a little low, given that our earlier results showed that for targeting the survivability of a AA rated institution. However, only in few cases can a daily history set be obtained that is of this length, so the existence of an observation in the history set that exceeds the stress shock will be very rare.

4 Chebyshev’s Inequality with Higher moments

As noted in Samuelson [8], the construction of these distributions is closely connected to Chebyshev’s inequality. In this section we show that our endpoint distribution $\Phi_{N}$ gives a tighter bound than three extensions of Chebyshev’s Inequality where the kurtosis is known - to our knowledge, these are the only three that consider the use of kurtosis. These are a version of Chebyshev’s Inequality with higher moments of even order, Zelen’s Inequality [11], and Bhattacharyya’s Inequality [2]. It is found that Bhattacharyya’s Inequality is not sharp.

Chebyshev’s inequality is given by:

[TABLE]

In a finite world of $N$ observations, setting $t=\sqrt{N}$ gives the endpoint:

[TABLE]

which Samuelson’s example marginally betters to

[TABLE]

To apply our results concerning $\Phi_{N}$ , observe the Bi-modal distribution with one extreme point $a$ (normalised such that $\mathbb{E}[X]=0$ and $\mathbb{E}[X^{2}]=1$ and $\mathbb{E}[X^{4}]=\kappa$ ) yields the inequality

[TABLE]

so if an extended Chebyshev’s inequality gives a threshold greater than $a$ , or a probability less than $\tfrac{1}{N}$ , then our finite point distribution $\Phi_{N}$ has been shown to give a tighter bound. We now consider the three extensions Chebyshev’s inequality mentioned above.

4.1 Higher moments of even order

For any integer $k>0$

[TABLE]

which is a straightforward consequence of Markov’s inequality. See [6], for example. Setting $t=N^{1/4}$ and $k=2$ gives the endpoint:

[TABLE]

Values of $(\kappa N)^{1/4}$ are tabulated below. Note that these are all greater than the values of $a$ given in Table 1 in Section 2, and greater than the limit of $((\kappa-1)N)^{1/4}$ . That is, in a finite world of $N$ observations, $\Phi$ (our distribution conditioned on kurtosis) improves on this instance of Chebyshev’s inequality.

4.2 Zelen’s Inequality

For the case of where the third and fourth moments are known, formulae have been given by Zelen [11]:

[TABLE]

where

[TABLE]

Since the distribution $\Phi$ normalised $\mathbb{E}[X]=0$ and $\sigma^{2}=\mathbb{E}[X^{2}]=1$ , then $\theta_{j}=\mathbb{E}[X^{j}]$ . Furthermore, the kurtosis $\kappa=\mathbb{E}[X^{4}]$ is specified for the distribution. $\mathbb{E}[X^{3}]$ may be expressed in terms of $N$ and $a$ , the latter being a function of $N$ and $\kappa$ :

Proposition 2.

[TABLE]

where $a$ is given by Proposition 1:

[TABLE]

*Furthermore, $\mathbb{E}(X^{3})\sim-3(\kappa-1)^{1/4}N^{-3/4}+(\kappa-1)^{3/4}N^{-1/4}$ for large $N$ .

Proof:

[TABLE]

For $N$ large, substituting $a=[(\kappa-1)N]^{1/4}$ gives:

[TABLE]

Values of $\mathbb{E}(X^{3})$ are tabulated below against $\text{Log}_{10}(N)$ for various values of kurtosis. Interestingly, convergence to the limit of $\kappa-1$ is not monotone, and requires a larger value of $N$ to converge than for $a$ :

Setting $t=a$ , values of Zelen’s probability are given below:

The inverse of the above probabilities are inverted to give “1-in-” values. Thus, it can be seen that our distribution $\Phi_{N}$ gives a slightly tighter bound than Zelen, but only in the finite point case.

4.3 Bhattacharyya’s Inequality

Another extension was given by Bhattacharyya [2]:

[TABLE]

where

[TABLE]

Analogous to Zelen’s Inequality, set $\sigma=1$ and $t=a$ . To satisfy $a^{2}-a\theta_{3}-1>0$ , $N$ must be suitably large, over 10,000 for our distribution $\Phi$ . Our distribution $\Phi_{N}$ gives a much more tighter bound than Bhattacharyya’s inequality than it did for Zelen, but again, only in the finite point case. For example, when $N=10^{7}$ the probabilities are approximately 1-in-10,000.

5 Conclusion

This paper considers how big a stress shock should be, based on how many multiples of the standard deviations are required to guarantee that the shock exceeds any observation within the history set. By imposing the additional constraint of kurtosis, this lends itself to a more realistic description of asset price movements than that given by Samuelson’s inequality [8].

A distribution $\Phi_{N}$ was constructed that maximises the number of standard deviations above the mean that a single point can be for a fixed value of kurtosis. This was then used to validate several stress models, in particular that of Brace-Lauer-Rado [3], which has a primary parameter of kurtosis. Furthermore, the constructed distribution can itself be used to derive stress shocks.

Following the outline of Samuelson’s paper [8], we use our distribution to examine three extensions of Chebyshev’s Inequality where the kurtosis is known. It is found that our results give a tighter bound than the well-known inequalities, particularly that of Bhattacharyya’s Inequality [2] is not, but only in the finite point case.

6 Declarations of Interest

The authors report no conflicts of interest. The authors alone are responsible for the content and writing of the paper.

7 Appendix

In Section 2, this paper uses the distribution of a bi-modal set of points, plus one outlier. The intention is to have a distribution, which for a fixed value of kurtosis, enables the value of $a=(X_{1}-\bar{X})/\sigma$ to be large. As the bi-modal set of points has the lowest possible kurtosis, this should give the largest value of $a$ for any given kurtosis. Indeed, at first glance Samuelson’s example could be a candidate distribution that satisfies these requirements since, but the kurtosis here is $N$ .

To rigorously construct a distribution, which for a fixed value of kurtosis, maximises the value of $(X_{1}-\bar{X})/\sigma$ , is a complex optimisation problem:

[TABLE]

To give some certainty this distribution cannot be improved upon, three other distributions were considered to test if the above construction could be bettered:

$\bullet$ A tri-modal distribution, with an equal number of points are $\{-1,0,1\}$ (kurtosis of 1.5),

$\bullet$ The distribution with two-thirds of the points at 0, and the remaining third at $1$ (kurtosis of 1.5),

$\bullet$ The Uniform distribution between $-1$ and $1$ (kurtosis of 1.8).

Since the statistic $(X_{1}-\bar{X})/\sigma$ and the kurtosis are invariant under dilation and translation, the distributions are then re-scaled, and the outlier found $X_{1}>1$ using a simple search routine such that the kurtosis of the distribution is as desired.

Although each produced a value for $(X_{1}-\bar{X})/\sigma$ close to the bi-modal, it was less in all cases:

Furthermore, as remarked in Section 2, $(X_{1}-\bar{X})/\sigma$ does not differ very much for any choice of distribution. This is desirable, and gives credibility to our choice of kurtosis being a good restriction to make, as it allows for asset prices to move - and move in different ways - with some impact on the volatility, not just from the one large shock.

Bibliography11

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Basel Committee on Banking Supervision (2018) Revisions to the minimum capital requirements for market risk, https://www.bis.org/bcbs/publ/d 436.htm
2[2] Bhattacharyya, B. B. (1987). One-sided chebyshev inequality when the first four moments are known, Communications in Statistics – Theory and Methods. 16 (9):2789 – – – 2791.
3[3] Brace, A., Lauer, M., and Rado, M. (2006) A Stylised Model for Extreme Shocks: Four Moments of the Apocalypse, UTS Research Paper, https://www.researchgate.net/publication/23697087_A_Stylised_Model_for_Extreme_Shocks_Four_Moments_of_the_Apocalypse
4[4] Jensen, S. T. (1999) The Laguerre – – – Samuelson Inequality with Extensions and Applications in Statistics and Matrix Theory, M Sc Thesis, Department of Mathematics and Statistics, Mc Gill University.
5[5] Maher, D. (2011) On the use of t-copulas for economic capital calculations, Journal of Risk Model Validation, 5 (3), 21-36.
6[6] Mitzenmacher, M. and Upfal, E (2005). Probability and Computing: Randomized Algorithms and Probabilistic Analysis. Cambridge Univ. Press.
7[7] Platen, E., and Renata, R. (2008) Empirical Evidence on Student − - t Log-Returns of Diversified World Stock Indices, Journal of Statistical Theory and Practice, 2 (2), 233-251.
8[8] Samuelson, P. A. (1968) How deviant can you be?, J. Amer. Statist. Assoc., 63 , 1522-1525.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

How Big Should a Stress Shock Be?

1 Introduction

2 Incorporating Kurtosis into Samuelson’s distribution

Proposition 1**.**

3 Validation of Stress Models

3.1 Student-t Distribution

3.2 Tail factor of 7

3.3 Brace-Lauer-Rado Model

4 Chebyshev’s Inequality with Higher moments

4.1 Higher moments of even order

4.2 Zelen’s Inequality

Proposition 2**.**

4.3 Bhattacharyya’s Inequality

5 Conclusion

6 Declarations of Interest

7 Appendix

Proposition 1.

Proposition 2.