calculation worst-case Value-at-Risk prediction using empirical data   under model uncertainty

Wentao Hu

arXiv:1908.00982·q-fin.RM·August 6, 2019

calculation worst-case Value-at-Risk prediction using empirical data under model uncertainty

Wentao Hu

PDF

Open Access

TL;DR

This paper introduces a practical approach to estimate worst-case Value-at-Risk under model uncertainty using empirical data, combining change point detection and EM algorithm for financial risk analysis.

Contribution

It proposes a finite mixture model with change point detection and EM algorithm to empirically estimate worst-case VaR considering model ambiguity.

Findings

01

WVaR and BVaR differ significantly across markets.

02

The method effectively captures model uncertainty in risk estimation.

03

Empirical results demonstrate the approach's practical applicability.

Abstract

Quantification of risk positions under model uncertainty is of crucial importance from both viewpoints of external regulation and internal management. The concept of model uncertainty, sometimes also referred to as model ambiguity. Although we know the family of models, we cannot precisely decide which one to use. Given the set $P$ , the value of the risk measure $ρ$ varies in a range over the set of all possible models. The largest value in such a range is referred to as a worst-case value, and the corresponding model is called a worst scenario. Value-at-Risk(VaR) has become a very popular risk-measurement tool since it was first proposed. Naturally, WVaR(worst-case Value-at-Risk) attracts the attention of many researchers. Although many literatures investigated WVaR, the implications for empirical data analysis remain rare. In this paper, we proposed a special model…

Equations114

V a R_{α} (X) = in f {x \in R : P [X ⩽ x] > α} = F_{X}^{- 1} (α) .

V a R_{α} (X) = in f {x \in R : P [X ⩽ x] > α} = F_{X}^{- 1} (α) .

sup ρ (X), X \in P,

sup ρ (X), X \in P,

V a R_{α} (X) = - in f {x \in R : P [X ⩽ x] > α}

V a R_{α} (X) = - in f {x \in R : P [X ⩽ x] > α}

W V a R_{α} (X) = - in f {x \in R : P \in P max P [X ⩽ x] ⩾ α} .

W V a R_{α} (X) = - in f {x \in R : P \in P max P [X ⩽ x] ⩾ α} .

r_{t} = l o g (S_{t}) - l o g (S_{t - 1}), \forall S_{t} \in S, t = 1, \dots, n

r_{t} = l o g (S_{t}) - l o g (S_{t - 1}), \forall S_{t} \in S, t = 1, \dots, n

R_{1} = {r_{1}, \dots, r_{n_{1}}},

R_{1} = {r_{1}, \dots, r_{n_{1}}},

\dots

R_{t} = {r_{n_{1} + \dots + n_{t - 1} + 1}, \dots, r_{n_{1} + \dots + n_{t}}},

\dots

R_{N} = {r_{n_{1} + \dots + n_{N - 1} + 1}, \dots, r_{n_{1} + \dots + n_{N}}},

\forall r_{i}^{(1)} \in R_{1}, r_{i}^{(1)} \sim f^{1} (r) = j = 1 \sum K_{2} β_{j}^{1} p_{j} (r ∣ θ_{j}),

\forall r_{i}^{(1)} \in R_{1}, r_{i}^{(1)} \sim f^{1} (r) = j = 1 \sum K_{2} β_{j}^{1} p_{j} (r ∣ θ_{j}),

\dots

\forall r_{i}^{(t)} \in R_{t}, r_{i}^{(t)} \sim f^{t} (r) = j = 1 \sum K_{2} β_{j}^{t} p_{j} (r ∣ θ_{j}),

\dots

\forall r_{i}^{(N)} \in R_{N}, r_{i}^{(N)} \sim f^{N} (r) = j = 1 \sum K_{2} β_{j}^{N} p_{j} (r ∣ θ_{j}),

\forall r_{i}^{(t)} \in R_{t}, r_{i}^{(t)} \sim f^{t} (r) = j = 1 \sum K_{2} β_{j}^{t} p_{j} (r ∣ θ_{j}),

\forall r_{i}^{(t)} \in R_{t}, r_{i}^{(t)} \sim f^{t} (r) = j = 1 \sum K_{2} β_{j}^{t} p_{j} (r ∣ θ_{j}),

\forall r_{i}^{(s)} \in R_{s}, r_{i}^{(s)} \sim f^{s} (r) = j = 1 \sum K_{2} β_{j}^{s} p_{j} (r ∣ θ_{j}),

W V a R_{α} (X) = - in f {x \in R : max P_{p_{j}} [X ⩽ x] ⩾ α j = 1, 2, \dots, K_{2}} .

W V a R_{α} (X) = - in f {x \in R : max P_{p_{j}} [X ⩽ x] ⩾ α j = 1, 2, \dots, K_{2}} .

p_{1} (r ∣ θ_{1}) = i = 1 \sum K_{1, 1} α_{1, i} N (r ∣ μ_{1, i}, σ_{1, i}^{2}),

p_{1} (r ∣ θ_{1}) = i = 1 \sum K_{1, 1} α_{1, i} N (r ∣ μ_{1, i}, σ_{1, i}^{2}),

\dots

p_{j} (r ∣ θ_{j}) = i = 1 \sum K_{1, j} α_{j, i} N (r ∣ μ_{j, i}, σ_{j, i}^{2}),

\dots

p_{K_{2}} (r ∣ θ_{K_{2}}) = i = 1 \sum K_{1, K_{2}} α_{K_{2}, i} N (r ∣ μ_{K_{2}, i}, σ_{K_{2}, i}^{2}) .

\forall r_{s}^{(1)} \in R_{1}, r_{s}^{(1)} \sim f^{1} (r) = j = 1 \sum K_{2} β_{j}^{1} i = 1 \sum K_{1, j} α_{j, i} N (r ∣ μ_{j, i}, σ_{j, i}^{2}),

\forall r_{s}^{(1)} \in R_{1}, r_{s}^{(1)} \sim f^{1} (r) = j = 1 \sum K_{2} β_{j}^{1} i = 1 \sum K_{1, j} α_{j, i} N (r ∣ μ_{j, i}, σ_{j, i}^{2}),

\dots

\forall r_{s}^{(t)} \in R_{t}, r_{s}^{(t)} \sim f^{t} (r) = j = 1 \sum K_{2} β_{j}^{t} i = 1 \sum K_{1, j} α_{j, i} N (r ∣ μ_{j, i}, σ_{j, i}^{2}),

\dots

\forall r_{s}^{(N)} \in R_{N}, r_{s}^{(N)} \sim f^{N} (r) = j = 1 \sum K_{2} β_{j}^{N} i = 1 \sum K_{1, j} α_{j, i} N (r ∣ μ_{j, i}, σ_{j, i}^{2}),

\forall r_{s}^{(t)} \in R_{t}, r_{s}^{(t)} \sim f^{t} (r)

\forall r_{s}^{(t)} \in R_{t}, r_{s}^{(t)} \sim f^{t} (r)

=

U V a R_{α} (X) = - in f {x \in R : P \in P max P [X ⩽ x] ⩾ α},

U V a R_{α} (X) = - in f {x \in R : P \in P max P [X ⩽ x] ⩾ α},

j = 1 \sum K_{2} β_{j}^{t} i = 1 \sum K_{1, j} α_{j, i} N (r ∣ μ_{j, i}, σ_{j, i}^{2}),

j = 1 \sum K_{2} β_{j}^{t} i = 1 \sum K_{1, j} α_{j, i} N (r ∣ μ_{j, i}, σ_{j, i}^{2}),

p_{\hat{j}} (r ∣ θ_{\hat{j}}) = i = 1 \sum K_{1, \hat{j}} α_{\hat{j}, i} N (r ∣ μ_{\hat{j}, i}, σ_{\hat{j}, i}^{2}),

p_{\hat{j}} (r ∣ θ_{\hat{j}}) = i = 1 \sum K_{1, \hat{j}} α_{\hat{j}, i} N (r ∣ μ_{\hat{j}, i}, σ_{\hat{j}, i}^{2}),

U V a R_{α, p_{\hat{j}}} (X) = - in f {x \in R : P_{p_{\hat{j}}} [X ⩽ x] > α} .

U V a R_{α, p_{\hat{j}}} (X) = - in f {x \in R : P_{p_{\hat{j}}} [X ⩽ x] > α} .

j = 1 \sum K_{2} i = 1 \sum K_{1, j} γ_{j, i}^{t} N (r ∣ μ_{j, i}, σ_{j, i}^{2}), γ_{j, i}^{t} = β_{j}^{t} \cdot α_{j, i},

j = 1 \sum K_{2} i = 1 \sum K_{1, j} γ_{j, i}^{t} N (r ∣ μ_{j, i}, σ_{j, i}^{2}), γ_{j, i}^{t} = β_{j}^{t} \cdot α_{j, i},

N (r ∣ \overset{μ}{ˉ}, \overset{σ}{ˉ}^{2}) \in {N (r ∣ μ_{\hat{j}, i}, σ_{\hat{j}, i}^{2}), i = 1, \dots, K_{1, \hat{j}}},

N (r ∣ \overset{μ}{ˉ}, \overset{σ}{ˉ}^{2}) \in {N (r ∣ μ_{\hat{j}, i}, σ_{\hat{j}, i}^{2}), i = 1, \dots, K_{1, \hat{j}}},

U V a R_{α, p_{\hat{j}}} (X) ⩽ U V a R_{α, N (r ∣ \overset{ˉ}{θ})} (X) = - in f {x \in R : P_{N (r ∣ \overset{ˉ}{θ})} [X ⩽ x] > α} .

U V a R_{α, p_{\hat{j}}} (X) ⩽ U V a R_{α, N (r ∣ \overset{ˉ}{θ})} (X) = - in f {x \in R : P_{N (r ∣ \overset{ˉ}{θ})} [X ⩽ x] > α} .

P [N (r ∣ \overset{μ}{ˉ}, \overset{σ}{ˉ}^{2}) \in {N (r ∣ μ_{\hat{j}, i}, σ_{\hat{j}, i}^{2}), i = 1, \dots, K_{1, \hat{j}}}] = 0.

P [N (r ∣ \overset{μ}{ˉ}, \overset{σ}{ˉ}^{2}) \in {N (r ∣ μ_{\hat{j}, i}, σ_{\hat{j}, i}^{2}), i = 1, \dots, K_{1, \hat{j}}}] = 0.

k (y_{s}, y_{t}) = ⟨ ϕ (y_{s}) ∣ ϕ (y_{t}) ⟩_{H},

k (y_{s}, y_{t}) = ⟨ ϕ (y_{s}) ∣ ϕ (y_{t}) ⟩_{H},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFinancial Risk and Volatility Modeling · Risk and Portfolio Optimization · Statistical Methods and Inference

Full text

Calculating worst-case Value-at-Risk prediction using empirical data under model uncertainty

Wentao Hu Institute for Financial Studies and School of Mathematics, Shandong University, Jinan 250100, China

(July 31, 2019)

Abstract

Quantification of risky positions under model uncertainty is of crucial importance from both viewpoints of external regulation and internal management. The concept of model uncertainty, sometimes also referred to as model ambiguity. Although we know the the family of models, we cannot precisely decide which one to use. Given the set $\mathcal{P}$ , the value of the risk measure $\rho$ varies in a range over the set of all possible models. The largest value in such a range is referred to as a worst-case value, and the corresponding model is called a worst scenario. Value-at-Risk (VaR) has become a very popular risk-measurement tool since it was first proposed. Naturally, WVaR(worst-case Value-at-Risk) attracts the attention of many researchers. Although many literatures investigated WVaR, the implications for empirical data analysis remain rare. In this paper, we proposed a special model uncertainty market model to simply the $\mathcal{P}$ to a set contain finite number of probability distributions. The model has the structure of the two-layer mixed distribution model. We used change point detection method to divide the returns series and then used EM algorithm to estimate the parameters. Finally, we calculated VaR, WVaR(worst-case Value-at-Risk) and BVaR(best-case Value-at-Risk) for four financial markets and then analyzed their different performance.

Keywords: Value-at-Risk; worst-case; model uncertainty; empirical data; mixed distribution model

1 Introduction

With the rapid development of the financial industry and the frequent emergence of financial crisis, risk management has become an important issue faced by company managers and government regulators.Value-at-Risk (VaR) has become a very popular risk-measurement tool since it was first proposed by J.P. Morgan. Let $X$ be the loss,

[TABLE]

It has achieved the high status of being written into industry regulations. Simplicity is an important advantage. First of all, VaR is easy to understand. According to Duffie[1], VaR can be defined as For a given time horizon $T$ and a confidence level $\alpha$ , VaR is the loss in market value that can only be exceeded with a probability of at most $1-\alpha$ . That is “We are $\alpha\%$ certain that we will not lose more than $V$ dollars in the next $N$ days.” VaR is simply the $\alpha$ percentile of the loss distribution. Using a single number to describe complex financial risks can make the measurement of risk simple and intuitive. In addition, Kupiec[2], Christoffersen[3] and Hull[4] indicated that VaR is easy to calculate and get back-test. However, Lo[5] and Leskow[6] claimed that VaR suffers from being unstable and difficult to work with numerically when losses are not “normally” distributed–which in fact is often the case, because loss distributions tend to exhibit ”fat-tails”. Thus, Zangari[7] and Venkataraman[8] pointed out that VaR under normal assumption will lead to serious underestimation of the risk of extreme losses.

Many researchers made efforts to solve this problem. Huschens[9], Glasserman[10] and Lin[11] used multivariate t-distribution to amend the original assumption. Li[12], Wilhelmsson[13], Su[14] and Gabrielsen[15] used statistics such as volatility, skewness and kurtosis to capture the extreme tail. Haas[16] and Gebizlioglu[17] used Weibull distribution. Zangari[7], Venkataraman[8] and Hull[18] estimated VaR with Gaussian mixture. Zhang[19] uses an EM algorithm based on KS test to determine the number of component distributions. Billio[20] and Kawata[21] used switching volatility model to describe the independence of data. For an up-to-date account of relative literature, some recent review papers Kuester[22], Jorion[23], Abad[24] and Zhang[25] can be referred.

However, quantification of risky positions held by a financial institution under model uncertainty is of crucial importance from both viewpoints of external regulation and internal management. Given some sources of uncertainty a Bayesian methodology, O’Hagan[26], Bernardo[27] and Alexander[28] assumed that VaR is described in terms of a set of unknown parameters. Bayesian estimates may be derived from posterior parameter densities and posterior model probabilities which are obtained from the prior densities via Bayes theorem, assuming that both the model and its parameters are uncertain. The problem has also been studied from a non-Bayesian point of view, such as Modarres[29], Giorgi[30] and Figlewski[31]. As for VaR model, Alexander[28] shared ideas with the Bayesian approach. Jorion[32] and Talay[33] investigated sampling error. However, we remark that there is no consensus on the sources of model risk. The concept of model uncertainty, sometimes also referred to as model ambiguity. It was argued that data follow not a single distribution but rather a family of distributions. Although we know the the family of models, we cannot precisely decide which one to use. Cont[34] quantified the model risk of a complex product by the range of prices obtained under all possible valuation models. Many literatures studied the worst-case value of risk measures with given partial information. Very often, the problem of interest is of the following type: to find

[TABLE]

where $\rho$ is a risk measure, and the set $\mathcal{P}$ is a class of random variables with some given partial distributional information. Given the set $\mathcal{P}$ , the value of the risk measure $\rho$ varies in a range over the set of all possible models. The largest value in such a range is referred to as a worst-case value, and the corresponding model is called a worst scenario. An early source is Royden[35]. Kass[36], Schepper[37], Popescu[38], He[39] and many other literatures calculated the bound of $P(X\leqslant x)$ under partial information. Peng[40, 41, 42, 43] proposed sublinear expectation. The central concept of sublinear expectation theory is a family of distributions inherent in the data series. Artzner[44] proposed the concept of coherent risk measures, which can be viewed as a special instance of sublinear expectation. Although many literatures investigated worst-case value of risk measures, the implications for real data analysis remain rare. Peng[45] proposed a G-VaR based on sublinear expectation, which is specifically for the condition of variance uncertainty. Besides, we did not find results of calculating the worst-case Value-at-Risk from the empirical data.

In this paper, we proposed a special model uncertainty market model to simply the $\mathcal{P}$ to a set contain finite number of probability distributions. We divide $\mathbb{R}$ into $N$ segments, and in every subset $\mathbb{R}_{t}$ , data is i.i.d.. The model uncertainty market model used the structure of the two-layer mixed distribution model. The first-layer components are Gaussian mixture distributions. Different component distributions correspond to different market factors. The weights of components represent the probability of the occurrence of market factors. We don’t care about and unable to model the probability of the occurrence of components. Therefore, when we predict the distribution of future data, we can only know the whole of the component distributions (i.e. market factors) from which the data can be generated, but we cannot accurately know the weights of the components (i.e. the exact probability of any market factor occurring). The second-layer components have no financial meaning and they are just parts of a numerical stimulate method. The remainder of the paper is organized as follows. Section 2 briefly reviews the WVaR(worst-case Value-at-Risk) of return series. In Section 3, we propose a special market model to describe model uncertainty. Section 4 reports the method of returns series segmentation. Section 5 present the method of parameters estimation. In Section 6, we show the empirical results of the WVaR for four financial markets. Finally, Section 7, concludes.

2 WVaR (worst-case Value-at-Risk)

First we consider the certain condition. Let $X$ be the returns of financial assets, it is a random variable in probability space $(\Omega,\mathcal{F},P)$ .

Definition 2.1 (Value at risk)

Given confident level $\alpha\in[0,1]$ , the value at risk $VaR_{\alpha}$ at level $\alpha$ of $X$ with distribution $P$ is

[TABLE]

In uncertain situation, we have a set of finitely additive probabilities $\mathcal{P}$ and cannot decide precisely which probability $X$ should obey. Therefore, we have WVaR(worst-case Value-at-Risk) under uncertain conditions.

Definition 2.2

(WVaR) Give a real number $\alpha\in[0,1]$ , the $WVaR_{\alpha}$ at level $\alpha$ of $X$ with a set of finitely additive probabilities $\mathcal{P}$ is

[TABLE]

$WVaR_{\alpha}(X)$ only care about the “worst” distribution, that is, the distribution of the greatest loss. This “worst case” corresponds to the “worst” scenario for financial assets.

In the definition of $WVaR$ , the meaning of $\alpha$ has changed compared with $VaR$ . Under certain conditions, $\alpha$ is a confidence level, and $VaR$ is a quantile under $\alpha$ . Under uncertain conditions, we cannot decide the probability precisely, therefore, we cannot find the quantile precisely. Under this condition, $\alpha$ becomes a ‘conditional’ confidence level. that is, in the case of a ‘worst’ financial scenario, the probability that the value of asset $X$ is not less than $WVaR_{\alpha}(X)$ is $\alpha$ . This change has led to a change in the criteria for evaluating risk measures. As for $VaR$ , we can test the accuracy of method by observing whether the ratio of return to break through $VaR_{\alpha}(X)$ equals $\alpha$ . For $WVaR$ , this method is obviously no longer applicable.

Moreover, although many literatures, such as Kass[36], Schepper[37], Popescu[38] and He[39] studied the properties of $WVaR$ , there are rare results about the calculation. One important reason is, in uncertain conditions, it’s difficult to estimate the parameters of the distributions in the set of finitely additive probabilities $\mathcal{P}$ . In the following paper, we proposed a special market model to simply the $\mathcal{P}$ to a set contain finite number of probability distributions.

3 Model uncertainty market model

Let $S_{t}\in\mathcal{R}$ be the financial asset prices series and $\mathbb{S}=\{S_{0},S_{1},\cdots,S_{n}\}$ . Define returns of the assets as

[TABLE]

we have financial asset returns series $\mathbb{R}=\{r_{1},r_{2},\cdots,r_{n}\}$ . Suppose that the returns series are independent data and have identical distribution in a short period. We can divide $\mathbb{R}$ into $N$ segments, every segment has $n_{t},\ t=1,\cdots,N$ data:

[TABLE]

where $\sum_{i=1}^{N}n_{i}=N$ . Given a subset $\mathbb{R}_{t},\ \forall r_{i}^{(t)}\in\mathbb{R}_{t}$ are independent and identically distributed(i.i.d.). For every subset $\mathbb{R}_{t},\ t=1,\cdots,N$ , we assume that returns are generated from mixture distributions:

[TABLE]

where $p_{j}(x|\theta_{j})$ is component distribution, $K_{2}$ is number of component distributions, $\theta_{j}$ is the parameters of $j^{th}$ component distribution.

Intuitively, the return $r_{i}$ can be drawn from one of $K_{2}$ component distributions $p_{j}(x|\theta_{j})$ with the probability $\beta^{t}$ . This means that the distribution of return $r_{i}$ is the result of the comprehensive effect of different market factors, where $K_{2}$ component distributions $p_{j}(x|\theta_{j})$ represents $K_{2}$ market factors and $\beta^{t}_{j}$ represents the probability of the occurrence of $j^{th}$ market factor. For these financial meanings, our method shares ideas with Zangari[7], Venkataraman[8], Hull[18], Zhang[19], Billio[20] and Kawata[21]. Within one subset $\mathbb{R}_{t}$ , returns are identically distributed. This setting corresponds to a reasonable assumption that market environment remains static in a short or relative long period.

However, with time goes by, the change of market environment causes the probability of the occurrence of market factors changes. Such kind of change, we assume that, is difficult or even impossible to model because of the complexity of market. On the contrary, the number and type of market factors will not change. That is to say that the change in the distribution of returns is the result of an interaction and trade-off between different market factors, rather than a dramatic change in the number and type of market factors. Therefore, between two different periods $\mathbb{R}_{t},\ \mathbb{R}_{s}$ , the distributions of returns $r_{i}$ changes but the component distributions are unchanged:

[TABLE]

Although Zangari[7], Venkataraman[8], Hull[18] and Zhang[19] used mixture distribution model, the probability of the occurrence of components is unchanged. Therefore they are certainty probability models. Billio[20] and Kawata[21] modeled return series using HMM(Hidden Markov Model). The HMM model gives the components a certain probability of occurrence by transition probability, although the probability of occurrence is changed at any time. Hence they are also certainty probability models. Different from the above methods, we don’t care about and unable to model the probability of the occurrence of components. Therefore, when we predict the distribution of future data, we can only know the whole of the component distributions (i.e. market factors) from which the data can be generated, but we cannot accurately know the weights of the components (i.e. the exact probability of any market factor occurring). That is to say, there is model uncertainty.

Therefore, the WVaR under uncertain conditions is:

[TABLE]

Different from the GMM(Gaussian mixture model) used by Zangari[7], Venkataraman[8], Hull[18] and Zhang[19], In this paper, the component distributions $p_{j}(r|\theta_{j})$ are also mixture distributions:

[TABLE]

Therefore, $r_{s}$ are generated from a two-layer Gaussian mixture model:

[TABLE]

Admittedly the mixture model with components as Gaussian mixture distributions can be rewritten as a general Gaussian mixture model, for example:

[TABLE]

but the first-layer components $p_{j}(r|\theta_{j})$ and the second-layer components $N(r|\mu_{j,i},\sigma_{j,i}^{2})$ have distinct different meanings: $p_{j}(r|\theta_{j})$ represents $j^{th}$ market factor. But $N(r|\mu_{j,i},\sigma_{j,i}^{2})$ not have any financial meaning, it is just a part of a numerical stimulate method. This difference is particularly important in calculating UVaR. Reviewing

[TABLE]

what we want to do is to find out the worst distribution corresponded to the worst scenario. Using a one-layer Gaussian mixture model with more components rather than a two-layer Gaussian mixture model will make the UVaR overestimated. For example, for a two-layer Gaussian mixture model:

[TABLE]

suppose that the worst distribution is:

[TABLE]

then, UVaR is:

[TABLE]

But for a one-layer Gaussian mixture model with more components

[TABLE]

if the “worst” distribution:

[TABLE]

then,

[TABLE]

Moreover, for different returns data, we obviously can’t guarantee that:

[TABLE]

Therefore, using two-layer Gaussian mixture model is reasonable.

4 Returns series segmentation

As we mentioned in Section 3, we need to divide $\mathbb{R}$ into $N$ segments. Within one subset $\mathbb{R}_{t}$ , $\forall r_{i}^{(t)}\in\mathbb{R}_{t}$ are independent and identically distributed(i.i.d.). In this paper, we use Kernel-based detection method to divide $\mathbb{R}$ . A kernel-based method has been proposed by Harchaoui[46] to perform change point detection in a non-parametric setting. Truong[47] gave a good review about the relative methods. As described by Truong[47]: to that end, the original series $\mathbb{y}=\{y_{1},y_{2},\cdots,y_{T}\}$ is mapped onto a reproducing Hilbert space (rkhs) $\mathcal{H}$ associated with a user-defined kernel function $k(\cdot,\cdot):\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ . The mapping function $\phi:\mathbb{R}\to\mathcal{H}$ onto this rkhs is implicitly defined by $\phi(y_{t})=k(y_{t},\cdot)\in\mathcal{H}$ , resulting in the following inner-product and norm:

[TABLE]

for any samples $y_{s},y_{t}\in\mathbb{R}^{d}$ . The associated cost function, denoted $c_{kernel}$ , is defined as follows.

[TABLE]

where $y_{a\cdots b}=\{y_{t}\}_{t=a+1}^{b}$ , $\bar{\mu}_{a\cdots b}\in\mathcal{H}$ is the empirical mean of the series $\{\phi(y_{t})\}_{t=a+1}^{b}$ . Indeed, after simple algebraic manipulations, $c_{kernel}(y_{a\cdots b})$ can be rewritten as follows:

[TABLE]

The cost function $c_{kernel}$ can be combined with any kernel to accommodate various types of data. In this paper, we use the Gaussian kernel:

[TABLE]

with $x,y\in\mathbb{R}^{d}$ and $\gamma>0$ is the so-called bandwidth parameter. The associated cost function, denoted $c_{rbf}$ , is defined as follows:

[TABLE]

where $\gamma>0$ is the so-called bandwidth parameter.

As described by Truong[47]: Denote $\mathcal{T}=\{t_{1},t_{2},\cdots\}\subset\{1,\cdots,T\}$ and

[TABLE]

where $c(\cdot)$ is a cost function. The change point detection problem with an unknown number of change points consists in solving the following discrete optimization problem

[TABLE]

where $pen(\mathcal{T})$ is an appropriate measure of the complexity of a segmentation $\mathcal{T}$ . Truong[47] also introduce several penalty functions. In this paper, we use linear penalties which are linear functions of the number of change points, meaning that:

[TABLE]

where $\beta>0$ is a smoothing parameter, and $|\mathcal{T}|$ is cardinal of $\mathcal{T}$ . In this paper, we used $ruptures$ , which is a Python scientific library provided by Truong[48], to divide $\mathbb{R}$ .

5 Model parameters estimating

In this paper, we use EM algorithm to estimate the parameters. For simplicity, let $K_{1,1}=K_{1,2}=\cdots=K_{1,K_{2}}=K_{1}$ . Denote the latent variable

[TABLE]

where $j=1,2,\cdots,K_{2}$ and $i=1,2,\cdots,K_{1}$ . Denote

[TABLE]

where $t=1,\cdots,N$ . Because the observable variable is $\{r_{s}\}_{s=1}^{n}$ , the complete variable is

[TABLE]

Then, given a subset $\mathbb{R}_{t}$ , the likelihood function of complete variable is

[TABLE]

Denote

[TABLE]

then we have

[TABLE]

Therefore,

[TABLE]

For the whole series $\mathbb{R}$ , the likelihood function of complete variable is

[TABLE]

The log-likelihood function is

[TABLE]

E Step:

Denote $\theta^{(i)}$ is the parameter obtained in the $i^{th}$ iteration, and

[TABLE]

Then

[TABLE]

We can find that, the parameter $\hat{{\eta}}_{s,j,i}$ should contain $t$ . So we have $\hat{{\eta}}_{s,j,i}=\hat{{\eta}}_{s,j,i}^{(t)}$ . Denote

[TABLE]

we have

[TABLE]

M Step:

We need to find $\theta^{(i+1)}$ satisfies

[TABLE]

We have the iterative formulas

[TABLE]

6 Empirical computation

We consider two financial markets i.e. Chinese(000001.SH from 1999 to 2018) and American(SPX.GI from 1999 to 2018) securities market. First, we use Kernel-based detection method to divide losses series. The smooth parameter $\beta=2.5$ . Then we use EM algorithm estimate the parameters. Let $K_{2}=5,K_{1}=3$ . Finally we calculate WVaR and VaR with tolerance level $\alpha=95\%$ . To compare the differences between the two markets, we also calculate the BVaR (best-case Value-at-Risk) :

Definition 6.1

(BVaR) Give a real number $\alpha\in[0,1]$ , the $BVaR_{\alpha}$ at level $\alpha$ of $X$ with a set of finitely additive probabilities $\mathcal{P}$ is

[TABLE]

Obviously, BVaR is Value-at-Risk for the best scenario.

Figure 1 shows that, for Chinese(000001.SH) financial market, we find $17$ change points. $WVaR_{SH}=5.89\%$ , $VaR_{SH}=2.59\%$ and $BVaR_{SH}=0.44\%$ . Figure 2 shows that, for American(SPX.GI) financial market, we find $17$ change points. $WVaR_{SP}=6.18\%$ , $VaR_{SP}=1.97\%$ and $BVaR_{SP}=0.42\%$ .

Firstly, for the results of losses series segmentation, although two markets both have $17$ change points, the indexes of these points are significant different. For Chinese markets, the occurrence of change points is more uniform. This means that no market condition will last for a long time. But in American markets something is different. The occurrence of change points is more concentrated, and concentrated in the period of high volatility. This means that some “good” or “moderate” market conditions will last for a relative long time, but in the period of high volatility market condition shifts frequently.

Secondly, from the perspective of risk measures, $WVaR_{SP}>WVaR_{SH}$ which means that American markets has more sever worst-case than Chinese market. American markets and Chinese market have similar best-case because $BVaR_{SH}=0.44\%$ is very near to $BVaR_{SP}=0.42\%$ . The interesting thing is, although American markets has more sever worst-case, Chinese market has higher VaR value: $VaR_{SH}=2.59\%>VaR_{SP}=1.97\%$ . This fact indicates that, firstly, i.i.d. hypothesis is indeed inappropriate when measuring tail risk. Secondly, for the set of the distributions $\mathcal{P}$ generated by different market factors, the elements of $\mathcal{P}$ of Chinese markets are more concentrated and similar. Yet the elements of $\mathcal{P}$ of American markets tend to perform greater differences.In general, the two markets present different risk characteristics.

In addition, we present the results of Japanese markets (N225.GI from 1999 to 2018) Figure 3 and Germany markets (GDAXI.GI from 1999 to 2018) Figure 4. But the analysis of the relevant results is not repeated.

7 Conclusion

With the rapid development of the financial industry and the frequent emergence of financial crisis, risk management has become an important issue faced by company managers and government regulators. Nowadays, quantification of risky positions held by a financial institution under model uncertainty is of crucial importance from both viewpoints of external regulation and internal management. However, there is no consensus on the sources of model risk. The concept of model uncertainty, sometimes also referred to as model ambiguity. It was argued that data follow not a single distribution but rather a family of distributions. Although we know the the family of models, we cannot precisely decide which one to use. Given the set $\mathcal{P}$ , the value of the risk measure $\rho$ varies in a range over the set of all possible models. The largest value in such a range is referred to as a worst-case value, and the corresponding model is called a worst scenario. Although many literatures investigated worst-case Value-at-Risk measures, the implications for empirical data analysis remain rare.

In this paper, we proposed a special model uncertainty market model to simply the $\mathcal{P}$ to a set contain finite number of probability distributions. Suppose that the returns series are independent data and have identical distribution in a short period. We can divide $\mathbb{R}$ into $N$ segments, and in every subset $\mathbb{R}_{t}$ , data is i.i.d.. The model uncertainty market model used the structure of the two-layer mixed distribution model. The first-layer components are Gaussian mixture distributions. Different component distributions correspond to different market factors. The weights of components represent the probability of the occurrence of market factors. We don’t care about and unable to model the probability of the occurrence of components. Therefore, when we predict the distribution of future data, we can only know the whole of the component distributions (i.e. market factors) from which the data can be generated, but we cannot accurately know the weights of the components (i.e. the exact probability of any market factor occurring). That is to say, there is model uncertainty. The second-layer components are Gaussian distributions. Actually, the second-layer components have no financial meaning. They are just parts of a numerical stimulate method. For empirical data, firstly we used change point detection method to divide the returns series and then used EM algorithm to estimate the parameters. We calculated VaR, WVaR(worst-case Value-at-Risk) and BVaR(best-case Value-at-Risk) for four financial markets and then analyzed their different performance.

Bibliography48

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Duffie D and Pan J. An overview of value at risk. Journal of Derivatives , 4(3):7–49, 1997.
2[2] Kupiec P. Techniques for verifying the accuracy of risk measurement models. Journal of Derivatives , 2(2):73–84, 1995.
3[3] Christoffersen P-F. Evaluating interval forecasts. International Economic Review , 39(4):841–862, 1998.
4[4] Hull J. Risk Management and Financial Institutions (3rd Edition) . John Wiley & Sons,, 2012.
5[5] Andrew W Lo and A. Craig Mackinlay. Stock market prices do not follow random walks: Evidence from a simple specification test. Review of Financial Studies , 1(1):41–66, 1988.
6[6] Leskow J. The impact of stationarity assessment on studies of volatility and value-at-risk. Mathematical & Computer Modelling , 34(9):1213–1222, 2001.
7[7] Zangari P. An improved methodology for measuring var. Riskmetrics Monitor , 1996.
8[8] Venkataraman S. Value at risk for a mixture of normal distributions: the use of quasi-bayesian estimation techniques. Economic Perspectives , 21(2):2–13, 1997.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Calculating worst-case Value-at-Risk prediction using empirical data under model uncertainty

Abstract

1 Introduction

2 WVaR (worst-case Value-at-Risk)

Definition 2.1** (Value at risk)**

Definition 2.2

3 Model uncertainty market model

4 Returns series segmentation

5 Model parameters estimating

6 Empirical computation

Definition 6.1

7 Conclusion

Definition 2.1 (Value at risk)