Estimation of Risk Contributions with MCMC
Takaaki Koike, Mihoko Minami

TL;DR
This paper introduces a Markov Chain Monte Carlo (MCMC) based estimator for risk contributions in financial portfolios, improving efficiency and accuracy over traditional methods, especially in high-dimensional, rare-event scenarios.
Contribution
The paper presents a novel MH algorithm-based estimator for VaR contributions that is more sample-efficient and accurate than existing estimators, applicable to complex, high-dimensional risk models.
Findings
MH estimator has smaller bias and mean squared error than existing methods.
The method is consistent and asymptotically normal.
Effective in high-dimensional, inhomogeneous risk models.
Abstract
Determining risk contributions of unit exposures to portfolio-wide economic capital is an important task in financial risk management. Computing risk contributions involves difficulties caused by rare-event simulations. In this study, we address the problem of estimating risk contributions when the total risk is measured by value-at-risk (VaR). Our proposed estimator of VaR contributions is based on the Metropolis-Hasting (MH) algorithm, which is one of the most prevalent Markov chain Monte Carlo (MCMC) methods. Unlike existing estimators, our MH-based estimator consists of samples from conditional loss distribution given a rare event of interest. This feature enhances sample efficiency compared with the crude Monte Carlo method. Moreover, our method has the consistency and asymptotic normality, and is widely applicable to various risk models having joint loss density. Our numerical…
| Estimate of AC (Bias): | Standard error (): | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Estimator | |||||||||
| (1) | Pareto + survival Clayton: True AC = (10.708, 10.708, 10.708) | ||||||||
| AC1 | 10.575 | 11.744 | 10.745 | 10.708 | 0.173 | 0.008 | 0.019 | ||
| (-0.133) | (1.036) | (0.037) | (0.000) | (0.218) | (0.038) | (0.019) | |||
| AC2 | 10.138 | 10.547 | 10.635 | 10.724 | 0.169 | 0.008 | 0.020 | ||
| (-0.571) | (-0.161) | (-0.074) | (0.016) | (0.595) | (0.074) | (0.025) | |||
| AC3 | 10.389 | 9.813 | 10.745 | 10.693 | 0.178 | 0.008 | 0.018 | ||
| (-0.320) | (-0.896) | (0.037) | (-0.016) | (0.366) | (0.038) | (0.024) | |||
| (2) | Pareto + -copula: | ||||||||
| AC1 | 6.835 | 8.162 | 7.697 | 7.339 | 0.238 | 0.010 | 0.041 | ||
| (-0.362) | (0.964) | (0.499) | (-0.121) | (0.433) | (0.499) | (0.132) | |||
| AC2 | 8.785 | 8.355 | 8.740 | 8.765 | 0.223 | 0.010 | 0.028 | ||
| (-0.122) | (-0.553) | (-0.167) | (-0.023) | (0.255) | (0.168) | (0.046) | |||
| AC3 | 11.913 | 11.781 | 11.875 | 12.208 | 0.134 | 0.006 | 0.024 | ||
| (-0.293) | (-0.426) | (-0.332) | (0.144) | (0.322) | (0.332) | (0.148) | |||
| (3) | Student’s + survival Clayton: True AC = (5.647, 5.647, 5.647) | ||||||||
| AC1 | 5.592 | 5.693 | 5.662 | 5.617 | 0.081 | 0.006 | 0.018 | ||
| (-0.055) | (0.046) | (0.015) | (-0.029) | (0.098) | (0.016) | (0.034) | |||
| AC2 | 5.410 | 5.722 | 5.642 | 5.665 | 0.079 | 0.006 | 0.019 | ||
| (-0.236) | (0.076) | (-0.005) | (0.018) | (0.249) | (0.007) | (0.026) | |||
| AC3 | 5.473 | 5.517 | 5.636 | 5.658 | 0.082 | 0.006 | 0.018 | ||
| (-0.173) | (-0.130) | (-0.011) | (0.011) | (0.192) | (0.012) | (0.021) | |||
| (4) | Student’s + -copula: True AC = (2.996, 3.745, 6.741) | ||||||||
| AC1 | 2.821 | 3.065 | 2.997 | 2.940 | 0.117 | 0.007 | 0.036 | ||
| (-0.176) | (0.069) | (0.001) | (-0.056) | (0.211) | (0.007) | (0.067) | |||
| AC2 | 3.772 | 3.560 | 3.742 | 3.792 | 0.109 | 0.006 | 0.033 | ||
| (0.027) | (-0.185) | (-0.004) | (0.047) | (0.112) | (0.007) | (0.057) | |||
| AC3 | 6.564 | 6.852 | 6.745 | 6.751 | 0.043 | 0.002 | 0.011 | ||
| (-0.178) | (0.110) | (0.003) | (0.010) | (0.183) | (0.004) | (0.015) | |||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Bayesian Methods and Mixture Models · Statistical Methods and Inference
Estimation of risk contributions with MCMC
TAKAAKI KOIKE*∗* and MIHOKO MINAMI ∗Corresponding author. Email: [email protected] Department of Statistics and Actuarial Science, University of Waterloo, 200 University Avenue West, Waterloo, ON, N2L 3G1, Canada.
Department of Mathematics, Keio University, Hiyoshi, Kohoku-ku, Yokohama, Kanagawa, 3-14-1, Japan.
Abstract
Determining risk contributions of unit exposures to portfolio-wide economic capital is an important task in financial risk management. Computing risk contributions involves difficulties caused by rare-event simulations. In this study, we address the problem of estimating risk contributions when the total risk is measured by value-at-risk (VaR). Our proposed estimator of VaR contributions is based on the Metropolis-Hasting (MH) algorithm, which is one of the most prevalent Markov chain Monte Carlo (MCMC) methods. Unlike existing estimators, our MH-based estimator consists of samples from conditional loss distribution given a rare event of interest. This feature enhances sample efficiency compared with the crude Monte Carlo method. Moreover, our method has the consistency and asymptotic normality, and is widely applicable to various risk models having joint loss density. Our numerical experiments based on simulation and real-world data demonstrate that in various risk models, even those having high-dimensional () inhomogeneous margins, our MH estimator has smaller bias and mean squared error compared with existing estimators.
keywords:
Value-at-risk; Risk allocation; Risk contributions; VaR contributions; Copulas; Markov chain Monte Carlo; Metropolis-Hastings algorithm
{classcode}
C58, C63
1 Introduction
In most financial institutions, the risk of their portfolios is measured by economic capital. Capital allocation is an important risk analysis, where the economic capital is decomposed into a sum of risk contributions of unit exposures; see, for example, Dev (2004). The Euler principle, proposed in Tasche (1995), is one of the most well-known rules of risk allocation. It is economically justified, for example, in Denault (2001) and Tasche (1995, 2008)
On the other hand, calculating risk contributions poses theoretical and numerical difficulties, especially when the portfolio-wide risk is measured by value-at-risk (VaR). Although a simple formula of VaR contributions is derived by Tasche (2001), it can rarely be calculated analytically without a few exceptions, for example, in Tasche (2004). As is seen in Fan et al. (2012) and Yamai and Yoshiba (2002), the crude Monte Carlo (MC) method is the simplest method of computing risk contributions. However, the MC estimator suffers from unignorable bias caused by sample inefficiency and by inevitable numerical modification; see, for instance, Yamai and Yoshiba (2002). To overcome such difficulties, several methods have been proposed in the literature. For instance, Hallerbach (2003) and Tasche and Tibiletti (2004) derived approximation formulas by regarding VaR contributions as the best predictor of individual losses given total loss. In this paper, we call this estimator the generalized regression (GR) estimator. Glasserman (2005) developed importance sampling (IS) estimators with their main focus on credit portfolios.
Finally, Tasche (2009) proposed the Nadaraya-Watson (NW) estimator, which is based on the kernel estimation method. Despite its ease of calculation, it still requires importance sampling to achieve an efficient estimation.
In this paper, we propose a new method of estimating VaR contributions that utilizes the Markov chain Monte Carlo (MCMC), especially the Metropolis-Hastings (MH) algorithm (Metropolis et al., 1953; Hastings, 1970). Our MH method requires joint loss density which can be evaluated at each point. This is often the case when losses are modelled separately by marginal distributions and a copula; see Yoshiba (2013) for various examples. . To the best of our knowledge, no stable estimator of VaR contributions is known for general risk models. We study the consistency and asymptotic normality of our MH estimator, and provide practical guidelines for the efficient application of the MCMC method to the problem of computing VaR contributions. The proposed method is then carried out for various risk models based on simulations and real-world data. In numerical experiments, we compare the performance of the MH estimator with other existing estimators.
The foremost difference between our MH method and the crude MC is that in the former, samples are generated directly from the joint loss distribution given a rare event of interest. In contrast, the MC method generates samples from the unconditional loss distribution, which makes it inevitable to waste a large portion of samples;
This paper is organized as follows. Section 2 introduces the mathematical setting of the capital allocation problem and explains challenges on estimating VaR contributions with the existing estimators. Section 3 provides a brief introduction to the MCMC method and various MH algorithms. In section 4, we propose the MH estimator that combines the MH method with the estimation of VaR contributions. Next, in section 5, numerical studies are conducted based on simulation and real-world data. We demonstrate that for various risk models with marginal- and dependence-inhomogeneity and/or high-dimensionality, the MH estimator has smaller bias and mean squared error (MSE) than those of existing estimators. For applying our method to other risk models not presented in this paper, practical guidelines on the usage of the MH method are also provided. Concluding remarks and discussions are given in section 6. Based on the theory of MCMC, the consistency and asymptotic normality of our estimator are derived in appendix A
2 Capital allocation problem
Throughout this paper, the aggregate loss
[TABLE]
is considered, where is the size of the portfolio, and are random variables on an atomless probability space that represent the losses incurred by exposures within a fixed time period. In this study, a positive value of a loss random variable represents a financial loss, and a negative loss is interpreted as a profit. Let be the joint distribution function (df) of with margins , and let be the df of the total loss . Assume that and have densities with marginal densities and , respectively. According to Sklar’s theorem (see, for example, Nelsen, 2006), it holds that
[TABLE]
where is called a copula of . The density can be written by
[TABLE]
where denotes the density of .
As mentioned in section 1, computing risk contributions is an important task in risk management. A standard procedure of determining risk contributions involves two steps. The first step is to compute the economic capital for a risk measure . Risk measures map a loss random variable to a capital buffer that is required to cover the loss over a predetermined period such as one year or two weeks. One of the most popular risk measures is the VaR defined by where is called the confidence level. Another popular measure is the expected shortfall (ES) defined by for . The second step is to allocate the capital to -exposures. Mathematically, capital allocation addresses the problem of determining the vector of allocated capitals that satisfies the full allocation property
[TABLE]
The Euler principle derives such AC’s by utilizing the well-known Euler rule for a function :
[TABLE]
where is an open set such that , and is positive homogeneous, that is, for . For
[TABLE]
the full allocation property (2) holds for the vector (ACAC) by taking in equation (3).
Since VaRp is positive homogeneous, the Euler principle can be applied, and the corresponding risk contributions are given by
[TABLE]
We call the vector the VaR contributions. Since we mainly focus on this form of allocated capital in this study, we drop the superscript VaRp and write (5) as AC(). Note that other forms of allocated capitals are also possible; for example, when the risk measure is ES, the ES contribution is derived as
[TABLE]
by positive homogeneity of ES; see Tasche (2001) for derivations of the last equalities in (5) and (6).
Even when the joint density of the portfolio loss vector is given explicitly, the analytical computation of AC is not straightforward since it often requires the joint distribution of , which is in general difficult to derive.
A possible numerical method to calculate VaR contributions is the crude MC method, in which the pseudo VaR contribution
[TABLE]
is computed for a sufficiently small bandwidth . Since the probability is positive, the right hand side of (7) can be written as
[TABLE]
This expression allows one to construct the estimator of the pseudo VaR contributions given by
[TABLE]
where is the sample size; are independent and identically distributed (i.i.d.) samples from ; are i.i.d. samples from for ; and is the number of samples contained in . We call the MC estimator. By setting and as sufficiently small and large, respectively, one can expect that the MC estimator approximates the true VaR contributions. Note that this method is available only when is positive, since by continuity of .
As long as the i.i.d. samples from can be generated, one can estimate ACδ by constructing the estimator (8). However, this estimator suffers from an inevitable bias The bias of the MC estimator can be decomposed by
[TABLE]
where and . should be taken as small as possible to reduce . However, when is quite small, it is difficult to ensure a large enough sample size to keep the first term small since , and is typically much less than .
To overcome this problem, several estimators have been proposed in the literature. First, Second, the NW kernel estimator proposed in Tasche (2009) is defined by
[TABLE]
where is the kernel density and is the bandwidth. Since this estimator can be interpreted as a smoothing modification of the MC estimator (8) by kernel , it shares the same bias trade-off explained above. Furthermore, the bias and asymptotic standard deviation of the NW estimator (see, for example, Hansen, 2009) cannot be computed easily because they require an evaluation of the total loss density at . Finally, Hallerbach (2003) and Tasche and Tibiletti (2004) constructed estimators by assuming a regression model among the losses of the form:
[TABLE]
where is a function parameterized by , and is an error random vector such that . For an estimator of , we call the following estimator the GR estimator:
[TABLE]
Although this estimator is intuitive and can easily be computed, it is in general difficult to construct an appropriate model and estimator of , unless samples from are available. A notable exception is the case wherein follows an elliptical distribution. In this case, the following result holds:
[TABLE]
see, for example, McNeil et al. (2015). The true VaR contributions are then provided by setting , where
[TABLE]
Since these coefficients are the minimizers of , the OLS estimators of are calculated based on the unconditional samples of and converges to the true parameters (12) as .
3 MCMC estimators
As seen in section 2, the essential problem in estimating VaR contributions is that the conditional samples from are unavailable. To solve this problem, we introduce the MCMC method wherein a given distribution is simulated by constructing a Markov chain whose stationary distribution is the desired one. By allowing Markovian-type dependence within the samples, the MCMC allows us to simulate a wide variety of distributions. In this section, we briefly review MCMC, especially the Metropolis-Hastings algorithm as a major subclass of MCMC methods.
3.1 A brief introduction to MCMC
Let be a set and be a -algebra on . A Markov chain is a sequence of -valued random variables satisfying the Markov property;
[TABLE]
for all , , and . A Markov chain is characterized by its stochastic kernel , given by . If there exists a probability distribution such that for any and , then is called the stationary distribution. See, for example, Nummelin (2004) for the general theory of Markov chain.
The MCMC method is widely used for simulating a distribution by generating a Markov chain with the given distribution as a stationary distribution . For some distribution and -measurable vector-valued function on , our estimand is denoted as
[TABLE]
The MCMC estimator of (13) is given by
[TABLE]
where is a sample path from time to (we call it an -path) of a Markov chain whose stationary distribution is . The distribution is called the target distribution. Since it is determined by the problem at hand, the problem is to find a stochastic kernel such that it has the stationary distribution , and sample paths of its Markov chain can easily be generated.
One of the most popular stochastic kernels is the MH kernel defined by
[TABLE]
where
[TABLE]
is the Dirac delta function; is a function such that is measurable for any ; and is a probability density for any . This function is called a proposal density. It can be shown that the MH kernel has stationary distribution ; see Tierney (1994). Under the three conditions (i)–(iii) where (i) at least one vector is known, where ; (ii) samples from can be generated for any ; and (iii) the ratio can be calculated for any , we can generate an -path of the desired Markov chain by the following MH algorithm:
Algorithm 1: (MH algorithm)
Fix a sample size , proposal density , and initial value . 2. 2.
For , do: 3. 3.
Generate and . 4. 4.
Set
[TABLE] 5. 5.
Set
[TABLE] 6. 6.
Return .
We call in the acceptance probability at the th iteration. Based on the -path generated in Algorithm 1, the MCMC (MH) estimator is constructed.
Under regularity conditions, the MCMC estimator satisfies consistency and the central limit theorem (CLT). First, the MCMC estimator is consistent if
[TABLE]
for any -integrable function and any initial state . Next, CLT holds if
[TABLE]
where the asymptotic variance matrix is given by
[TABLE]
Since the asymptotic variance can rarely be computed in a real situation, it is estimated from the sample path generated in Algorithm 1. One popular estimator of is the so-called batch means estimator; see Geyer (2011). For an -path , the batch means estimator is defined by
[TABLE]
where and are positive integers satisfying , and
[TABLE]
is called the batch length, and is the number of batches. Under regularity conditions, the batch means estimator converges to as ; see Jones et al. (2006) and Vats et al. (2015). By using CLT of and the consistency of , one can construct an approximate confidence interval of the true quantity based on an -path of the Markov chain.
3.2 Choice of the proposal distribution
When implementing the MH, an appropriate choice of the proposal function is necessary since it affects the asymptotic variance (18). Since can rarely be calculated explicitly in a real situation, a post-implementation review is usually conducted; that is, the goodness of the selected proposal distribution is evaluated after performing the MH. In this section, we introduce two methods for evaluating the selected proposal distribution. We also provide some families of proposal distributions for later use.
In practice, there are two prevalent methods to determine the performance of the proposal distribution. One is to inspect the autocorrelation plots of the marginal sample paths. For an -path , vector-valued measurable function , and the MH estimator , the sample autocorrelations are drawn against the lag , where
[TABLE]
for . From the form of the asymptotic variance , one can expect that asymptotic variance is small if the autocorrelation plots steadily decline to zero as the lags increase. Another implicative quantity is the acceptance rate (ACR), which is the percentage of times a candidate is accepted through the whole run.
Meanwhile, altering proposal distribution is generally suggested when extremely low or high ACR is observed.
Typically, proposal distribution is selected among certain classes of distributions. To find an appropriate depending on the target distribution, several classes of proposal distributions are presented in order. First, if the proposal function is of the form for some density , the candidate is drawn according to the following process:
[TABLE]
and is the current state. This type of is called the random walk proposal distribution. In the case wherein is symmetric around the origin, the acceptance probability is written simply as . Second, when for some density , then candidate is updated by
[TABLE]
This is called the independent proposal distribution. The two proposal distributions—random walk and independent—are widely used due to their simplicities. However, these proposal distributions often fail to perform well when the target distribution is heavy-tailed. To overcome this problem, the mixed preconditioned Crank-Nicolson (MpCN) proposal distribution is proposed by Kamatani (2014). This proposal distribution updates the candidate according to the following process:
[TABLE]
where , follows the gamma distribution with shape parameter and scale parameter , and for some -vector and matrix . Throughout this paper, is set to be 0.8 as a default choice in Kamatani (2014). Ideally, and are set to be and , while in practice, they can be replaced by their rough estimates since moments of are typically unknown. Note that the original MpCN proposed in Kamatani (2014) is the standardized version (that is, and , where is an identity matrix). The acceptance probability (15) of the MpCN proposal distribution can be written as
[TABLE]
One of the key differences between this proposal distribution and the first two simple ones is that in the MpCN, not only the mean but also the variance of the candidate changes with the current state . Since the MpCN proposal distribution admits larger jumps in the tail, a better acceptance rate can be expected even when is heavy-tailed.
4 The proposed method
In this section, we propose a new estimator of VaR contributions that utilizes the MCMC method, especially the MH algorithm, to achieve an efficient estimation. Theoretical study on the consistency and asymptotic normality of our MH-based estimator is provided in the Appendix for certain classes of risk models.
4.1 Assumptions and setup
We start by declaring assumptions under which our MH estimator is applicable.
Assumption 1**.**
On applying the MH estimator, we suppose the following:
- (i)
an explicit form of the joint loss density is given, and thus one can compute the quantity for any ; 2. (ii)
a generator of i.i.d. samples from the loss distribution is available; and 3. (iii)
neither the explicit form of total loss density nor the way to compute the quantity is available.
Note that assumption 1 (ii) enables us to generate samples from by setting where () is an th sample from .
Such a situation typically occurs when the joint loss density is specified through a copula density and marginal loss densities . The resulting joint loss density is specified as in formula .
As is mentioned in section 2, computing VaR contributions involves two steps; the first is to estimate VaR, and the second is to estimate VaR contributions AC with VaR replaced by its estimate. The estimation of VaR in the first step is often conducted with an MC simulation. Based on i.i.d. samples () from , VaR can be estimated, for example, by , where is the smallest integer greater than , and is the th largest sample among samples. Since is a deterministic quantity, one can regard it as a constant .
In the second step, AC is estimated. According to the crude MC method, VaR contributions are estimated by (8). As explained in section 2, the problem of this two-step procedure is that the estimator of VaR contributions in the second step is typically biased. To address this issue, we develop an MCMC (MH)-based estimator that achieves consistency and high sample efficiency.
4.2 The MH estimator of VaR contributions
We propose to estimate VaR contributions by sequentially updating samples so that all samples lie in the set . The updating rule is established so that the componentwise sum of each sample is preserved and the samples are taken from the distribution . We start to describe the MH-based estimator by reformulating the problem of computing VaR contributions.
By the full allocation property , it holds that
[TABLE]
where . Therefore, computation of VaR contributions AC= can be reduced to estimate VaR contributions of the -subportfolio, denoted by AC. In our method, this quantity AC*′* is estimated by generating samples directly from . The conditional joint density of given can be written as
[TABLE]
where the last equation follows from a linear transformation . At this point, sampling directly from is difficult since the total loss density is not easy to evaluate.
By taking , , and in the notations presented in subsection 3.1, our problem of estimating VaR contributions can be reduced to estimate in (13) by MCMC. Even though it is challenging to compute , we can compute the acceptance probability (15) given by
[TABLE]
for any by assumption 1 (i). Note that the term disappears by taking the ratio of to . Therefore, under an appropriate choice of the proposal density , the MH algorithm (algorithm 1) allows one to generate -paths of the Markov chain whose stationary distribution is . Based on a sample path, we can construct the MH estimator defined by . The algorithm to compute the MH estimator of VaR contributions is summarized in the following algorithm.
Algorithm 2: (MH estimator of VaR contributions )
Estimate VaR as by MC samples. 2. 2.
Fix the sample size , proposal distribution , and initial value . 3. 3.
Perform Algorithm 1 (MH) for the given , , and to generate an -path . 4. 4.
Set
[TABLE]
to estimate VaR contributions AC.
Moreover, under regularity conditions, consistency and asymptotic normality of the MH estimator (23) hold; see Appendix 7 for more detail.
Remark 1* (MCMC methods for ES contributions).*
5 Numerical experiments
In this section, we apply the MH estimator proposed in section 4 to various risk models, and compare its performance with other existing estimators of VaR contributions. Our simulation and empirical study based on real-world data show that the MH estimator has smaller bias and lower MSE compared with other estimators for many situations, including high-dimensional () cases. Based on the numerical experiments, we provide practical guidelines on how to choose an appropriate proposal distribution of the MH estimator given a risk model. In the experiments, we used a MacBook Air, 1.4 GHz Intel Core i5, 4 GB 1600 MHz DDR3.
5.1 Simulation study
5.1.1 Description of the numerical comparison
In the simulation study, we consider four risk models that are modelled separately by marginal densities and copula density. We adopt heavy-tailed marginal distributions and copulas with upper-tail dependences as they are often of concern in risk management. In all risk models, we set the size of the portfolio . The models are set as follows.
- (1)
The loss random variables , and follow homogeneous Pareto distributions (29) for and . The loss random vector has a -dimensional survival Clayton copula (30) with . 2. (2)
, and have the same marginal distributions as in case (1). Their copula is a student’s -copula (33) with the degree of freedom , and the dispersion matrix given by
[TABLE] 3. (3)
, and follow homogeneous student’s -distribution with degree of freedom , location parameter , and scale parameter . has a survival Clayton copula (30) with . 4. (4)
follows a multivariate student’s -distribution (35) with , , and where is defined in .
The first two models (1) and (2) consider pure losses, and the last two, (3) and (4), consider Profit&Loss. In all models, marginal distributions have variances of 2.0 and heavy tails with tail indices . Models (1) and (3) possess homogeneous upper-tail dependences with tail coefficients ; see Joe (2014) for formulas on the tail coefficients. Models (2) and (4) have upper, lower, and upper-lower tail dependences with tail coefficients , , , , , and . As inferred by the dispersion matrix , the first and second losses are negatively dependent, while other pairs of losses are positively dependent.
For each risk model, we compute several estimators of VaR contributions for a confidence level with the replaced by its Monte Carlo estimate . The estimators we compare are the MC estimator , NW estimator (Tasche, 2009), GR estimator (Hallerbach, 2003; Tasche and Tibiletti, 2004), and the MH estimator :
[TABLE]
where , and is an -path of a Markov chain whose stationary distribution is .
For all estimators, we fix the sample size . Other parameters of the estimators are determined as follows. First, for the MC estimator, we set such that the MC sample size is around . For a fixed , asymptotic normality
[TABLE]
holds; see, for example, Glasserman (2013). We report the estimate of and its approximated standard error for , where is the -component of the sample standard deviation defined by
[TABLE]
Second, for the NW estimator, we choose the kernel density as the standard normal density. We decide the bandwidth according to Silverman’s rule of thumb (Pagan and Ullah, 1999). Although asymptotic normality holds for the NW estimator, its asymptotic variance can hardly be computed because it requires the evaluation of . Therefore, we report only the estimate of . Third, for the GR estimator, we choose , and its coefficients are estimated by
[TABLE]
where and . Under regularity conditions, the asymptotic normality
[TABLE]
holds for , where and are the th components of and , respectively, for and ; is the th component of the error term ; is the conditional variance of given ; and
[TABLE]
Based on these results, we report the estimate of and its approximated standard error for , where
[TABLE]
, and is the sample standard deviation of the th residuals. Finally, for the MH estimator, we choose different proposal distributions depending on risk models (1)–(4). For each risk model, we choose (1) a random walk proposal with , where ; (2) an independent proposal , where is the density of the Dirichlet distribution with parameters , (3) and (4) MpCN proposal with , , and . In Algorithm 2, we set the initial values as . We estimate the asymptotic variances of MH estimators by the batch means estimators defined by . Following Jones et al. (2006), we choose and . We report the estimate of and its approximated standard error for .
5.1.2 Results and discussions
Due to the simplicity of MC, NW, and GR estimators, they were calculated instantly for all risk models.
As mentioned in section 3.2, the validity of the proposal selection can be inspected by autocorrelation plots and ACR. Figure 3 (v)–(viii) shows the autocorrelation plots of the Markov chains generated by Algorithm 2. The acceptance rate of the MH algorithm in each risk model was (1) 0.566, (2) 0.222, (3) 0.604, and (4) 0.767. In Figure 3 (v)–(viii), we can observe that the autocorrelation plots steadily decline below 0.1 by lag around 100 for all risk models. Together with the observations that the ACRs are moderate, we could state that the choices of the proposal distributions above are appropriate for all risk models.
Before showing the results of the estimation, let us check the shapes of the conditional distributions by plotting the -path generated by Algorithm 2. Figure 3 (i)–(iv) shows the contour plots of the generated Markov chains. According to these plots, the features of the conditional distribution in each risk model are summarized as follows:
- (1)
Pareto + survival Clayton: The contour plot in Figure 3 (i) shows that has a unique mode. The density steadily decays as they move away from the mode. In addition, the contour plot seems symmetric at the diagonal line. 2. (2)
Pareto + -copula: Unlike case (1), seems to possess two distinct modes close to the axes. High probabilities are concentrated around the edges of the simplex. Along with the axes, the gradients of the density seem to be sharp. Moreover, the contour plot in Figure 3 (ii) is asymmetric at the diagonal line . 3. (3)
Student’s + survival Clayton: Although the conditional loss random vector can take negative values, it is supported mostly on the bounded simplex as in case (1). The contour plot in Figure 3 (iii) seems unimodal and symmetric around the diagonal. The tails of are obviously light. 4. (4)
Student’s + -copula: In this case, the conditional distribution can be shown to be a Pearson type VII distribution in Appendix 7. From the contour plot in Figure 3 (iv), we can observe elliptical symmetry and tail-heaviness. Unlike case (3), the loss vector can take large negative values beyond the bounded simplex.
The results of estimation are summarized in Table 1. In the four different risk models (1)–(4), we report the estimates, their approximated standard errors, biases, and rooted MSEs (RMSEs) of the four different estimators: MC, NW, GR, and MH.
In the first risk model, true VaR contributions are obtained by equally allocating the total VaR since the marginal distributions are homogeneous and the copula is exchangeable. We observed that the MC and NW estimators have relatively larger biases than those of others. Compared with the MH estimator, the GR estimator still suffers from some inevitable bias although its standard error is quite small. The MC estimator has a relatively large standard error due to sample inefficiency. Overall, the MH estimator outperforms all other estimators in terms of RMSE.
The second risk model does not allow us to analytically calculate the true VaR contributions. Therefore, the true VaR contributions are computed by Monte Carlo numerical integration, which still works with enough accuracy for dimension three. We can observe that existing estimators suffer from biases possibly caused by asymmetry and multi-modality of the conditional distribution . In particular, the GR estimator has relatively large bias and RMSE in contrast to the good performance in the first risk model. On the other hand, the MH estimator maintains lower bias and RMSE compared with the other estimators.
In the third risk model, the true VaR contributions are given by the equal allocation based on the same discussion as in case (1). Thanks to the symmetry and unimodality of the conditional distribution , all estimators retain small biases and RMSEs. Together with the results in cases (1) and (2), one can state that the GR estimator performs well so long as is symmetric and unimodal. Additionally, the MH estimator reduces bias and RMSE compared with those of MC and NW estimators.
The final risk model provides the true VaR contributions via the formula . In such an elliptical case, the GR estimator provides quite an accurate estimate. Although the conditional distribution is heavy-tailed as seen in Figure 3 (iv), the MH estimator retains high performance compared with the MC and NW estimators. The bias of the MH estimator is significantly improved compared with the MC and NW estimators. Moreover, its standard error and RMSE are lower than those of the MC estimator.
Throughout the numerical study, the MH estimator provided small bias and RMSE regardless of the shape of the conditional distribution . In the case when is unimodal and symmetric, the GR estimator also guarantees a good performance. On the other hand, at least in our numerical experiment, the MC and NW estimators had relatively larger biases and RMSEs compared with other estimators.
5.2 Empirical study
The numerical study is now extended to a high-dimensional case with real-world data. We used the dataset stockdata in R-package huge, which consists of stock market data of closing prices from all stocks in the S&P 500 for all the days the market was open in the period of January 1, 2003 to January 1, 2008 (five years). During the time period, there remained stocks in the S&P 500. The sample size is . We transformed the data into the log-ratio of the price at time to the price at time .
Most stylized facts on stock returns listed in Chapter 3 of McNeil et al. (2015) are observable in the data. For example, return series are unimodal, leptokurtic, and heavy-tailed with little serial correlation and volatility clusters. Moreover, the return series are mutually dependent. Taking these observations into account, we adopted a copula-GARCH model with skew- white noise (ST-GARCH; see, for example, Jondeau and Rockinger, 2006; Huang et al., 2009). In the model, marginal time series are modelled by GARCH and the underlying white noise processes follow skew- distributions with an inhomogeneous degree of freedom and skewness parameter ; that is, within a fixed time period the -th return series follows
[TABLE]
for , where , , and follows a skew- distribution with density given by
[TABLE]
where is a probability density function of a student’s -distribution with degree of freedom and a skewness parameter with symmetric; see Fernández and Steel (1998) for more detail. The copula among is assumed to be a student’s -copula with parameters and independent of time .
We estimated parameters of the ST-GARCH(1,1) model with the -copula based on the copula approach. First, we fitted the ST-GARCH(1,1) model with the maximum likelihood method to the marginal time series. Then, to obtain pseudo-samples from the copula of , distributional transform was applied to the -dimensional white noise process extracted from the ST-GARCH model. We finally fit the -copula to them with method-of-moments using Kendall’s tau for the dispersion matrix and the maximum likelihood method for the degree of freedom ; see Demarta and McNeil (2005) for more detail. The results of the estimation are summarized in Figure 4 with each boxplot representing numbers of each parameter. From (B1) and (B5), the estimates of means and omegas are almost [math]. From (B3), most of the marginal white noise distributions are symmetrical but some are skewed. From (B4), their degrees of freedom range from two to ten, that is, the tail-heaviness of the return series is inhomogeneous over assets. Finally, (B8) shows that the pairwise correlations among the return series are typically from 0.2 to 0.4, and some have strong positive correlations.
Our goal in this study is to compute the conditional VaR contributions at time given the history . Under the model described above, the marginal distribution of the -th return at time is ST, where ST is a skew -distribution with density with defined in (25). Their copula is a student’s -copula with parameters and . Based on this multivariate model, conditional VaR contributions at time given histories are estimated by the same procedure as in section 5.1.
We estimated the conditional VaR contributions (ACAC) with confidence level by using MC, NW, GR, and MH methods. In MC, samples were generated and the total VaR was estimated as the -th largest sample among them. The run time of the MC simulation was minutes. The MC estimates of VaR contributions were then computed as sample means of the conditional samples whose sums fall into the set . The bandwidth was set to be so that there were conditional MC samples. Estimates of standard errors were also computed based on these samples. NW, GR, and MH estimators were computed analogously to the previous simulation study in section 5.1. For the MH estimator, the MpCN proposal distribution was chosen since the target distribution was expected to be heavy-tailed and elliptical to some extent. The length of the sample path was chosen to be , and the run time of the MH algorithm was minutes. We inspected the autocorrelation plots and ACR to check the validity of the proposal distribution. We observed that all autocorrelations decreased below if lags are larger than . Together with the ACR , we concluded that the choice of was appropriate.
Figure 5 shows the MC, NW, GR, and the MH estimates of the conditional VaR contributions (AC AC) of returns at time given histories plotted with the homogeneously allocated capitals VaR and the standardized marginal VaRs defined by , where is the so-called superadditivity ratio defined by
[TABLE]
For MC, GR, and MH estimators, the 95% confidence upper and lower bounds are also plotted. On the x-axis, the 452 assets are rearranged in increasing order of MH estimators.
Compared with the dotted line representing the homogeneous allocation, all the estimated allocated capitals show inhomogeneity among assets. Overall, the estimated VaR contributions are less volatile than the standardized marginal VaRs, which implies the benefit of the diversification effect. We can also observe that the MH estimates and GR estimates almost coincide for all assets. The confidence intervals of both estimators are much tighter than that of the MC estimator. NW estimates fluctuate around the line of the MH and GR estimates. On the other hand, the MC estimates deviate from these lines, which indicates that the MC estimators contain inevitable biases. In summary, although the true ACs are unknown, the GR and MH estimators retain stable performance compared with the MC and NW estimators even if the dimension is large and marginal distributions are inhomogeneous.
5.3 Advantages and disadvantages of the MH estimator
We summarize the advantages and disadvantages of the MH estimator compared with the other estimators. The first advantage is that the MH estimator is consistent whereas this is not always true for the other estimators. As explained in section 2, the MC, NW, and GR estimators have biases which cannot be easily eliminated. In fact, we observed in Table 1 that unignorable biases of the MC, NW, and GR estimators sometimes remain even when their standard errors are sufficiently small. In contrast, the MH estimator provides more accurate estimates of VaR contributions as due to its consistency. Since CLT also holds, the confidence interval of the true VaR contributions is also available. Secondly, the MH estimator has great sample efficiency compared with the MC estimator. While samples are generated from and most are discarded in the MC method, no samples are wasted in the MH method since it directly simulates . Consequently, the MH estimator can achieve low standard errors. Finally, the MH estimator can maintain high performance even when the conditional distribution is multimodal or heavy-tailed. As discussed in subsection 5.1.2, the performance of the GR estimator highly depends on the shape of . On the other hand, for the MH estimator, the shape of can be directly captured through the proposal distribution . By choosing an appropriate proposal distribution according to the shape of , the MH estimator can attain great performance. This advantage, however, can be seen as a disadvantage from the viewpoint of the simplicity of estimation. In general, estimation with MH requires two steps: first is to choose a family of proposal distribution, and the second is to determine its parameters. The second step of parameter estimation can be based on the MC samples falling into set , which is regarded as the pseudo samples from . Meanwhile, the first step is not so straightforward. We will discuss this issue in the next subsection 5.4. Another disadvantage of the MH estimator is that it typically requires a longer run time than other existing estimators. Since MH requires times of simulating the proposal distribution and evaluating the acceptance probability , careful programming and proposal selection are necessary to save computational time.
5.4 Guidelines for the choice of proposal distribution
A significant drawback of the MH estimator is that the choice of an appropriate proposal distribution is not as simple as the parameter selections of other existing estimators. An instruction for proposal selection is necessary since it highly affects the performance of the MH. In this subsection, we first investigate the symptoms caused by an inappropriate choice of . Then, we consider how to overcome these problems based on the numerical experiments provided above. Practical guidelines for choosing an appropriate proposal distribution are also provided.
An inappropriate choice of is largely classified into two cases. One is that proposal distribution often generates a candidate of which the probability measured by is quite small. This case occurs, for example, when does not fully capture the shape of . In such a case, the Markov chain moves quite slowly and this yields a high asymptotic standard error of the MH estimator. This symptom appears as quite a low acceptance rate and high autocorrelations. Another case is wherein generates only some parts of the whole support of . This case occurs, for example, when has distinct local modes and the variance of is so small that the chain cannot pass between ridges. In such a case, an estimate can be significantly biased, although the acceptance rate and autocorrelation plots are seemingly perfect. This symptom appears as a distorted plot of MCMC samples whose shapes are completely different from the target distribution .
How can we detect and avoid such fallacious estimates? First, as mentioned in section 3.2, it is indispensable to inspect the autocorrelation plots and ACR to prevent the first symptom. Additionally, to avoid the second symptom, we recommend drawing the plots of the generated Markov chain and comparing them with the plots of the MC samples whose componentwise sums belong to . Since such MC samples follow the distribution , one can detect the distortion of the generated Markov chain by comparing the two scatter plots of and . As an example from our simulation study in subsection 5.1, Figure 6 shows the scatter plots of the MC samples whose sums belong to overlaid on the scatter plots of the MH samples. In the figure, we can check that the shapes of the scatter plots of the MH samples bear striking resemblance to those of the MC samples for all risk models. If some part of the support of is covered by the MC samples but not by the MH samples, the choice of is questionable.
Finally, through numerical experiments we found that dependence information of the underlying risk model can be helpful for the selection of . When copula of the underlying risk model only has positive dependences for all pairs of loss variables, then the conditional distribution is likely to be unimodal and light-tailed since positive dependence among prevents them from being diversified under the constraint . In risk models (1) and (3) in subsection 5.1.1, where copula has only positive dependences, the contour plots in Figures 3 (i) and (iii) show that is unimodal and light-tailed. These features facilitate the estimation with MH since simple proposal distributions such as the random walk proposal (20) and independent proposal can perform well. Conversely, when copula has negative dependence, tends to be multimodal or heavy-tailed since negative dependence allows each component of to take extreme values under . In risk models (2) and (4) in subsection 5.1.1, where copula has negative dependences, Figure 3 (ii) indicates that is bimodal, and the contour plot in Figure 6 (d) shows that is heavy-tailed. In such cases, careful proposal selection is required for achieving an efficient MH estimator. When the losses are all nonnegative, then is supported on the bounded simplex defined in (26). Therefore, one can cover the whole support of by choosing as the independent proposal with the distribution defined on the simplex. Uniform distribution on can be the safest choice. It is also possible to choose other distributions that share the same features of observed in the MC samples. For instance, since bimodality is observed in the contour plot in Figure 6 (b), we choose as the independent proposal distribution with the Dirichlet distribution on , which can possess two distinct modes around the edges of the simplex. When is -valued and negatively dependent, an efficient MCMC is challenging since the target distribution is likely to be multimodal or heavy-tailed. As a special case, when is elliptical to some extent, then is likely to be elliptical again. In such a case, even if it is heavy-tailed, the MpCN proposal distribution is known to perform well, which is also demonstrated by the simulation study of the risk model (4) in subsection 5.1.1 and by the empirical study in subsection 5.2.
The discussions on choosing an appropriate proposal distribution are summarized as a flowchart in Figure 7. Together with the guidelines, the whole procedure of our MH estimator of VaR contributions presented in this paper is summarized as follows.
Algorithm 3: (Estimation of VaR contributions with MCMC)
Generate by MC. 2. 2.
Based on the samples generated in step 1, estimate VaR by . 3. 3.
For a bandwidth , extract subsamples such that for . 4. 4.
Choose a family of proposal distributions according to the guideline in Figure 7. 5. 5.
Based on the pseudo samples extracted in step 3, determine the parameters of the proposal distribution . 6. 6.
For a sample size , proposal density and the initial value , run Algorithm 1 to generate an -path of a Markov chain whose stationary distribution is . 7. 7.
To check the validity of proposal distribution , compute the acceptance rate, draw the autocorrelation plots, and compare the scatter plots of the MC and MH samples. 8. 8.
If the proposal selection is verified in step 7, set the MH estimator of VaR contributions based on the sample path generated in step 6. Otherwise, go to step 4 and choose another proposal distribution.
6 Concluding remarks
Computing VaR contributions for a risk model specified by joint density is generally a difficult task. To achieve this, the MH estimator of VaR contributions is proposed. Its sample efficiency is significantly improved since the MH method generates samples directly from the conditional density given the sum constraint. Moreover, since the MH estimator can capture the features of the risk model more directly than the existing estimators, it can maintain high performance even when the underlying loss distribution is multimodal or heavy-tailed. By the general theory of Markov chains, the MH estimator is consistent and asymptotically normally distributed. Through simulation and empirical studies based on real-world data, the performance of the MH estimator was compared with those of other existing estimators for various risk models. The numerical results demonstrated that in most risk models, the MH estimator had smaller bias and RMSE compared with other existing estimators even when the dimension of the portfolio was high, such as .
Potential future research includes a theoretical study of the conditional joint distribution of . Our main interest is in the influence of the underlying copula of a risk model on the tail behavior and multimodality of the density . We believe that revealing relationships among them can provide more promising guidelines for the proposal selection of the MH estimator.
Acknowledgements
We wish to thank to Paul Embrechts from ETH Zürich for his valuable comments regarding the simulation setup. We would also like to express our gratitude to Kengo Kamatani from Osaka University, and Marius Hofert from the University of Waterloo for fruitful discussions on MCMC and Archimedean copulas.
Funding
This work was supported by the Japan Society for the Promotion of Science (JSPS) under the Core-to-Core program at Keio University.
7 Consistency and asymptotic normality
In this appendix, we derive conditions on a copula and marginal distributions with which the corresponding MH estimator of VaR contributions satisfies consistency (16) and CLT for some choice of proposal distribution . This study reveals which proposal distribution is appropriate for a given risk model.
We classify the loss distribution into two cases; one wherein and another wherein . The former corresponds to the case wherein we model pure losses, and the latter to the case of profits and losses (P&L). Our result is mainly about the former case, and we provide some limited examples for the latter case. It should be emphasized that the former case of pure losses includes a broad range of loss models. To demonstrate this, let , and set , . If , then . For , the translation invariance of VaRp implies that
[TABLE]
Therefore, the allocated capital of is given by
[TABLE]
Consequently, one can estimate by first estimating based on the joint distribution of such that \text{supp}(f_{\tilde{\bm}{X}})=\mathbb{R}^{d}_{+}, and then subtracting from . Therefore, our result for the former case includes the case of P&L where the minimums of the profits are bounded.
7.1 Case of pure losses
When , the conditional distribution is supported on the following bounded set called the -simplex:
[TABLE]
Thanks to the compactness of the support, we can state simple conditions on the marginal loss densities and copula density, which leads to consistency and CLT of the MH estimator.
Theorem 7.1**.**
Suppose that the joint distribution is supported on and has marginal densities and a copula density . Then, -CLT holds for the MH estimator of VaR contributions if the following conditions hold:
- (C1)
,
- (C2)
* is positive and bounded above for any for , and*
- (C3)
* is positive and bounded above for any .*
Proof.
According to Theorem 23 in Roberts and Rosenthal (2004), -CLT holds if the Markov chain is uniformly ergodic whenever . Since , the moment the condition is satisfied by the inequality
[TABLE]
for any . Thus, it suffices to show that the Markov chain is uniformly ergodic. According to Theorem 1.3 in Mengersen and Tweedie (1996), the Markov chain is uniformly ergodic if (and only if) the minorization condition (Rosenthal, 1995) holds on the whole space ; that is, there exists a positive integer , a positive number , and a probability measure such that
[TABLE]
for any and , where . Our target distribution can be written as
[TABLE]
where and . Thus, by conditions , , and that , we have
[TABLE]
Using and condition , the minorization condition can be checked as follows. For any , define
[TABLE]
Then, for any , we have
[TABLE]
Therefore, the minorization condition holds for , , and . Consequently, the Markov chain is uniformly ergodic, and thus -CLT holds. Since the minorization condition (27) holds, consistency of follows by Theorem 1 in Nummelin (2002). ∎
An example of the pair of risk model and proposal distribution is given in the following example.
Example 1**.**
For , let follow Pareto distribution with density given by
[TABLE]
Suppose has a survival Clayton copula with the density given by
[TABLE]
Some simple calculations show that the marginal distribution (29) satisfies and the copula (30) satisfies under a very mild sufficient condition that . Therefore, with any choice of proposal distribution satisfying , the corresponding MH estimator (23) satisfies consistency and asymptotic normality. A possible choice of is the random walk proposal with the density of multivariate normal distribution with mean zero. Since for , is always positive.
It is worth noting that the condition is irrelevant to the copula on the upper tail part . Therefore, holds even if a copula density explodes at the upper corner, which is often the case with copulas having upper tail dependence. In fact, a more general result holds for survival Archimedean copulas. A -dimensional Archimedean copula with an Archimedean generator is given by
[TABLE]
where is a continuous and nonincreasing function satisfying , and , and is decreasing on . The inverse is well-defined on and is defined by . Let be the th derivative of . An Archimedean generator defines a proper -copula via (31) for any if and only if is completely monotone, that is, on for all ; see McNeil et al. (2009). We denote the class of completely monotone generators as . According to Bernstein’s Theorem (see, for example, Feller, 2008), admits the Laplace-Stieltjes representation for some positive random variable .
Theorem 7.2** (Sufficient condition of for survival Archimedean copulas).**
Let be a completely monotone Archimedean generator. If where is such that , then the survival Archimedean copula has a density satisfying the condition in Theorem 7.1; moreover, has a zero lower tail dependence coefficient.
Proof.
Denote and \underline{u}_{j}=1-\bar{u}_{j}>0. The density of the survival Archimedean copula is given by
[TABLE]
where and . When , we have 0<\underline{u}_{j}\leq 1-u_{j}\leq 1 and thus where \bar{t}_{j}=\psi^{-1}(\underline{u}_{j})<\infty. Thus, .
Since , it is of the form for some positive random variable . Therefore, on , we have for and since and for and by assumption. Consequently, the density (7.1) is bounded from below and above.
When , the corresponding Archimedean copula has an upper tail dependence coefficient
[TABLE]
where the last equality comes from l’Hpital’s rule. Since
[TABLE]
since and go to as . Thus, for the survival Archimedean copula, . ∎
According to the Theorem 7.2, the survival Clayton copula satisfies while the survival Gumbel copula does not because it has a positive lower tail dependence coefficient; see Hofert et al. (2012).
Remark 2* (Consistency and CLT for copulas with lower tail dependence).*
Condition does not hold for elliptical copulas with lower tail dependence, such as a student’s -copula with density
[TABLE]
By carefully checking the proof of Theorem 7.1, the consistency and asymptotic normality of still hold under a weaker condition than and ;
[TABLE]
for some positive constant , where . While it is not straightforward to determine, one sufficient condition of (34) under is that is bounded above on . Another condition is that the proposal density explodes faster than . An example of such can be an independent proposal distribution with the density of the Dirichlet distribution for , which explodes to as approaches to an axis. Therefore, by choosing such proposal distributions, consistency and CLT can still hold even if a copula density explodes at the lower corner .
7.2 Case of profits and losses
In contrast to the case of pure losses, showing the consistency and CLT of the MH estimator is challenging for the case wherein we model P&L. Since the conditional density is supported on the unbounded space , careful study of its tail behaviors is necessary. When the original loss random variable follows an elliptical distribution, the results of Kamatani (2017) can be applicable to justify the CLT of our MH estimator with the MpCN proposal distribution. An example of justification of CLT for the case wherein follows the multivariate student’s -distribution is provided below.
Example 2** (Justification of CLT for multivariate student’s -Distribution).**
We demonstrate that the MpCN proposal distribution achieves the CLT of VaR contributions when the underlying loss model is a multivariate student’s -distribution with density
[TABLE]
Let where , , and . Throughout the discussion, we set for simplicity. Write
[TABLE]
for , , and . Then, it holds that
[TABLE]
where , , and . Using this identity, we have that
[TABLE]
where . Provided , follows a -dimensional elliptical distribution with the location parameter , scale parameter , and the density generator given by
[TABLE]
This type of distribution is called a Pearson type distribution (Schmidt, 2002).
Consider the MH estimator where target distribution is , and proposal distribution is MpCN . According to Theorem 25 in Roberts and Rosenthal (2004), -CLT holds if the Markov chain is geometrically ergodic and . According to Proposition 3.4 in Kamatani (2016), the Markov chain with the MpCN proposal distribution is geometrically ergodic if for some , is strictly positive and continuous, and it is symmetrically regularly varying, that is,
[TABLE]
for some function such that for any , where . We will see that the moment condition holds, and the condition on tail is also satisfied for .
Write . It can be shown that is regularly varying (see, for example, Resnick, 2013) at with index ; that is,
[TABLE]
According to Proposition 3.7 in Schmidt (2002), is regularly varying with index . Then, according to Karamata’s Theorem (we referred to Resnick, 2013), is regularly varying with index . Therefore, holds for any ; see Mikosch (1999). Thus, all the moment conditions above are satisfied as long as . In the elliptical case, tail condition is a direct consequence of . Since for all , it holds that
[TABLE]
Thus, by taking
[TABLE]
in , is shown to be symmetrically regularly varying. Putting them together, we conclude that the MH estimator with the MpCN proposal distribution satisfies -CLT when the underlying loss vector follows a multivariate student’s -distribution with and . Note that in the numerical experiment in section 5, we set and . Since , CLT holds true.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Demarta and Mc Neil (2005) Demarta, S. and Mc Neil, A.J., The t copula and related copulas. International statistical review , 2005, 73 , 111–129.
- 2Denault (2001) Denault, M., Coherent allocation of risk capital. Journal of Risk , 2001, 4 , 1–34.
- 3Dev (2004) Dev, A., Economic capital: a practitioner guide , 2004 (Risk Books: New York).
- 4Fan et al. (2012) Fan, G., Zeng, Y. and Wong, W.K., Decomposition of portfolio Va R and expected shortfall based on multivariate Copula simulation. International Journal of Management Science and Engineering Management , 2012, 7 , 153–160.
- 5Feller (2008) Feller, W., An introduction to probability theory and its applications , Vol. 2, , 2008, John Wiley & Sons.
- 6Fernández and Steel (1998) Fernández, C. and Steel, M.F., On Bayesian modeling of fat tails and skewness. Journal of the American Statistical Association , 1998, 93 , 359–371.
- 7Geyer (2011) Geyer, C., Introduction to markov chain monte carlo. In Handbook of Markov Chain Monte Carlo , pp. 3–47, 2011 (Springer: New York).
- 8Glasserman (2005) Glasserman, P., Measuring marginal risk contributions in credit portfolios. Journal of Computational Finance , 2005, 9 , 1–41.
