Forecast Encompassing Tests for the Expected Shortfall
Timo Dimitriadis, Julie Schnaitmann

TL;DR
This paper develops new statistical tests to evaluate the accuracy of Expected Shortfall forecasts, a key risk measure in banking regulation, using robust theory and simulation studies.
Contribution
It introduces novel forecast encompassing tests for Expected Shortfall, incorporating joint loss functions and robust asymptotic theory, with applications to financial asset forecasting.
Findings
Tests perform well in finite samples
Forecast combination methods improve risk prediction
Applicable to various financial assets
Abstract
We introduce new forecast encompassing tests for the risk measure Expected Shortfall (ES). The ES currently receives much attention through its introduction into the Basel III Accords, which stipulate its use as the primary market risk measure for the international banking regulation. We utilize joint loss functions for the pair ES and Value at Risk to set up three ES encompassing test variants. The tests are built on misspecification robust asymptotic theory and we investigate the finite sample properties of the tests in an extensive simulation study. We use the encompassing tests to illustrate the potential of forecast combination methods for different financial assets.
| Str ES | Aux ES | VaR ES | VaR | Str ES | Aux ES | VaR ES | VaR | |||
| GARCH | ||||||||||
| 15.25 | 15.20 | 18.35 | 22.75 | 14.40 | 14.65 | 18.80 | 22.50 | |||
| 11.55 | 11.10 | 15.60 | 20.10 | 12.30 | 12.70 | 17.80 | 22.85 | |||
| 11.45 | 11.55 | 16.35 | 18.80 | 11.00 | 11.25 | 14.60 | 17.55 | |||
| 10.05 | 10.25 | 13.10 | 15.35 | 9.75 | 10.15 | 13.90 | 15.75 | |||
| Strict ES | Aux ES | Joint VaR ES | VaR | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | NR | E1 | E2 | C | NR | E1 | E2 | C | NR | E1 | E2 | C | NR | E1 | E2 | C | ||||
| IBM | ||||||||||||||||||||
| HS | 57 | 43 | 57 | 43 | 43 | 57 | 43 | 57 | ||||||||||||
| RM | 57 | 43 | 57 | 43 | 43 | 57 | 14 | 43 | 43 | |||||||||||
| GJR | 57 | 43 | 57 | 43 | 43 | 57 | 14 | 29 | 57 | |||||||||||
| GAS | 43 | 57 | 43 | 57 | 29 | 71 | 29 | 71 | ||||||||||||
| G1F | 14 | 29 | 43 | 14 | 14 | 29 | 43 | 14 | 14 | 86 | 14 | 86 | ||||||||
| G2F | 14 | 29 | 57 | 14 | 29 | 57 | 29 | 57 | 14 | 29 | 43 | 29 | ||||||||
| ASES | 14 | 86 | 14 | 86 | 14 | 57 | 29 | 14 | 57 | 29 | ||||||||||
| SAVES | 14 | 86 | 14 | 86 | 14 | 86 | 14 | 86 | ||||||||||||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Forecast Encompassing Tests for the Expected Shortfall
Timo Dimitriadis111Corresponding Author. Schloß-Wolfsbrunnenweg 35, 69118 Heidelberg, Germany; Tel. +49 176 47796302.
Heidelberg Institute for Theoretical Studies
Institute of Economics, Universität Hohenheim
and
Julie Schnaitmann
Department of Economics, University of Konstanz
Abstract
We introduce new forecast encompassing tests for the risk measure Expected Shortfall (ES). The ES currently receives much attention through its introduction into the Basel III Accords, which stipulate its use as the primary market risk measure for the international banking regulation. We utilize joint loss functions for the pair ES and Value at Risk to set up three ES encompassing test variants. The tests are built on misspecification robust asymptotic theory and we investigate the finite sample properties of the tests in an extensive simulation study. We use the encompassing tests to illustrate the potential of forecast combination methods for different financial assets.
Keywords: evaluating forecasts, combining forecasts, loss function, model selection, statistical tests
1 Introduction
Through the recent introduction of Expected Shortfall (ES) as the primary market risk measure for the international banking regulation in the Basel III Accords (Basel Committee, , 2016, 2017), there is a great demand for reliable methods for evaluating and comparing the predictive ability of competing ES forecasts. The ES at probability level is defined as the expectation of the returns smaller than the respective -quantile (the Value at Risk, VaR), where is usually chosen to be 2.5% as proposed by the Basel Accords. The ES is replacing the VaR in the banking regulation as it overcomes several shortcomings of the latter such as being not coherent and its inability to capture tail risks beyond the -quantile (Artzner et al., , 1999; Danielsson et al., , 2001; Basel Committee, , 2013). While the empirical properties favor the ES over the VaR as a risk measure, the ES lacks elicitability, which implies that no strictly consistent loss functions exist. The non-elicitability of the ES is overcome by considering the pair VaR and ES which are jointly elicitable, i.e. there exist joint loss functions for the VaR and the ES (Fissler and Ziegel, , 2016). This discovery triggered a rapidly growing branch of literature in developing forecasting methods and forecast evaluation techniques for the ES, see Patton et al., (2019), Dimitriadis and Bayer, (2019), Bayer and Dimitriadis, (2020), Taylor, (2019), Barendse, (2020), Fissler et al., (2016) and Nolde and Ziegel, (2017) among others.
A desirable tool for the comparison of ES forecasts are encompassing tests, which however build upon the existence of strictly consistent loss functions. Given two competing forecasts A and B, forecast encompassing tests the null hypothesis that forecast A performs not worse than any (linear) combination of these forecasts. This is carried out by testing whether the optimal combination weight of forecast B deviates significantly from zero.222For the classical theory on forecast encompassing see Hendry and Richard, (1982), Mizon and Richard, (1986), Diebold, (1989), Ericsson, (1993), Harvey et al., (1998), Clark and McCracken, (2001), Giacomini and Komunjer, (2005), Newbold and Harvey, (2007) and Clements and Harvey, (2009) among others. This null hypothesis allows for the convenient interpretation that forecast B does not add any information to forecast A and thus, forecast A is superior to forecast B. The existence of appropriate loss functions is inevitable for encompassing tests for two reasons. First, the superior performance of competing forecasts is defined in the statistical sense by using strictly consistent loss functions. Second, loss and identification functions are crucial for M- or GMM-estimation of the optimal forecast combination weights through an appropriate regression framework for the risk measure under consideration.
In this paper, we introduce novel encompassing tests for the ES based on the joint loss functions for the VaR and ES developed in Fissler and Ziegel, (2016). We introduce the following three test variants for the ES. First, we propose to jointly test forecast encompassing for the VaR and ES, henceforth denoted the joint VaR and ES encompassing test. We introduce a second test variant, denoted the auxiliary ES encompassing test, which estimates the optimal combination weights for the vector of the VaR and ES, however, only tests the parameters associated with the ES. While incorporating both, VaR and ES forecasts, this variant only tests encompassing of the ES forecasts. The third variant overcomes the tests’ dependence on VaR forecasts and tests encompassing of competing ES forecasts stand-alone, which comes at the cost of a potential model misspecification. We henceforth call this test the strict ES encompassing test. This variant is particularly relevant due to the current set of rules established by the Basel Committee of Banking Supervision, which only imposes the financial institutions to report ES forecasts (Basel Committee, , 2016, 2017). Only this test variant can be applied in situations where the person evaluating the forecasts merely has forecasts for the ES at hand. However, in situations where both, the VaR and ES forecasts (stemming from the same model or forecasting procedure) are available, application of the joint or auxiliary tests is generally recommended.
We implement the encompassing tests through M-estimation of the optimal combination weights (Patton et al., , 2019; Dimitriadis and Bayer, , 2019) and in an environment with asymptotically non-vanishing estimation uncertainty of the forecasting procedures (Giacomini and Komunjer, , 2005; Giacomini and White, , 2006). As the strict ES encompassing test is potentially subject to model misspecification, we derive the asymptotic distribution of the test statistics in a general setting which allows for misspecified models. This generalizes the asymptotic theory of Patton et al., (2019), Dimitriadis and Bayer, (2019) and Bayer and Dimitriadis, (2020) to potentially misspecified (and nonlinear) models. We base the Wald test statistics of the encompassing tests on a misspecification-robust covariance estimator. Our implementation further introduces a link or combination function which captures the different linear and nonlinear forecast combination methods in the existing encompassing testing literature, see Clements and Harvey, (2009) and Clements and Harvey, (2010) among others.
We analyze the finite sample behavior of our encompassing tests and the effect of the potential model misspecification in an extensive simulation study using models from various model classes associated with the ES. For this, we consider classical GARCH models, the GAS (generalized auto-regressive score) models with time-varying higher moments of Creal et al., (2013), the GAS models for the VaR and ES of Patton et al., (2019) and the ES-CAViaR models of Taylor, (2019). Data stemming from the latter three model classes induces some model misspecification for the strict ES encompassing test, which allows us to evaluate the effect the misspecification has on our tests. We find that all tests exhibit approximately correct size and good power properties for all considered simulations. This also holds for the strict ES encompassing test which demonstrates that this test is robust to the degree of model misspecification we usually encounter in financial applications.
Tests for forecast encompassing are commonly used to establish a theoretical basis for forecast combinations in cases when encompassing is rejected for both forecasts (Clements and Harvey, , 2009; Newbold and Harvey, , 2007; Giacomini and Komunjer, , 2005). This implies that neither of the forecasts stand-alone performs as good as an optimal forecast combination, which indicates that a forecast combination incorporates more information than the individual forecasts. Giacomini and Komunjer, (2005), Timmermann, (2006), Halbleib and Pohlmeier, (2012) and Taylor, (2020) advocate general forecast combination methods for multiple reasons and particularly for risk measures with small probability levels, as it is customary for the VaR and the ES.
We apply our encompassing tests to ES forecasts from classical GARCH and GAS models, but also from the recently developed dynamic ES models of Taylor, (2019) and Patton et al., (2019) for daily returns of the IBM stock, the SP 500 and the DAX 30 indices. The test results imply that for the IBM stock, forecast combination methods outperform the stand-alone forecasting models in many instances. In comparison, this pattern seems to be less pronounced for the SP 500 and the DAX 30 indices, which are already well diversified through their versatile composition. Thus, classical diversification gains (Timmermann, , 2006) of forecast combination methods might be less pronounced for stock indices. The two ES based test variants exhibits very similar results, which further indicates that the strict ES test is robust against potential misspecifications in financial settings.
The classical idea of forecast encompassing goes back to Hendry and Richard, (1982), Chong and Hendry, (1986) and Mizon and Richard, (1986) and is developed for mean forecasts under the squared loss function. Broad reviews on encompassing testing are provided e.g. by Newbold and Harvey, (2007) and Clements and Harvey, (2009). Harvey and Newbold, (2000) extend the encompassing technique which classically focuses on two competing forecasts to encompassing of multiple forecasts. Giacomini and Komunjer, (2005) develop (conditional) encompassing of quantile forecasts and focus on encompassing tests for methods instead of models. Clements and Harvey, (2010) generalize encompassing tests to probabilistic forecasts by relying on strictly consistent scoring rules. Giacomini and Komunjer, (2005) and Clements and Harvey, (2010) investigate extensions of encompassing to more complicated functionals of the conditional distribution. Our work pursues this path by developing encompassing tests for the ES as a prominent example of higher-order elicitable functionals where only joint loss functions for vector-valued functionals are available. Our testing approach can be adapted to further higher-order elicitable functionals such as the pair mean, variance and the Range Value at Risk (Cont et al., , 2010; Embrechts et al., , 2018; Fissler and Ziegel, , 2019).
The rest of the paper is organized as follows. In Section 2, we introduce encompassing tests for the ES and derive the asymptotic distribution of the associated test statistics under model misspecification. Section 3 presents an extensive simulation study analyzing the size and power properties of our tests. In Section 4, we apply the testing procedure to daily financial returns of the IBM stock and the S&P 500 and DAX 30 indices and Section 5 concludes. All proofs are deferred to Appendix A. Technical details of the proofs and additional results are provided in the supplementary material.
2 Theory
We consider a stochastic process , which is defined on some common and complete probability space , where and . We partition the stochastic process as , where is an absolutely continuous random variable of interest and is a vector of explanatory variables. We denote the conditional distribution of given the information set by . Accordingly, , and denote the expectation, variance and density corresponding to . Following Giacomini and Komunjer, (2005), we consider (-measurable) one-step ahead forecasts, henceforth denoted by and , which are generated by a function f\big{(}\gamma_{t,m},Z_{t},Z_{t-1},\dots\big{)}, which is fixed over time. For this, denotes the (estimated) model parameters at time or alternatively the semi- or non-parametric estimator used in the construction of the forecasts. This construction allows for both, fixed forecasting schemes, where the model parameters are only estimated once, and rolling window forecasting schemes, where the parameters are re-estimated in each step. We denote general competing forecasts by , specific VaR (quantile) forecasts by and ES forecasts by .
In the context of evaluating point forecasts, an important property of risk measures (or more general statistical functionals) is elicitability (Gneiting, , 2011). Elicitability means that there exist strictly consistent loss functions, i.e. loss functions depending on the random variable and the issued forecast , whose expectation is uniquely minimized by the true risk measure . Using such a loss function, one can assess the quality of issued forecasts by comparing their average losses induced by the realizations of the predicted variable. Evaluating forecasts through strictly consistent loss functions has the desired impact that it incentivizes financial institutions to truthfully report their correct forecasts (Gneiting, , 2011; Fissler et al., , 2016). As a direct consequence, the literature on tests for forecast comparison and forecast rationality evolves around the associated loss functions, see Mizon and Richard, (1986), Diebold and Mariano, (1995), Elliott et al., (2005), Giacomini and Komunjer, (2005), Giacomini and White, (2006), Patton and Timmermann, (2007), Clements and Harvey, (2010), Gneiting, (2011) and Patton, (2011) among many others.
Many important statistical functionals such as the variance, the ES, the minimum, the maximum and the mode are not elicitable, i.e. no strictly consistent loss functions exist (Gneiting, , 2011; Heinrich, , 2014; Fissler and Ziegel, , 2016). This deficiency calls for generalized approaches in many academic disciplines. We built our test procedure for the ES on such an approach, which considers multiple functionals stacked as vectors and considers joint elicitability. Fissler and Ziegel, (2016) show that the ES is jointly elicitable with the VaR by constructing strictly consistent joint loss functions for this pair, which we utilize in our encompassing approach.
In the following section, we formally introduce the concept of forecast encompassing in the classical case of one-dimensional, real-valued and elicitable functionals. Subsequently, we make use of the higher-order elicitability of the ES and generalize the encompassing approach to ES forecasts in Section 2.2.
2.1 The Encompassing Principle
Following e.g. Hendry and Richard, (1982), Mizon and Richard, (1986), Diebold, (1989) and Giacomini and Komunjer, (2005), we formally introduce the classical concept of linear forecast encompassing for one-dimensional, real-valued and elicitable functionals. We assume that two competing forecasters predict the variable of interest and issue one-step ahead point forecasts \hat{\boldsymbol{f}_{t}}=\big{(}\hat{f}_{1,t},\hat{f}_{2,t}\big{)} for a given functional .333While we focus our approach on one-step ahead forecasts, extensions to multi-step ahead forecasts are straight-forward by employing a HAC-type estimator for the asymptotic covariance. In order to conduct the forecast evaluation in an out-of-sample fashion, we divide the sample size in an in-sample part of size and an out-of-sample part of size such that . The in-sample period is used to generate the forecasts and as described in the beginning of Section 2, while the out-of-sample period is used for the evaluation of the forecasts. This procedure poses little restrictions on how to generate the forecasts and allows for parametric, semiparametric or nonparametric techniques and for nested and non-nested forecasting procedures (Giacomini and Komunjer, , 2005).
Let \rho\big{(}Y_{t+1},\hat{f}_{t}\big{)} be a strictly consistent loss function for . Then, we say that forecast encompasses at time , if
[TABLE]
for all \big{(}\theta_{1},\theta_{2}\big{)}\in\Theta\subseteq{\mathbb{R}^{2}}. Equation (2.1) implies that, in terms of the loss induced by , the forecast is at least as good as any (linear) combination of and . Hence, forecast does not add any information on which is not already incorporated in . We define \big{(}\theta^{\ast}_{1},\theta^{\ast}_{2}\big{)} as the optimal combination parameters which minimize the expected loss,
[TABLE]
By definition, it holds that \mathbb{E}\left[\rho\big{(}Y_{t+1},\theta_{1}\hat{f}_{1,t}+\theta_{2}\hat{f}_{2,t}\big{)}\right]\geq\mathbb{E}\left[\rho\big{(}Y_{t+1},\theta^{\ast}_{1}\hat{f}_{1,t}+\theta^{\ast}_{2}\hat{f}_{2,t}\big{)}\right] for all \big{(}\theta_{1},\theta_{2}\big{)}\in\Theta. In particular, this implies that
[TABLE]
Combining (2.1) and (2.3) yields the following definition of forecast encompassing.
Definition 2.1 (Linear Forecast Encompassing for Elicitable Functionals).
We say that forecast encompasses at time with respect to the loss function if and only if
[TABLE]
which is equivalent to \big{(}\theta^{\ast}_{1},\theta^{\ast}_{2}\big{)}=\big{(}1,0\big{)}.
Tests for forecast encompassing are carried out through the following steps. First, we regress the realizations onto the forecasts and using an appropriate regression technique for the functional under consideration in order to obtain the estimated combination (or encompassing) parameters and their asymptotic distribution. Then, we test whether these parameters equal one and zero respectively.
As discussed e.g. in Clements and Harvey, (2009) and Clements and Harvey, (2010), there exist several different testing specifications available for the encompassing principle, which differ in terms of the admissible specifications of the linear (or nonlinear) forecast combination formula. We generalize and unify these approaches by introducing a general link or combination function,
[TABLE]
which maps the forecasts and the respective parameters onto a linear or nonlinear forecast combination and where denotes the random space of the issued forecasts. For this, the function and the parameter space have to be chosen such that there exists a , such that almost surely, which enables testing whether alone captures the full information provided by any forecast combination through testing the parametric restriction .
Definition 2.2 (General Forecast Encompassing for Elicitable Functionals).
We say that forecast encompasses at time with respect to the loss function and with respect to the link function if and only if
[TABLE]
which is equivalent to .
This general definition unifies the following existing specifications of forecast encompassing, but also allows for more general linear and nonlinear specifications, see e.g. Ericsson, (1993), Clements and Harvey, (2009) and Clements and Harvey, (2010).
Example 2.3.
Prominent examples for linear and nonlinear forecast encompassing are the following link functions and associated null hypotheses,
- (1)
and or , 2. (2)
and or , 3. (3)
and or , 4. (4)
and , 5. (5)
and , 6. (6)
and , 7. (7)
g(\hat{\boldsymbol{f}_{t}},\theta)=\theta_{1}\pm\exp\big{(}\theta_{2}\log(\pm\hat{f}_{1,t})+\theta_{3}\log(\pm\hat{f}_{2,t})\big{)} and .
2.2 Forecast Encompassing for the Expected Shortfall
In this section, we consider encompassing tests for the ES. For absolutely continuous distributions , the ES is formally defined as
[TABLE]
where denotes the conditional -quantile of given . As discussed in the previous section, the main ingredient of forecast encompassing tests is the specification of the underlying loss function, which has to be associated with the risk measures we consider forecasts for. As such loss functions do not exist for the ES stand-alone, we utilize a strictly consistent joint loss function for the pair consisting of the ES and the VaR, given by Fissler and Ziegel, (2016) as
[TABLE]
where the arguments , and denote the return realization, the quantile and the ES respectively. As this loss function exhibits the desirable property of having loss differences which are homogeneous of order zero, it is often denoted as the FZ0-loss function, see e.g. Patton et al., (2019). While there exist infinitely many strictly consistent loss functions for the pair VaR and ES, the recent literature seems to agree upon this choice: Dimitriadis and Bayer, (2019) find that it exhibits a stable numerical performance in M-estimation and empirically yields relatively efficient parameter estimates. Nolde and Ziegel, (2017) discuss the desirable property of homogeneity of these loss functions and Patton et al., (2019), Bayer and Dimitriadis, (2020) and Taylor, (2019) use this loss function to estimate dynamic ES models.
Following the specification of a link function in (2.5), we introduce the quantile- and ES-specific link functions
[TABLE]
where and denote the random spaces of the VaR and ES forecasts, and such that and . We assume that the functions , and the parameter space are chosen such that there exist values and , such that and almost surely.
In the following, we introduce the concept of joint forecast encompassing for the pair consisting of the VaR and the ES. Analogously to (2.2), we define the optimal combination parameters for the VaR and ES as
[TABLE]
Definition 2.4 (Joint VaR and ES Forecast Encompassing).
Let \big{(}\hat{q}_{1,t},\hat{e}_{1,t}\big{)} and \big{(}\hat{q}_{2,t},\hat{e}_{2,t}\big{)} denote pair-wise competing forecasts for the pair consisting of the conditional quantile and ES of . We say that \big{(}\hat{q}_{1,t},\hat{e}_{1,t}\big{)} encompasses \big{(}\hat{q}_{2,t},\hat{e}_{2,t}\big{)} at time with respect to the link functions and if and only if
[TABLE]
where the loss function is given in (2.8). This holds if and only if \big{(}\beta^{\ast},\eta^{\ast}\big{)}=\big{(}\beta_{0},\eta_{0}\big{)}.
We test whether the sequence of joint quantile and ES forecasts \big{(}\hat{q}_{1,t},\hat{e}_{1,t}\big{)} encompasses the sequence \big{(}\hat{q}_{2,t},\hat{e}_{2,t}\big{)} for all by estimating the parameters of the following semiparametric regression,
[TABLE]
where and almost surely for all by using the M-estimation technique introduced in Patton et al., (2019) and Dimitriadis and Bayer, (2019). We then test for \big{(}\beta^{\ast},\eta^{\ast}\big{)}=\big{(}\beta_{0},\eta_{0}\big{)} using a Wald type test statistic.
Definition 2.4 develops a joint encompassing test for the VaR and ES, which is reasonable given the joint elicitability property of the VaR and ES. However, a further objective of this paper is to construct encompassing tests for the ES stand-alone, which we do in the following.
Definition 2.5 (Auxiliary ES Forecast Encompassing).
Let \big{(}\hat{q}_{1,t},\hat{e}_{1,t}\big{)} and \big{(}\hat{q}_{2,t},\hat{e}_{2,t}\big{)} denote competing forecasts for the pair consisting of the conditional quantile and ES of . We say that auxiliarily encompasses at time with respect to the link functions and if and only if
[TABLE]
that is, if and only if .444It is important to notice that the optimal combination parameter on the left-hand side of (2.14) is given by (2.11) and not in the sense of an optimal combination parameter of a restricted model.
This parameter restriction is tested using a Wald type test statistic based on the estimates of the regression setup given in (2.13). As we do not test the quantile specific parameters , we do not impose that the underlying quantile forecast also encompasses its competitor under this null hypothesis. Hence, even though this test is based on the joint regression, it only tests encompassing of the ES forecasts. We call this test auxiliary ES encompassing test as it still depends on the auxiliary quantile forecasts which are used for the estimation of the optimal combination parameters.
Given that both, the VaR and ES forecasts are available, application of either the joint or auxiliary test is the most plausible approach given their joint elicitability. However, even though the emphasis of the auxiliary encompassing test is on the ES, it still requires quantile forecasts for the implementation of the parameter estimation. This can be problematic for two reasons. First, the quantile forecasts are still used in the estimation procedure and thus have an indirect effect on the parameter estimates of the ES specific parameters. E.g., the previous tests are not applicable for ES forecasts which are based on the same VaR forecasts, as this implies perfect collinearity of the quantile regressors. Second, the auxiliary test is only applicable in the setup where the person applying the test has access to the quantile forecasts. In the current implementation of the regulatory framework of the Basel Committee (Basel Committee, , 2016, 2017), the banks are only obligated to report their ES forecasts (at probability level ), but not the corresponding VaR forecasts. Thus, the accompanying VaR forecasts, which the ES forecasts are internally based on, are in general not available to the regulator who has to decide on an adequate risk management of the financial institution at hand.
In order to account for these scenarios, we further introduce the strict ES encompassing test, which only requires ES forecasts in the following. For this, we slightly modify the definition of (2.11) by replacing through ,555Note that the parameters denoted in (2.11) and in (2.15) can generally differ.
[TABLE]
Definition 2.6 (Strict ES Forecast Encompassing).
Let and denote competing ES forecasts of the underlying predictive distribution . We say that strictly encompasses at time with respect to the link functions and if and only if
[TABLE]
that is, if and only if .
We test whether strictly encompasses for all by setting up the slightly transformed regression
[TABLE]
where and almost surely for all . The crucial difference between this test and the joint and auxiliary encompassing tests is that instead of using the quantile forecasts in the quantile link function , we use the ES forecasts for both, the quantile and ES link functions and . We argue that this can be seen as a best feasible solution due to the lack of loss functions for the ES stand-alone together with the necessity of developing forecast evaluation methods for the ES stand-alone due to the current setup of the Basel III regulatory framework (Basel Committee, , 2016, 2017).
The underlying idea of this test is mainly motivated by pure scale models, i.e. , , which is still the most frequently used class of models for risk management with the GARCH and stochastic volatility models as prime examples. For this model class, the VaR and ES forecasts are perfectly colinear, , where and are the -quantile and -ES of the distribution . Hence, the quantile model is correctly specified, but with transformed quantile parameters .666For the prominent case of linear encompassing link formulas , it holds that . As we only test on the ES-specific parameters as described in Definition 2.6, our test is invariant to this (often linear) transformation of the parameter and thus, it is correctly specified for pure scale models.
In the general case, the quantile equation can possibly be misspecified. Thus, we provide asymptotic theory under general model misspecification for the M-estimator in the following section. The potential model misspecification might bias the pseudo-true parameters and challenge the interpretability of the test decision, but we argue that this effect is negligible for this setup. First, the misspecification is only slight in the sense that daily financial return data is approximated well by pure scale processes. Second, the misspecification is indirect in the sense that while the quantile parameters are potentially misspecified, we only test the ES parameters, which are influenced by the misspecification only indirectly through the joint estimation. Furthermore, we illustrate that the performance of our strict ES encompassing test is not negatively influenced by more general data generating processes in the simulation study in Section 3 by considering GAS models with time-varying higher moments of Creal et al., (2013) and the dynamic ES models of Patton et al., (2019) and Taylor, (2019).
Tests for equal (superior) predictive ability in the sense of Diebold and Mariano, (1995), Clark and McCracken, (2001), Giacomini and White, (2006), West, (2006) and the model confidence set approach of Hansen et al., (2011) can be seen as a general alternative to encompassing tests. As these tests are directly based on the average loss difference, they can only test the predictive ability of the VaR and ES jointly. In contrast, encompassing tests are based on the regression coefficients of the semiparametric quantile and ES models and hence, only indirectly on the respective loss function. This fundamental difference allows for stand-alone encompassing tests for ES forecasts, which constitutes a great advantage for ES encompassing tests.
Strictly speaking, strict consistency of loss functions only implies that the optimal forecast exhibits the smallest possible loss in expectation. In reality however, competing forecasts are often misspecified due to estimation error or misspecified forecasting models. Patton, (2019) shows that then, the ranking induced by the loss functions can be sensitive towards the choice of (strictly consistent) loss functions or even misleading. Holzmann and Eulert, (2014) show that for competing forecasts which are based on nested information sets and which are correctly specified given their underlying (but usually incomplete) information set (auto-calibrated), applying any strictly consistent loss function results in a correct ranking of the forecasts. In our case of testing forecast encompassing, we indeed build on nested information sets as it obviously holds that \sigma\big{\{}\hat{f}_{1,t},\hat{f}_{2,t}\big{\}}\supseteq\sigma\big{\{}\hat{f}_{1,t}\big{\}}. Thus, by further assuming that the issued forecasts are auto-calibrated given the forecaster’s information set, we can conclude that the ranking implied by (2.1) is indeed the correct one and invariant towards the choice of strictly consistent loss functions.
2.3 Asymptotic Theory under Model Misspecification
In the following, we use the short notation and (or in the case of the strict test). We define the M-estimator as
[TABLE]
and the pseudo-true parameter as777 The pseudo-true parameter can generally depend on the issued loss function, i.e. in this case on the zero homogeneous choice in (2.8).
[TABLE]
When the link (regression) functions and are correctly specified, we get that the pseudo-true parameter equals the classical true regression parameter and it is independent of the sample size . We further define the corresponding identification functions, which are almost surely the derivative of the loss function with respect to ,
[TABLE]
We restrict our attention to processes which satisfy the following conditions.
Assumption 2.7.
We assume that
- (a)
the process is strong mixing of size for some , 2. (b)
the parameter space is compact and non-empty, 3. (c)
the pseudo-true parameter defined in (2.19) is in the interior of and is the unique minimizer of the objective function and the sequence \mathbb{E}_{t}\big{[}\psi\big{(}Y_{t+1},g^{q}_{t}(\beta),g^{e}_{t}(\eta)\big{)}\big{]}, defined in (2.20) is uncorrelated, 4. (d)
the distribution of given , denoted by is absolutely continuous with continuous and strictly positive density , which is bounded from above almost surely on the whole support of and Lipschitz continuous, 5. (e)
for all in a neighborhood of , it holds that for some constant , 6. (f)
the link functions and are -measurable, twice continuously differentiable in on almost surely and if \mathbb{P}\big{(}g_{t}^{q}(\beta_{1})=g_{t}^{q}(\beta_{2})\cap g_{t}^{e}(\eta_{1})=g_{t}^{e}(\eta_{2})\big{)}=1, then , 7. (g)
the matrices and , defined in Proposition 2.9 are positive definite with a determinant bounded away from zero for all sufficiently large, 8. (h)
it holds that , , , , and , , , , for all in a neighborhood of , where the random variables are all -measurable and for some (from condition (a)), the following moments are bounded (i) , (ii) , (iii) , (iv) , (v) , (vi) , (vii) , (viii) , (ix) , (x) , (xi) , (xii) , (xiii) , (xiv) , (xv) , (xvi) , (xvii) , 9. (i)
for any , the term is almost surely bounded from above.
The following propositions show consistency and asymptotic normality of the M-estimator under potential model misspecification.
Proposition 2.8.
Given the conditions in Assumption 2.7, it holds that .
Proposition 2.9.
Given the conditions in Assumption 2.7, it holds that
[TABLE]
with , where , and \Sigma_{n}(\theta^{\ast}_{n})=\frac{1}{n}\sum_{t=m}^{T-1}\mathbb{E}\left[\psi\big{(}Y_{t+1},g^{q}_{t}(\beta^{\ast}_{n}),g^{e}_{t}(\eta^{\ast}_{n})\big{)}\cdot\psi\big{(}Y_{t+1},g^{q}_{t}(\beta^{\ast}_{n}),g^{e}_{t}(\eta^{\ast}_{n})\big{)}^{\top}\right]. Furthermore, the components of are given by
[TABLE]
where and are the Hessian matrices of and respectively.
The two preceding propositions extend the asymptotic theory of Patton et al., (2019) to the case of possibly misspecified models, and the misspecification theory for linear models of Bayer and Dimitriadis, (2020) to nonlinear models. The proofs in Appendix A combine, extend and go along the lines of the ideas of Engle and Manganelli, (2004) and Patton et al., (2019). The conditions closely resemble the regularity conditions of Patton et al., (2019). As we further allow for model misspecification, we impose the unique minimization condition (c) and slightly strengthen the moment conditions (h). In the baseline case of linear encompassing link functions and , the required moment conditions simplify to those given in Bayer and Dimitriadis, (2020).
For the estimation of the asymptotic covariance matrix under possible model misspecification, we follow the approach of Dimitriadis and Bayer, (2019) and Bayer and Dimitriadis, (2020). We deal with the three nuisance quantities in as follows. In order to estimate the density quantile function , we follow the nid-estimator of Hendricks and Koenker, (1992). As the degree of misspecification in the investigated financial time series is small (Bayer and Dimitriadis, , 2020), we approximate . For the conditional truncated variance, \operatorname{Var}_{t}\big{(}g^{q}_{t}(\beta^{\ast}_{n})-Y_{t+1}\big{|}Y_{t+1}\leq g^{q}_{t}(\beta^{\ast}_{n})\big{)}, we employ the scl-sp estimator of Dimitriadis and Bayer, (2019).
We now consider the asymptotic distributions of our three ES encompassing tests proposed in Section 2.2 under the null hypotheses and for general link functions, where we test certain -dimensional () sub-vectors of . For this, let be a selection matrix whose columns consist of -dimensional Cartesian unit (column) vectors , which are zero apart from a one in dimension . E.g., when and equal the linear link functions with intercept, given in the first point of Example 2.3, . Then, for the strict and auxiliary ES encompassing tests, and for the joint test . These choices pick the respective parameters from . Then, we define the respective test statistics by
[TABLE]
Theorem 2.10 (ES Encompassing Tests).
Given the conditions of Assumption 2.7 and given that , under the respective null hypotheses given in Definition 2.4 - 2.6, it holds that
[TABLE]
For linear link functions, this theorem implies that the limiting distribution of the joint test has four degrees of freedom, while the one of the strict and auxiliary tests has two degrees of freedom.
An important application of these ES encompassing tests is in the context of selecting the best-performing forecast, i.e. selecting at time a superior forecasting method for the future. This is particularly relevant as the ES is recently introduced into the Basel regulations without having proper forecast selection procedures at hand. Following Giacomini and Komunjer, (2005), we propose the following decision rule. We test the two encompassing hypotheses : encompasses and : encompasses for . Then, there are four possible scenarios: (1) if neither nor are rejected, the test is not helpful for forecast selection. (2) If is rejected while is not rejected, we can conclude that forecast does add information to forecast , while we cannot conclude the reverse. Thus, we decide to use the forecasting method of . (3) If is rejected while is not rejected, the same logic applies inversely and we use the forecasting method of . (4) If both, and are rejected, the test delivers statistical evidence that both forecasts contain exclusive information and that a forecast combination outperforms the stand-alone forecasts. Consequently, we use a combined forecast where the estimated combination parameters are obtained from the M-estimator proposed here.
Testing forecast encompassing conditional on some information set based on some -measurable vector of instruments in the sense of Giacomini and Komunjer, (2005) can be facilitated through estimating the regression parameters through (overidentified) GMM-estimation instead of M-estimation. However, for the strict ES test, this approach requires asymptotic theory under model misspecification for the overidentified GMM estimator based on nonsmooth objective functions. While such theory is available for smooth moment conditions (see e.g. Hall and Inoue, (2003) and Hansen and Lee, (2019)), its generalization to nonsmooth objective functions is not straight-forward and thus, we leave conditional ES encompassing tests based on misspecified GMM-estimation for future research. The moment conditions of our unconditional approach can be interpreted as conditional encompassing with respect to the instruments and . In the classical baseline case of linear forecast encompassing, these instruments simplify to and and thus, our approach tests conditional encompassing with respect to the information set , which in most cases already captures the most relevant information which is available.
3 Simulation Study
In this section, we evaluate the size and power properties of our three proposed ES encompassing tests and compare them to the VaR encompassing test of Giacomini and Komunjer, (2005). For this, we describe the simulation setup in Section 3.1 and we report and discuss the simulation results in Section 3.2. Section 3.3 considers three extensions of the simulation setup with respect to additional data generating processes (DGPs), loss and link functions.
3.1 The Simulation Setup
We employ the three encompassing tests based on the linear link functions and , where for the joint and auxiliary tests and for the strict test, together with the parameter space .888 We choose the constant large enough such that the parameter estimation is not restricted in realistic settings but the parameter space is indeed convex. For the respective encompassing tests, in each case we test the following two opposing hypotheses:
[TABLE]
In the following, we describe two DGPs where for the first, both forecasting models stem from classical GARCH models while the second considers two joint GAS models for the VaR and ES of Patton et al., (2019). For both model classes, we simulate data as a convex combination of two distinct models with a flexible convex combination weight . This implies that for , the first model encompasses the second, while for , the inverse holds. For all intermediate parameters , the data stems from a linear combination and both forecast encompassing null hypotheses should be rejected which indicates that a forecast combination method is preferred.
The GARCH DGP
The two GARCH models, which are calibrated to daily IBM returns, are given by , for , where and the two distinct volatility specifications are given by
[TABLE]
For both models, we obtain VaR and ES forecasts by and , for , where and are the -quantile and -ES of the standard normal distribution. Notice that the time index on indicates that it is a -measurable forecast for time . While the first specification in (3.2) is a classical GARCH(1,1) model (Bollerslev, , 1986), the second specification in (3.3) follows the GJR-GARCH model of Glosten et al., (1993), which allows for a leverage effect. We simulate data from the convex combination of these processes, Y_{t+1}=\big{(}(1-\pi)\hat{\sigma}_{1,t}+\pi\hat{\sigma}_{2,t}\big{)}u_{t+1} for 21 equally spaced values of , where .
The VaR/ES GAS DGP
In the second simulation setup, we implement the one-factor (1F) and two-factor (2F) GAS models for the VaR and ES of Patton et al., (2019). The 1F-GAS model evolves as
[TABLE]
The 2F-GAS model follows the specification
[TABLE]
where the forcing variable is given by \lambda_{t}=\big{(}\hat{q}_{2,t-1}(\alpha-\mathds{1}_{\{\tilde{Y}_{2,t}\leq\hat{q}_{2,t-1}\}}),\,\mathds{1}_{\{\tilde{Y}_{2,t}\leq\hat{q}_{2,t-1}\}}\tilde{Y}_{2,t}/\alpha-\hat{e}_{2,t-1}\big{)}^{\top}. For both models, , we simulate \tilde{Y}_{j,t+1}\sim\mathcal{N}\big{(}\hat{\mu}_{j,t},\hat{\sigma}_{j,t}^{2}\big{)}, where the conditional mean and standard deviations are given by and , such that and almost surely. The parameter values for this model are obtained from Table 8 of Patton et al., (2019) and correspond to calibrated parameters to daily S&P 500 returns.
In order to simulate returns which follow a convex combination of these two conditional distributions, we simulate Bernoulli draws for 21 equally spaced values of , and let . Thus, for , follows the 1F-GAS model, for , follows the 2F-GAS model and for , follows some convex combination of these two models.999 While generating returns stemming from convex combinations of GARCH-type volatility models is straight-forward by using convex combinations of the conditional volatilities, this is not as simple for the more general GAS models considered in this section. Consequently, we use this more involved approach based on Bernoulli draws in order to generate these convex model combinations.
Both models in the GARCH DGP generate data from a pure scale (volatility) process resulting in perfectly colinear VaR and ES forecasts. In contrast, the more general VaR/ES GAS models in the second DGP generate VaR and ES forecasts which are not colinear and consequently introduce misspecification in the quantile model of the strict ES encompassing test. As the utilized parameters are calibrated to daily financial returns, these models reflect a realistic degree of misspecification encountered in practical risk management.
3.2 Simulation Results
Table 3.2 reports the empirical sizes of the three different ES encompassing tests introduced in Section 2 together with the VaR encompassing test of Giacomini and Komunjer, (2005) at a 10 nominal significance level based on Monte Carlo replications. Table S.1 and S.2 in the supplementary material present equivalent results for nominal sizes of 1 and 5. The column panel indicates that we test whether model 1 encompasses model 2, while the panel indicates the reverse.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Andrews, (1999) Andrews, D. W. K. (1999). Estimation when a parameter is on a boundary. Econometrica , 67(6):1341–1383.
- 2Artzner et al., (1999) Artzner, P., Delbaen, F., Eber, J.-M., and Heath, D. (1999). Coherent Measures of Risk. Mathematical Finance , 9(3):203–228.
- 3Barendse, (2020) Barendse, S. (2020). Efficiently Weighted Estimation of Tail and Interquartile Expectations. Working Paper, available at https://drive.google.com/file/d/1n I 0QA Wb M_Vch AZD Vg 79p 2v Jc Ko Cr QB 8o/view .
- 4Basel Committee, (2013) Basel Committee (2013). Fundamental review of the trading book: A revised market risk framework. Technical report, Bank for International Settlements. Available at http://www.bis.org/publ/bcbs 265.pdf .
- 5Basel Committee, (2016) Basel Committee (2016). Minimum capital requirements for Market Risk. Technical report, Bank for International Settlements. Available at http://www.bis.org/bcbs/publ/d 352.pdf .
- 6Basel Committee, (2017) Basel Committee (2017). Pillar 3 disclosure requirements – consolidated and enhanced framework. Technical report, Basel Committee on Banking Supervision. Available at http://www.bis.org/bcbs/publ/d 400.pdf .
- 7Bayer and Dimitriadis, (2020) Bayer, S. and Dimitriadis, T. (2020). Regression based expected shortfall backtesting. Journal of Financial Econometrics (forthcoming) . available at https://arxiv.org/abs/1801.04112 , doi: 10.1093/jjfinec/nbaa 013 . · doi ↗
- 8Bollerslev, (1986) Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics , 31(3):307–327.
