Testing normality for unconditionally heteroscedastic macroeconomic variables
Hamdi Ra\"issi

TL;DR
This paper evaluates methods for testing normality in macroeconomic time series with heteroscedasticity, proposing a bootstrap approach to improve accuracy over traditional tests like Jarque-Bera.
Contribution
It introduces a bootstrap-based method to address size distortions in normality testing of heteroscedastic macroeconomic data, improving reliability.
Findings
Kernel smoothing approach can suffer from size distortion in macroeconomic samples.
Bootstrap methodology effectively corrects size distortions in normality tests.
Analysis of inflation data from US, Korea, and Australia demonstrates the method's practical utility.
Abstract
In this paper the testing of normality for unconditionally heteroscedastic macroeconomic time series is studied. It is underlined that the classical Jarque-Bera test (JB hereafter) for normality is inadequate in our framework. On the other hand it is found that the approach which consists in correcting the heteroscedasticity by kernel smoothing for testing normality is justified asymptotically. Nevertheless it appears from Monte Carlo experiments that such methodology can noticeably suffer from size distortion for samples that are typical for macroeconomic variables. As a consequence a parametric bootstrap methodology for correcting the problem is proposed. The innovations distribution of a set of inflation measures for the U.S., Korea and Australia are analyzed.
| 100 | 200 | 400 | 800 | |
|---|---|---|---|---|
| 4.0 | 5.2 | 4.9 | 4.2 | |
| 7.2 | 7.5 | 5.6 | 5.0 | |
| 4.5 | 5.6 | 5.0 | 4.7 |
| 100 | 200 | 400 | 800 | |
|---|---|---|---|---|
| 8.7 | 13.0 | 11.5 | 19.1 | |
| 9.4 | 9.2 | 8.3 | 7.8 | |
| 4.4 | 6.5 | 6.3 | 6.3 |
| 100 | 200 | 100 | 200 | |
|---|---|---|---|---|
| 11.3 | 14.0 | 12.0 | 14.3 | |
| 6.8 | 10.2 | 7.6 | 11.1 | |
| U.S. | Korea | Australia | |
|---|---|---|---|
| 3.8 | 16.4 | 50.9 | |
| 2.3 | 82.0 | 21.0 | |
| 8.2 | 87.0 | 49.0 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Testing normality for unconditionally heteroscedastic macroeconomic variables
Hamdi Raïssi111Instituto de Estadística, PUCV. Address: Av. Errazuriz 2734, Valparaíso, V región, Chile. E-mail: [email protected]. This paper was supported by CONICYT-FONDECYT under grant . The author gratefully acknowledge Steffen Grønneberg and Genaro Sucarrat for helpful discussions during his stay at the BI Norwegian Business School.
Abstract: In this paper the testing of normality for unconditionally heteroscedastic macroeconomic time series is studied. It is underlined that the classical Jarque-Bera test (JB hereafter) for normality is inadequate in our framework. On the other hand it is found that the approach which consists in correcting the heteroscedasticity by kernel smoothing for testing normality is justified asymptotically. Nevertheless it appears from Monte Carlo experiments that such methodology can noticeably suffer from size distortion for samples that are typical for macroeconomic variables. As a consequence a parametric bootstrap methodology for correcting the problem is proposed. The innovations distribution of a set of inflation measures for the U.S., Korea and Australia are analyzed.
Keywords: Unconditionally heteroscedastic time series; Jarque-Bera test.
JEL: C12, C15, C18
1 Introduction
In the econometric literature, the Jarque Bera (1980) test is routinely used to assess the normality of variables. The properties of this test are well documented for stationary conditionally heteroscedastic processes. For instance Fiorentini, Sentana and Calzolari (2003), Lee, Park and Lee (2010) and Lee (2012) investigated the JB test in the context of GARCH models. However few studies are available on the distributional specification of time series in presence of unconditional heteroscedasticity. Drees and Stărică (2002), Mikosch and Stărică (2004) and Fryźlewicz (2005) investigated the possibility of modelling financial returns by nonparametric methods. To this end, Drees and Stărică (2002) and Mikosch and Stărică (2004) examined the distribution of S&P500 returns corrected from heteroscedasticity. On the other hand Fryźlewicz (2005) pointed out that large sample kurtosis for financial time series may be explained by non constant unconditional variance. In general we did not found references that specifically address the problem of assessing the distribution of unconditionally heteroscedastic time series. Note that non constant variance constitutes an important pattern for time series in general, and macroeconomic variables in particular. Reference can be made to Sensier and van Dijk (2004) who found that most of the 214 U.S. macroeconomic time series they studied have a time-varying variance. In this paper we aim to provide a reliable methodology for testing normality for small samples time series with non constant unconditional variance.
The structure of the paper is as follows. In Section 2 we first set the dynamics ruling the observed process. In particular the unconditional heteroscedastic structure of the errors is given. The inadequacy of the standard JB test in our framework is highlighted. The approach consisting in correcting the errors from the heteroscedasticity for building a JB test is presented. We then introduce a parametric bootstrap procedure that is intended to improve the normality testing for unconditionally heteroscedastic macroeconomic time series. In Section 3 numerical experiments are conducted to shed some light on the finite sample behaviors of the studied tests. In particular it is found that when estimating the non constant variance structure by kernel smoothing, a correct bandwidth choice is a necessary condition for the good implementation of the normality tests based on heteroscedasticity correction. We illustrate our outputs examining the distributional properties of the U.S., Korean and Australian GDP implicit price deflators.
2 Testing normality in presence of unconditional heteroscedasticity
We consider processes which can be written as
[TABLE]
where are available, being the sample size and . The conditional mean of is driven by the autoregressive parameters . We make the following assumption on the conditional mean.
Assumption A0: The , , are such that for all , with .
In the assumption A1 below, the well known rescaling device introduced by Dahlhaus (1997) is used to specify the errors process . For a random variable we define , with and .
Assumption A1: We assume that where:
- (i)
for some constant , and satisfies , where is a measurable deterministic function on the interval , such that . The function satisfies a Lipschitz condition piecewise on a finite number of some sub-intervals that partition .
- (ii)
The process is iid and such that , and for some .
The non constant variance induced by A1(i) allows for a wide range of non stationarity patterns commonly faced in practice, as for instance abrupt shifts, smooth changes or cyclical behaviors. Note that in the zero mean AR case, tools needed to carry out the Box and Jenkins specification-estimation-validation modeling cycle, are provided in Patilea and Raïssi (2013) and Raïssi (2015). For define the estimator , and for any . Writing , it can be shown that
[TABLE]
using the Beveridge-Nelson decomposition. Now let
[TABLE]
where
[TABLE]
with . With these notations define the OLS estimator and the unfeasible estimator . Straightforward computations give , so that using the results of Patilea and Raïssi (2012) we have
[TABLE]
Once the conditional mean is filtered in accordance to (2.2) and (2.4), we can proceed to the test of the following hypotheses:
[TABLE]
with the usual slight abuse of interpretation inherent of the use JB test for normality testing. Clearly the skewness and kurtosis of correspond to those of . However in practice and is checked using the JB test statistic:
[TABLE]
where
[TABLE]
with and . The ’s are the residuals obtained from the estimation step. Let us denote by convergence in distribution. If we suppose the process homoscedastic ( is constant), then the standard result is retrieved (see Yu (2007), Section 2.2). However under A0 and A1 with non constant (the unconditionally heteroscedastic case) we have:
[TABLE]
where . Hence if we suppose the errors process unconditionally heteroscedastic with , we have for some strictly positive constant . As a consequence, the classical JB test will tend to detect spuriously departures from the null hypothesis of a normal distribution in our framework. This argument was used by Fryźlewicz (2004) to underline that unconditionally heteroscedastic specifications can cover financial time series that typically exhibit an excess of kurtosis.
In order to assess the distribution of S&P500 returns, Drees and Stărică (2002) considered data corrected from heteroscedasticity, using a kernel estimator of the variance. We will follow this approach in the sequel considering
[TABLE]
with , if and where is a kernel function on the real line and is the bandwidth. The following assumption is needed for our variance estimator.
Assumption A2: (i) The kernel is a bounded density function defined on the real line such that is nondecreasing on and decreasing on and . The function is differentiable except a finite number of points and the derivative satisfies Moreover, the Fourier Transform of satisfies for some .
(ii) The bandwidth is taken in the range with and as , for some small .
Let . We are now ready to consider the following JB test statistic:
[TABLE]
where
[TABLE]
with . The following proposition gives the asymptotic distribution of .
Proposition 1**.**
Under the assumptions A0, A1 and A2, we have as
[TABLE]
uniformly with respect to .
Proposition 1 can be proved using the same arguments given in Yu (2007), together with those of the proof of Proposition 4 in Patilea and Raïssi (2014). Therefore we skip the proof. For building a test using the above result, we will consider the normal kernel in the next section. On the other hand we suggest to choose a bandwidth by minimizing the cross-validation (CV) criterion (see Wasserman (2006,p69-70)), unless otherwise specified. The test obtained using (2.7) and the above settings will be denoted by . The standard test, that does not take into account the unconditional heteroscedasticity, is denoted by .
For high frequency time series it is reasonable to suppose that the approximation (2.7) is satisfactory when the bandwidth is carefully chosen. Nevertheless considering the above sophisticated procedure for small is questionable. Therefore we propose to apply the following parametric bootstrap algorithm inspired from Francq and Zakoïan (2010,p335).
- 1-
Generate , , build the bootstrap errors , and the bootstrap series using (2.1), but with and (see (2.2) and (2.3)).
- 2-
Estimate the autoregressive parameters and a constant as in (2.1), but using the ’s. Build the kernel estimators from the resulting residuals .
- 3-
Compute for . Compute .
- 4-
Repeat the steps 1 to 3 B times for some large B. Use the ’s to compute the p-values of the bootstrap JB test.
The test obtained using the above parametric bootstrap procedure is denoted by .
3 Numerical illustrations
The finite sample properties of the , and tests are first examined by means of Monte Carlo experiments. The distribution of the U.S., Korean and Australian GDP implicit price deflator is then investigated. Throughout this section the asymptotic nominal level of the tests is 5%. In the sequel, we fixed .
3.1 Monte Carlo experiments
We simulate trajectories of AR(1) processes:
[TABLE]
where and with iid(0,1). Under the null hypothesis we set . On the other hand under the alternative hypothesis is taken, with , , , and being mutually independent. In order to study the case where the series are actually homoscedastic, we set . For the heteroscedastic case, the variance structure is given by
[TABLE]
In such situation the variance structure exhibits a global monotone behavior together with a cyclical/seasonal component that is common in macroeconomic data (see e.g. Trimbur, and Bell (2010) for seasonal effects in the variance). In all our experiments, the mean in (3.1) is treated as unknown. More precisely the AR parameter in (3.1) is estimated using , where is given in (2.2), and then the resulting centered residuals are used to compute the test statistics.
The outputs obtained under the null hypothesis are first analyzed. The results are given in Table 1 for the homoscedastic case and in Table 2 for the heteroscedastic case. Noting that macroeconomic time series with noticeable heteroscedasticity are relatively large but smaller than in general, a special emphasis will be put on interpreting results for samples . Since processes are simulated, and under the hypothesis that the finite sample size of a given test is , the relative rejection frequencies should be between the significant limits 3.65% and 6.35% with probability 0.95. The outputs outside these confidence bands will be displayed in bold type.
From Table 1, it appears that the is oversized for small samples ( and ). This could be explained by the fact that this test is too much sophisticated for the standard case. When the samples are increased the relative rejection frequencies become close to the 5% ( and ). On the other hand the and tests have good results for all the samples. Of course if there is no evidence of heteroscedasticity, the simple should be used. However Table 1 reveals that in case of doubt, the use of the is a good alternative.
In the heteroscedastic case, it is seen from Table 2 that the test fails to control the type I error as . This was expected from (2.6). Next it seems that the relative rejection frequencies of the test are somewhat far from the nominal level 5%, even when . From Table 2 it also emerges that the control reasonably well the type I error. Therefore we can draw the conclusion that the gives a substantial improvement for samples that are typical for heteroscedastic macroeconomic variables.
Note that the test have better results for larger samples (). For instance conducting similar experiences to those of Table 2, we obtained 7.4% rejections for and 6.9% rejections for . Hence the potential improvements of the in comparison to the should become slight as . For this reason if high frequency time series are analyzed, the should certainly be preferred to the computationally intensive .
In general it is important to point out that the bandwidth must be carefully selected to ensure a good implementation of the and tests. Its turns out from our experiments that selecting the bandwidth by cross-validation leads to relatively correct results. Indeed we found that the rejection frequencies of the converge to the 5%, and that the rejection frequencies of the remain close to the nominal level in such a case. However other choices can deteriorate the control of the type I errors. For instance let us consider the test which consists in correcting the heteroscedasticity, but with fixed bandwidth as , where is the sample variance and is a constant. The corresponding bootstrap test will be denoted by . Here the normal kernel is kept. We only study the heteroscedastic case. The results given in Table 3 show that the rejection frequencies are strongly affected by this way of selecting the bandwidth.
Finally let us point out that when the heteroscedastic structure is relatively easy to estimate (for instance if the sinus part is removed in (3.2)), we found better results (not displayed here) for the and tests in comparison to those of Table 2.
Now we turn to the analysis of the behavior of the tests under the alternative hypothesis. For a fair comparison we only studied the and in the homoscedastic case. The sample size is fixed and recall that the parameter defines the departures from the null hypothesis. The outputs of our simulations, displayed in Figure 1, show that the test does not suffer from a lack of power in comparison to the . In conclusion it turns out that the improves the distribution analysis, in the sense that it ensures a good control of the type I error, but without entailing noticeable loss of power.
3.2 Real data analysis
The inflation measures data are commonly used to analyze macroeconomic facts. Reference can be made to the numerous empirical papers studying the relation between price levels and money supply (see e.g. Jones and Uri (1986)). On the other hand inflation is of great importance in finance, as many central banks adjust their interest rates in view of targeting a certain inflation level. Accordingly, constructing valid confidence intervals for inflation forecasts may be often crucial. In such kind of investigations clearly the distributional analysis can help to build a model for the data. In a stationary setting, authors aimed to detected ARCH effects assessing asymmetry and/or leptokurticity in inflation variables following Engle (1982) (see Broto and Ruiz (2008p22) among others). In the same way, it is reasonable to think that a test for normality taking into account the time-varying variance, can help to choose between a deterministic specification, as in A1, and the case where in addition to unconditional heteroscedasiticity, second order dynamics are present (as in the case of spline-GARCH processes introduced by Engle and Rangel (2008)). In other words, once the unconditional heteroscedasiticity is removed from , the JB tests can help to decide whether ARCH effects are present or not in .
In this part we will study the normality of the log differences of the quarterly GDP implicit price deflators for the U.S., Korea and Australia from 10/01/1983 to 01/01/2017 (). More precisely we use . The data can be downloaded from the webpage of the research division of the federal reserve bank of Saint Louis: https://fred.stlouisfed.org. The studied variables plotted in Figure 2 seem to show cyclical heteroscedasticity. In the case of Korea we can suspect a global decreasing behavior leading to a stabilization after the Asian crisis. The times series are first filtered according to (2.1). The non correlation of the residuals is tested using the adaptive portmanteau test of Patilea and Raïssi (2013). On the other hand we applied tests for second order dynamics developed by Patilea and Raïssi (2014). The outputs (not displayed here) show that the hypothesis of no ARCH effects cannot be rejected. Hence the deterministic specification of the time-varying variance in A1 seems valid. Once the linear dynamics of the series seem captured in an appropriate way, the tests considered in this paper are applied to the residuals. The results are given in Table 4. When the null hypothesis of normality is rejected at the 5% level, the p-value is displayed in bold type. It emerges that the outputs of the test are in general clearly different from those of the and tests. The p-values of the are all lower than those of the . Note that in the case of the U.S. GDP implicit price deflator, the difference between the on one hand, and the , tests on the other hand, lead to different conclusions. In view of the outputs obtained from the simulations experiments, it is reasonable to decide that the normality assumption cannot be rejected for the U.S. data. It is likely that rejecting normality will suggest more sophisticated models, and could entail misspecifications for the confidence intervals of the forecasts by fitting a heavy tailed distribution to the U.S. data.
References
Broto, C., and Ruiz, E. (2008) Testing for conditional heteroscedasticity in the components of inflation. Working document, banco de España.
Dahlhaus, R. (1997) Fitting time series models to nonstationary processes. Annals of Statistics 25, 1-37.
Drees, H., and Stărică, C. (2002) A simple non stationary model for stock returns. Preprint. Universität des Saaland.
Engle, R.F. (1982) Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica 50, 987-1007.
Engle, R.F., and Rangel, J.G. (2008) The spline GARCH model for unconditional volatility and its global macroeconomic causes. Review of Financial Studies 21, 1187-1222.
Fiorentini, G., Sentana, E., Calzolari, G. (2004) On the validity of the Jarque-Bera normality test in conditionally heteroscedastic dynamic regression models. Economic Letters 83, 307-312.
Francq, C., and Zakoïan, J-M. (2010) GARCH models : structure, statistical inference, and financial applications. Wiley, Chichester.
Fryźlewicz, P. (2005) Modelling and forecasting financial log-returns as locally stationary wavelet processes. Journal of Applied Statistics 32, 503-528.
Jarque, C.M., and Bera, A.K. (1980) Efficient tests for normality, homoscedasticity and serial independence of regression residuals. Economics Letters 6, 255-259.
Jones, J.D., and Uri, N. (1986) Money, inflation and causality (another look at the empirical evidence for the USA, 1953-84). Applied Economics 19, 619-634.
Lee, S., Park, S., and Lee, T. (2010) A note on the Jarque-Bera normality test for GARCH innovations. Journal of the Korean Statistical Society 39, 93-102.
Lee, T. (2010) A note on Jarque-Bera normality test for ARMA-GARCH innovations. Journal of the Korean Statistical Society 41, 37-48.
Mikosch, T., and Stărică, C. (2004) Stock market risk-return inference. An unconditional non-parametric approach. Research report, the Danish national research foundation: Network in Mathematical Physics and Stochastics.
Patilea, V., and Raïssi, H. (2012) Adaptive estimation of vector autoregressive models with time-varying variance: application to testing linear causality in mean. Journal of Statistical Planning and Inference 142, 2891-2912.
Patilea, V., and Raïssi, H. (2013) Corrected portmanteau tests for VAR models with time-varying variance. Journal of Multivariate Analysis 116, 190-207.
Patilea, V., and Raïssi, H. (2014) Testing second order dynamics for autoregressive processes in presence of time-varying variance. Journal of the American Statistical Association 109, 1099-1111.
Sensier, M., and van Dijk, D. (2004) Testing for volatility changes in U.S. macroeconomic time series. Review of Economics and Statistics 86, 833-839.
Trimbur, T.M., and Bell, W.R. (2010) Seasonal heteroscedasticity in time series data: modeling, estimation, and testing. In W. Bell, S. Holan, and T. Mc Elroy (Eds.), Economic Time Series: Modelling and Seasonality. Chapman and Hall, New York.
Wasserman, L. (2006) All of Nonparametric Statistics. Springer, New-York.
Raïssi, H. (2015) Autoregressive order identification for VAR models with non-constant variance. Communications in Statistics: Theory and Methods 44, 2059-2078.
Yu, H. (2007) High moment partial sum processes of residuals in ARMA models and their applications. Journal of Time Series Analysis 28, 72-91.
Tables and Figures
