Nonparametric volatility change detection

Maria Mohr; Natalie Neumeyer

arXiv:1906.02996·math.ST·June 10, 2019

Nonparametric volatility change detection

Maria Mohr, Natalie Neumeyer

PDF

TL;DR

This paper introduces nonparametric tests for detecting changes in the conditional variance of heteroscedastic time series, combining empirical process methods with classical CUSUM tests, and demonstrates their effectiveness through simulations and real data.

Contribution

It proposes new nonparametric change detection tests for variance functions that are consistent and asymptotically distribution-free in certain cases.

Findings

01

Tests are consistent against general alternatives.

02

Asymptotically distribution-free for univariate covariates.

03

Good performance demonstrated in simulations and exchange rate data.

Abstract

We consider a nonparametric heteroscedastic time series regression model and suggest testing procedures to detect changes in the conditional variance function. The tests are based on a sequential marked empirical process and thus combine classical CUSUM tests with marked empirical process approaches known from goodness-of-fit testing. The tests are consistent against general alternatives of a change in the conditional variance function, a feature that classical CUSUM tests are lacking. We derive a simple limiting distribution and in the case of univariate covariates even obtain asymptotically distribution-free tests. We demonstrate the good performance of the tests in a simulation study and consider exchange rate data as a real data application.

Figures1

Click any figure to enlarge with its caption.

Tables2

Table 1. Table 1: Rejection frequencies in model 1

$s_{0}$	$n$	$T_{n 1}$	$T_{n 2}$	$K S$	$C M$	$T_{n 1}$	$T_{n 2}$	$K S$	$C M$
		model 1 (a)				model 1 (b)
$0$	$100$	$0.035$	$0.058$	$0.046$	$0.042$	$0.052$	$0.064$	$0.059$	$0.047$
	$300$	$0.053$	$0.073$	$0.058$	$0.048$	$0.056$	$0.068$	$0.064$	$0.055$
	$500$	$0.057$	$0.071$	$0.062$	$0.055$	$0.062$	$0.064$	$0.059$	$0.043$
$0.25$	$100$	$0.041$	$0.080$	$0.052$	$0.048$	$0.053$	$0.081$	$0.060$	$0.054$
	$300$	$0.112$	$0.157$	$0.072$	$0.062$	$0.099$	$0.155$	$0.072$	$0.051$
	$500$	$0.187$	$0.266$	$0.069$	$0.050$	$0.216$	$0.294$	$0.097$	$0.080$
$0.50$	$100$	$0.063$	$0.122$	$0.053$	$0.057$	$0.068$	$0.120$	$0.073$	$0.066$
	$300$	$0.210$	$0.276$	$0.091$	$0.073$	$0.199$	$0.279$	$0.081$	$0.068$
	$500$	$0.413$	$0.521$	$0.097$	$0.067$	$0.428$	$0.510$	$0.096$	$0.074$
$0.75$	$100$	$0.055$	$0.092$	$0.074$	$0.067$	$0.051$	$0.084$	$0.061$	$0.066$
	$300$	$0.130$	$0.174$	$0.079$	$0.053$	$0.120$	$0.196$	$0.080$	$0.068$
	$500$	$0.222$	$0.291$	$0.086$	$0.074$	$0.239$	$0.304$	$0.096$	$0.074$
$1$	$100$	$0.045$	$0.072$	$0.062$	$0.057$	$0.046$	$0.076$	$0.055$	$0.055$
	$300$	$0.053$	$0.064$	$0.056$	$0.041$	$0.076$	$0.088$	$0.081$	$0.064$
	$500$	$0.063$	$0.071$	$0.072$	$0.050$	$0.064$	$0.070$	$0.064$	$0.051$

Table 2. Table 2: Rejection frequencies in model 2

$s_{0}$	$n$	$T_{n 1}$	$T_{n 2}$	$K S$	$C M$	$T_{n 1}$	$T_{n 2}$	$K S$	$C M$
		model 2 (a)				model 2 (b)
$0$	$100$	$0.036$	$0.066$	$0.041$	$0.045$	$0.039$	$0.070$	$0.046$	$0.045$
	$300$	$0.056$	$0.066$	$0.062$	$0.041$	$0.040$	$0.054$	$0.046$	$0.045$
	$500$	$0.059$	$0.064$	$0.070$	$0.056$	$0.059$	$0.074$	$0.066$	$0.064$
$0.25$	$100$	$0.054$	$0.094$	$0.086$	$0.079$	$0.068$	$0.096$	$0.080$	$0.081$
	$300$	$0.165$	$0.209$	$0.218$	$0.214$	$0.153$	$0.202$	$0.200$	$0.194$
	$500$	$0.317$	$0.365$	$0.405$	$0.376$	$0.274$	$0.338$	$0.364$	$0.350$
$0.50$	$100$	$0.086$	$0.134$	$0.123$	$0.137$	$0.100$	$0.141$	$0.143$	$0.142$
	$300$	$0.414$	$0.433$	$0.507$	$0.470$	$0.423$	$0.438$	$0.510$	$0.470$
	$500$	$0.743$	$0.746$	$0.829$	$0.780$	$0.748$	$0.746$	$0.809$	$0.782$
$0.75$	$100$	$0.076$	$0.110$	$0.109$	$0.115$	$0.082$	$0.128$	$0.115$	$0.119$
	$300$	$0.329$	$0.361$	$0.410$	$0.376$	$0.340$	$0.353$	$0.402$	$0.368$
	$500$	$0.655$	$0.636$	$0.724$	$0.667$	$0.631$	$0.614$	$0.705$	$0.651$
$1$	$100$	$0.049$	$0.065$	$0.054$	$0.050$	$0.044$	$0.082$	$0.053$	$0.048$
	$300$	$0.069$	$0.063$	$0.068$	$0.054$	$0.054$	$0.073$	$0.063$	$0.051$
	$500$	$0.064$	$0.075$	$0.081$	$0.052$	$0.056$	$0.065$	$0.061$	$0.045$

Equations118

Y_{t} = m (X_{t}) + U_{t},

Y_{t} = m (X_{t}) + U_{t},

U_{t} = σ_{t} (X_{t}) ε_{t}, t \in Z,

U_{t} = σ_{t} (X_{t}) ε_{t}, t \in Z,

Var (Y_{t} ∣ X_{t}) = E [U_{t}^{2} ∣ X_{t}] = σ_{t}^{2} (X_{t}) \mbox a . s .

Var (Y_{t} ∣ X_{t}) = E [U_{t}^{2} ∣ X_{t}] = σ_{t}^{2} (X_{t}) \mbox a . s .

H_{0} : σ_{t}^{2} (\cdot) = σ^{2} (\cdot), t = 1, \dots, n,

H_{0} : σ_{t}^{2} (\cdot) = σ^{2} (\cdot), t = 1, \dots, n,

\hat{T}_{n} (s, z) = \frac{1}{n} t = 1 \sum ⌊ n s ⌋ ((Y_{t} - \overset{m}{^}_{n} (X_{t}))^{2} - \overset{σ}{^}_{n}^{2} (X_{t})) ω_{n} (X_{t}) I {X_{t} \leq z}

\hat{T}_{n} (s, z) = \frac{1}{n} t = 1 \sum ⌊ n s ⌋ ((Y_{t} - \overset{m}{^}_{n} (X_{t}))^{2} - \overset{σ}{^}_{n}^{2} (X_{t})) ω_{n} (X_{t}) I {X_{t} \leq z}

\overset{m}{^}_{n} (x)

\overset{m}{^}_{n} (x)

\overset{σ}{^}_{n}^{2} (x)

\overset{σ}{^}_{n}^{2} (x)

T_{n 1} := z \in R^{d} sup s \in [0, 1] sup \hat{T}_{n} (s, z)

T_{n 1} := z \in R^{d} sup s \in [0, 1] sup \hat{T}_{n} (s, z)

U_{t}^{2} = σ_{t}^{2} (X_{t}) + ξ_{t}, t \in Z,

U_{t}^{2} = σ_{t}^{2} (X_{t}) + ξ_{t}, t \in Z,

(Y_{t} - \overset{m}{^}_{n} (X_{t}))^{2} - \overset{σ}{^}_{n}^{2} (X_{t}) =: \hat{ξ}_{t}

(Y_{t} - \overset{m}{^}_{n} (X_{t}))^{2} - \overset{σ}{^}_{n}^{2} (X_{t}) =: \hat{ξ}_{t}

T_{n} (s, z) = \frac{1}{n} t = 1 \sum ⌊ n s ⌋ ξ_{t} I {X_{t} \leq z}, s \in [0, 1], z \in R^{d},

T_{n} (s, z) = \frac{1}{n} t = 1 \sum ⌊ n s ⌋ ξ_{t} I {X_{t} \leq z}, s \in [0, 1], z \in R^{d},

\mbox Cov (G (s_{1}, z_{1}), G (s_{2}, z_{2})) = (s_{1} \land s_{2}) Σ (z_{1} \land z_{2}),

\mbox Cov (G (s_{1}, z_{1}), G (s_{2}, z_{2})) = (s_{1} \land s_{2}) Σ (z_{1} \land z_{2}),

\mbox Cov (G_{0} (s_{1}, z_{1}), G_{0} (s_{2}, z_{2})) = (s_{1} \land s_{2} - s_{1} s_{2}) Σ (z_{1} \land z_{2}) .

\mbox Cov (G_{0} (s_{1}, z_{1}), G_{0} (s_{2}, z_{2})) = (s_{1} \land s_{2} - s_{1} s_{2}) Σ (z_{1} \land z_{2}) .

T_{n 1} n \to \infty \to D z \in R^{d} sup s \in [0, 1] sup ∣ G_{0} (s, z) ∣ .

T_{n 1} n \to \infty \to D z \in R^{d} sup s \in [0, 1] sup ∣ G_{0} (s, z) ∣ .

T = s \in [0, 1] sup t \in [0, 1] sup ∣ K_{0} (s, t) ∣

T = s \in [0, 1] sup t \in [0, 1] sup ∣ K_{0} (s, t) ∣

\overset{c}{^}_{n} := \frac{1}{n} i = 1 \sum n ((Y_{i} - \overset{m}{^}_{n} (X_{i}))^{2} - \overset{σ}{^}_{n}^{2} (X_{i}))^{2} ω_{n} (X_{i}),

\overset{c}{^}_{n} := \frac{1}{n} i = 1 \sum n ((Y_{i} - \overset{m}{^}_{n} (X_{i}))^{2} - \overset{σ}{^}_{n}^{2} (X_{i}))^{2} ω_{n} (X_{i}),

H_{1} : \exists s_{0} \in (0, 1) : σ_{n, t}^{2} (\cdot) = {σ_{(1)}^{2} (\cdot), σ_{(2)}^{2} (\cdot), t = 1, \dots, ⌊ n s_{0} ⌋ t = ⌊ n s_{0} ⌋ + 1, \dots, n,

H_{1} : \exists s_{0} \in (0, 1) : σ_{n, t}^{2} (\cdot) = {σ_{(1)}^{2} (\cdot), σ_{(2)}^{2} (\cdot), t = 1, \dots, ⌊ n s_{0} ⌋ t = ⌊ n s_{0} ⌋ + 1, \dots, n,

Y_{n, t} = m (X_{n, t}) + U_{n, t}, t = 1, \dots, n,

Y_{n, t} = m (X_{n, t}) + U_{n, t}, t = 1, \dots, n,

\overset{σ}{ˉ}_{n}^{2} (x)

\overset{σ}{ˉ}_{n}^{2} (x)

(- \infty, z] \int (σ_{(1)}^{2} (u) - σ_{(2)}^{2} (u)) \overset{ˉ}{f}^{(s_{0})} (u) (1 - \frac{f ˉ ^{(s_{0})} ( u )}{f ˉ ^{(1)} ( u )}) d u,

(- \infty, z] \int (σ_{(1)}^{2} (u) - σ_{(2)}^{2} (u)) \overset{ˉ}{f}^{(s_{0})} (u) (1 - \frac{f ˉ ^{(s_{0})} ( u )}{f ˉ ^{(1)} ( u )}) d u,

\int (σ_{(1)}^{2} (u) - σ_{(2)}^{2} (u)) \overset{ˉ}{f}^{(s_{0})} (u) (1 - \frac{f ˉ ^{(s_{0})} ( u )}{f ˉ ^{(1)} ( u )}) d u,

\int (σ_{(1)}^{2} (u) - σ_{(2)}^{2} (u)) \overset{ˉ}{f}^{(s_{0})} (u) (1 - \frac{f ˉ ^{(s_{0})} ( u )}{f ˉ ^{(1)} ( u )}) d u,

Y_{t} = m (X_{t}) + σ_{t} (X_{t}) ε_{t}, ε_{t} \sim N (0, 1),

Y_{t} = m (X_{t}) + σ_{t} (X_{t}) ε_{t}, ε_{t} \sim N (0, 1),

σ_{t} (x) = {0.5 exp (- 0.2 x), 0.5 exp (0.2 x), t = 1, \dots, ⌊ n s_{0} ⌋ t = ⌊ n s_{0} ⌋ + 1, \dots, n,

Y_{t} = m (Y_{t - 1}) + σ (Y_{t - 1}) ε_{t}, ε_{t} \sim N (0, 1),

Y_{t} = m (Y_{t - 1}) + σ (Y_{t - 1}) ε_{t}, ε_{t} \sim N (0, 1),

σ_{t} (x) = {0.1 + 0.1 x^{2}, 0.1 + 0.7 x^{2}, t = 1, \dots, ⌊ n s_{0} ⌋ t = ⌊ n s_{0} ⌋ + 1, \dots, n .

p_{n}

p_{n}

0 < q_{n}

max {x \in [- c_{n} - 2 h_{n} C, c_{n} + 2 h_{n} C]^{d} sup D^{k} m (x), x \in [- c_{n} - 2 h_{n} C, c_{n} + 2 h_{n} C]^{d} sup D^{k} σ (x)} = O (q_{n}) .

max {x \in [- c_{n} - 2 h_{n} C, c_{n} + 2 h_{n} C]^{d} sup D^{k} m (x), x \in [- c_{n} - 2 h_{n} C, c_{n} + 2 h_{n} C]^{d} sup D^{k} σ (x)} = O (q_{n}) .

(\frac{lo g n}{n h _{n}^{d + 2 (l + 1)}} + h_{n}^{r} p_{n}) p_{n}^{l + 1} δ_{n}^{l + 2} = O (1),

(\frac{lo g n}{n h _{n}^{d + 2 (l + 1)}} + h_{n}^{r} p_{n}) p_{n}^{l + 1} δ_{n}^{l + 2} = O (1),

(\frac{lo g n}{n h _{n}^{d + 2 (l + 1)}} + h_{n}^{r} p_{n}) p_{n}^{l + η} q_{n}^{2} δ_{n}^{l + 1 + η} = o (1) .

(\frac{lo g n}{n h _{n}^{d + 2 (l + 1)}} + h_{n}^{r} p_{n}) p_{n}^{l + η} q_{n}^{2} δ_{n}^{l + 1 + η} = o (1) .

\frac{( lo g n ) ^{3 + \frac{d}{l + η}}}{n ^{1 - \frac{d}{l + η}} h _{n}^{d}} q_{n}^{3} δ_{n}^{2} = o (1), \frac{lo g h _{n}}{n h _{n}^{d}} = o (1), n h_{n}^{r} p_{n} q_{n}^{2} = o (1), (lo g n)^{3} h_{n} q_{n}^{3} = o (1)

\frac{( lo g n ) ^{3 + \frac{d}{l + η}}}{n ^{1 - \frac{d}{l + η}} h _{n}^{d}} q_{n}^{3} δ_{n}^{2} = o (1), \frac{lo g h _{n}}{n h _{n}^{d}} = o (1), n h_{n}^{r} p_{n} q_{n}^{2} = o (1), (lo g n)^{3} h_{n} q_{n}^{3} = o (1)

Y_{t} = a_{1} Y_{t - 1} + \dots + a_{d} Y_{t - d} + (b_{0} + b_{1} Y_{t - 1}^{2} + \dots + b_{d} Y_{t - d}^{2})^{1/2} ε_{t}, t \in Z,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Nonparametric volatility change detection

Maria Mohr and Natalie Neumeyer

Department of Mathematics, University of Hamburg

Abstract

We consider a nonparametric heteroscedastic time series regression model and suggest testing procedures to detect changes in the conditional variance function. The tests are based on a sequential marked empirical process and thus combine classical CUSUM tests with marked empirical process approaches known from goodness-of-fit testing. The tests are consistent against general alternatives of a change in the conditional variance function, a feature that classical CUSUM tests are lacking. We derive a simple limiting distribution and in the case of univariate covariates even obtain asymptotically distribution-free tests. We demonstrate the good performance of the tests in a simulation study and consider exchange rate data as a real data application.

Key words: change point, conditional variance function, CUSUM, heteroscedasticity, kernel estimation, Kolmogorov-Smirnov test, marked empirical process, structural change

AMS 2010 Classification: Primary 62M10, Secondary 62G08, 62G10

1 Introduction

The paper is concerned with the investigation of structural stability of the conditional variance function (volatility function) in nonparametric heteroscedastic time series regression models. Those models have gained much attention over the last decades and contain as special cases nonparametric AR-ARCH models, which are also called nonparametric CHARN (conditional heteroscedastic autoregressive nonlinear) models; see Fan and Yao (2003) or Gao (2007) for overviews. They have been successfully applied to model econometric time series such as foreign exchange rates or stock market indices, see e.g. Yang et al. (1999) and Zhao and Wu (2008). Here tests for structural changes in the volatility function are of special importance.

A lot of research has been devoted to the parametric case, notably for ARCH and GARCH models. Among others, Kokoszka and Leipus (1999) suggested a CUSUM type test for parameter stability in ARCH models, while Kulperger and Yu (2005) considered partial sums of higher powers of residuals to test for a parameter change in GARCH models. Berkes et al. (2004) considered tests for parameter stability in GARCH models based on likelihood ratios. Kengne’s (2012) test, which is based on quasi likelihood estimators, is applicable to more general parametric causal time series models. Lee and Lee (2014) suggested a residual based CUSUM test for change points in parametric AR-GARCH models, while Lee and Song (2008) and Song and Kang (2018) considered ARMA-GARCH models. Very few results are available in the nonparametric framework. Chen et al. (2005) studied a nonparametric heteroscedastic time series model with a scale change in volatility. However, they assume a compact support of regressors, which is problematic when considering autoregression models. Tests for change points in the unconditional variance in time series models have been considered as well. Lee et al. (2003) considered parametric autoregression models, as well as fixed design nonparametric regression models with strongly mixing errors using a CUSUM testing procedure. Chen and Tian (2014) constructed a ratio test for change point detection in the variance in random design nonparametric regression models. However, their test does not allow for autoregressive effects, as a compact support of regressors is assumed. A related strand of the literature deals with change point detection in the error distribution of a time series regression model. In the parametric framework Koul (1996) considered non-linear regression models and Ling (1998) non-stationary AR models, to just mention a few, while Selk and Neumeyer (2013) considered nonparametric heteroscedastic autoregression models.

Recently, Mohr and Neumeyer (2019) suggested a test for change point in the regression function in nonparametric time series models. They combine traditional CUSUM tests as considered by Hidalgo (1995), Honda (1997) and Su and Xiao (2008) in the nonparametric context with the marked empirical process approach originally suggested by Stute (1997) and widely used in the goodness-of-fit literature. Compared with the CUSUM approach the new test shows better power properties, in theory as well as in finite sample simulations. In the paper at hand we will modify the CUSUM marked empirical process test in order to test for a change point in the conditional volatility function. We obtain tests with very simple limiting distributions, which are consistent against general fixed alternatives. In the case of univariate covariates one can even obtain tests that are asymptotically distribution-free.

The paper is organized as follows. In section 2 we define the process on which the test statistics are built. In section 3 we give the limiting distribution of the process under the null hypothesis of no change in the variance function. We further discuss consistency against fixed alternatives of one change point. In section 4 we describe a simulation study and discuss a real data example of currency exchange rates. Section 5 concludes the paper, whereas in the appendix we list the regularity assumptions and prove the asymptotic results.

2 The model and test statistic

Consider a strictly stationary and strongly mixing time series $(Y_{t},\bm{X}_{t})$ , $t\in\mathbb{Z}$ , following the nonparametric model

[TABLE]

where $E[U_{t}|\mathcal{F}^{t}]=0$ a.s. for the sigma-field $\mathcal{F}^{t}=\sigma(U_{j-1},\bm{X}_{j}:j\leq t)$ , and $m:\mathbb{R}^{d}\to\mathbb{R}$ does not depend on $t$ . Further, let the following representation for the innovations $U_{t}$ hold,

[TABLE]

for some functions $\sigma_{t}:\mathbb{R}^{d}\to\mathbb{R}$ and an i.i.d. sequence $(\varepsilon_{t})_{t\in\mathbb{Z}}$ , such that $\varepsilon_{t}$ is independent of $\bm{X}_{j}$ for all $j\leq t$ and fulfills $E[\varepsilon_{1}]=0$ , $E[\varepsilon_{1}^{2}]=1$ and $E[\varepsilon_{1}^{4}]<\infty$ . With these restrictions, $\sigma_{t}^{2}$ is the variance function of $Y_{t}$ , conditioned on $\bm{X}_{t}$ , as

[TABLE]

The $d$ -dimensional absolutely continuous covariate $\bm{X}_{t}$ may include finitely many lagged values of $Y_{t}$ , for instance $\bm{X}_{t}=(Y_{t-1},\dots,Y_{t-d})^{T}$ , such that the model includes nonparametric AR-ARCH models.

Our aim is to test whether the function $\sigma_{t}^{2}(\cdot)$ is stable in time $t$ . Given observations $(Y_{1},\bm{X}_{1}),\dots,(Y_{n},\bm{X}_{n})$ the null hypothesis

[TABLE]

for some not further specified function $\sigma^{2}:\mathbb{R}^{d}\to\mathbb{R}$ (not depending on time $t$ ) will be considered.

The idea is to base tests for $H_{0}$ on a sequential marked empirical process of residuals,

[TABLE]

indexed in $s\in[0,1]$ and $\bm{z}\in\mathbb{R}^{d}$ . Throughout $I\{\dots\}$ denotes an indicator function. Further $\omega_{n}(\cdot)=I\{\cdot\in\bm{J}_{n}\}$ is a weight function with $\bm{J}_{n}$ specified in assumption (J) in appendix A. The regression and volatility functions are estimated as

[TABLE]

and

[TABLE]

respectively, with kernel function $K$ and bandwidth $h_{n}$ as considered in the assumptions in appendix A. The null hypothesis $H_{0}$ of no change in the variance will be rejected for large values of, e.g., a Kolmogorov-Smirnov type test statistic

[TABLE]

due to the following motivation. Note that the volatility function $\sigma^{2}_{t}$ from (2.2) can be viewed as regression function in a regression model

[TABLE]

with covariate $\bm{X}_{t}$ , response variable $U_{t}^{2}$ and innovations $\xi_{t}=U_{t}^{2}-\sigma^{2}_{t}(\bm{X}_{t})$ , that satisfy $E[\xi_{t}|\bm{X}_{t}]=0$ and $E[\xi_{t}^{2}|\bm{X}_{t}]=\sigma_{t}^{4}(\bm{X}_{t})E[(\varepsilon^{2}_{t}-1)^{2}]$ a.s. However, this is not a feasible model as $U_{t}=Y_{t}-m(\bm{X}_{t})$ is unobservable and has to be estimated. The term

[TABLE]

in the definition of the process $\hat{T}_{n}$ can be seen as estimator for the innovation $\xi_{t}$ in the ‘non-feasible’ model above under the null hypothesis $\sigma_{t}^{2}(\cdot)=\sigma^{2}(\cdot)\forall t$ . Thus $n^{-1/2}\hat{T}_{n}$ will vanish for $n\to\infty$ under the null hypothesis. The limiting process of $\hat{T}_{n}$ will be given in Corollary 3.2 below. From this result critical values for a test based on the Kolmogorov-Smirnov type test statistic $T_{n1}$ can be approximated. The behavior of $T_{n1}$ under fixed alternatives will be demonstrated in Remark 3.3 in order to motivate consistency of the test. The process $\hat{T}_{n}$ is a consistent improvement of CUSUM tests analogous to the procedure in Mohr and Neumeyer (2019) developed for changes in the regression function.

Remark 2.1.

In model (2.1) we assume a regression function $m$ that is stable in time $t$ . For testing of a change in the variance function this assumption makes sense if beforehand one can test for a change in the regression function applying a testing procedure which only reacts sensitive to changes in the regression function, not to changes in the variance function. Mohr and Neumeyer (2019) provide such a bootstrap test, which can be applied in cases of unstable variances, but as desired only reacts sensitive to changes in the regression function. Consecutively applying the bootstrap test in Mohr and Neumeyer (2019) and, if it does not reject, the test in the paper at hand, gives the knowledge of whether a change occurs in the mean or the variance function.

3 Asymptotic results

Under the regularity assumptions in appendix A one can derive the following decomposition of the process $\hat{T}_{n}$ defined in (2.3) in terms of the process

[TABLE]

as well as the weak convergence of $T_{n}$ .

Theorem 3.1.

Assume model (2.1), (2.2) under the null hypothesis $H_{0}$ and assumptions (G), ( $\bm{\xi}$ ), (M), (J), (F1), (F2), (K), (B1) and (B2) from appendix A.

(i)* Then, $\hat{T}_{n}(s,\bm{z})=T_{n}(s,\bm{z})-sT_{n}(1,\bm{z})+o_{P}(1)$ uniformly in $s\in[0,1]$ and $\bm{z}\in\mathbb{R}^{d}$ .*

(ii)* The process $T_{n}=\{T_{n}(s,\bm{z}):s\in[0,1],\bm{z}\in\mathbb{R}^{d}\}$ converges weakly in $\ell^{\infty}([0,1]\times\mathbb{R}^{d})$ to a centered Gaussian process ${G}$ with*

[TABLE]

where ${\Sigma}(\bm{z}):=E[(\varepsilon^{2}_{1}-1)^{2}]\int_{(-\bm{\infty},\bm{z}]}\sigma^{4}(\bm{x})f(\bm{x})d\bm{x}$ .

Here and throughout we define $(-\bm{\infty},\bm{z}]=(-\infty,z_{1}]\times\cdots\times(-\infty,z_{d}]$ for $\bm{z}=(z_{1},\dots,z_{d})\in\mathbb{R}^{d}$ . The proof of Theorem 3.1 is given in appendix B. An application of the continuous mapping theorem and Slutsky’s lemma give the following weak convergence result for the process $\hat{T}_{n}$ .

Corollary 3.2.

Suppose that the assumptions of Theorem 3.1 and $H_{0}$ are satisfied. Then the process $\hat{T}_{n}$ converges weakly in $\ell^{\infty}([0,1]\times\mathbb{R}^{d})$ to a centered Gaussian process ${G}_{0}$ with

[TABLE]

The continuous mapping theorem then implies convergence in distribution of the Kolmogorov-Smirnov test statistic,

[TABLE]

In particular in the case $d=1$ using continuity of ${\Sigma}$ and the scaling property of the Brownian motion, it holds that $T_{n1}$ converges in distribution to $c^{1/2}T$ , where

[TABLE]

and $K_{0}$ is a Kiefer-Müller process. The constant ${c}=E[((Y_{1}-m(X_{1}))^{2}-\sigma^{2}(X_{1}))^{2}]$ can be consistently estimated as

[TABLE]

and the test statistic $T_{n1}/\hat{{c}}_{n}^{1/2}$ is asymptotically distribution-free. We reject $H_{0}$ at asymptotic level $\alpha$ if $T_{n1}/\hat{{c}}_{n}^{1/2}$ is larger than the (known) $(1-\alpha)$ -quantile of $T$ .

Remark 3.3.

To see that the test is consistent against simple fixed alternatives of one change in the volatility function,

[TABLE]

for some functions with $\sigma^{2}_{(1)}\not\equiv\sigma^{2}_{(2)}$ , consider a triangular array

[TABLE]

with regression function $m$ stable in time and innovations such that $E[U_{n,t}|\mathcal{F}_{n}^{t}]=0$ and $E[U_{n,t}^{2}|\bm{X}_{n,t}]=\sigma_{n,t}^{2}(\bm{X}_{n,t})$ a.s. Further assume that the covariate $\bm{X}_{n,t}$ is absolutely continuous with density function $f_{n,t}$ . Then $\hat{\sigma}_{n}^{2}(\bm{x})$ will estimate the function

[TABLE]

Now assume that for each $s\in(0,1)$ , the limit of $n^{-1}\sum_{i=1}^{\left\lfloor ns\right\rfloor}f_{n,i}$ exists and denote it by $\bar{f}^{(s)}$ . Then $n^{-1/2}\hat{T}_{n}(s_{0},\bm{z})$ will converge in probability to the integral

[TABLE]

which, under $H_{1}$ , does not vanish for at least one $\bm{z}=\bm{z}_{0}$ (provided that $\bar{f}^{(s_{0})}\neq\bar{f}^{(1)}$ ). As $T_{n1}\geq|\hat{T}_{n}(s_{0},\bm{z}_{0})|$ , the test statistic will converge to infinity in probability and the test is consistent.

Remark 3.4.

A traditional CUSUM test statistic in our context would be defined as $\sup_{s\in[0,1]}|\hat{T}_{n}(s,\infty)|$ . With the same reasoning as in Remark 3.3, $n^{-1/2}\hat{T}_{n}(s_{0},\infty)$ will converge in probability to

[TABLE]

which could be zero, even under the alternative $H_{1}$ . In such a case the CUSUM test is not consistent.

4 Finite sample properties

4.1 Simulations

A Monte Carlo study is conducted in order to compare the results for $T_{n1}$ from section 2 and a Cramér-von Mises type test $T_{n2}:=\sup_{z\in\mathbb{R}}\int_{0}^{1}|\hat{T}_{n}(s,z)|^{2}ds$ with those of the traditional CUSUM versions denoted by $KS:=\sup_{s\in[0,1]}|\hat{T}_{n}(s,\infty)|$ and $CM:=\int|\hat{T}_{n}(s,\infty)|^{2}ds$ . All simulations are carried out with a level of $5\%$ , $1000$ replications and for sample sizes $n\in\{100,300,500\}$ . For the nonparametric estimators $\hat{m}_{n}$ and $\hat{\sigma}^{2}_{n}$ we use an Epanechnikov kernel $K$ and $h_{n}=n^{-1/3}$ as a simple ad hoc bandwidth. Furthermore, we set $c_{n}=\log(n)$ for the weighting function. The data is simulated from the following models.

[TABLE]

where $X_{t}$ is an exogenous variable following the AR(1) model $X_{t}=0.4X_{t-1}+\xi_{t}$ with $\xi_{t}$ being i.i.d. $\sim\mathcal{N}(0,1)$ .

[TABLE]

For both model 1 and 2 we consider $s_{0}\in\{0,0.25,0.5,0.75,1\}$ and two different choices for the regression function, namely $m(x)=0.5x$ (case (a)) and $m(x)=-0.5x$ (case (b)).

Model 1 is a heteroscedastic regression model with autoregressive covariables while model 2 is a heteroscedastic autoregression (AR-ARCH) model. In both cases $H_{0}$ is satisfied for $s_{0}\in\{0,1\}$ and $H_{1}$ is satisfied for $s_{0}\in\{0.25,0.5,0.75\}$ . Further, note that data generated from both models fulfill the stationarity and mixing assumptions when $s_{0}\in\{0,1\}$ (see Remark A.1 in appendix A).

Table 1 shows the rejection frequencies for model 1. To summarize the performance of the tests it is to mention that all level simulations ( $s_{0}\in\{0,1\}$ ) show reasonably good results. The tests based on $T_{n1}$ and $T_{n2}$ show nice consistency properties ( $s_{0}\in\{0.25,0.5,0.75\}$ ), rejecting the null more frequently with increasing sample sizes, where $T_{n2}$ has larger power. The classical CUSUM tests, however, clearly fail in detecting the change, having a power that does not exceed $10\%$ for all cases (see Remark 3.4). All of the tests perform rather poorly when the sample size is small, i.e. for $n=100$ . Furthermore, we note that changes occurring at $s_{0}=0.5$ are easiest to detect.

The corresponding results in model 2 can be found in table 2. The level of $5\%$ is approximately hold for all tests, even in the case where the variance has a relatively large influence ( $s_{0}=0$ ). The power simulations suggest that our tests as well as the classical CUSUM tests result in reasonable rejection probabilities, detecting the change more often for increasing sample sizes. Again changes in $s_{0}=0.5$ are easiest to detect.

4.2 Data example

In this section we will apply our test to a financial data set that is concerned with exchange rates of currencies. Exchange rate regimes indicate how a country manages its currency with respect to other currencies, it can vary from ”fixed”, over ”pegged” to ”floating”. In the case of a fixed regime, the currency is more or less fixed to some other currency. Contrarily with a floating regime the currency is allowed to fluctuate freely by market forces. Pegged regimes are somehow in between, the currency then has limited flexibility when compared with other currencies. As Zeileis et al. (2010) point out, information on the exchange rate regime of a country is not always fully disclosed by the corresponding central bank. Hence, data driven methods such as linear regression became popular to classify the exchange rate regime in operation. Zeileis et al. (2010) suggest that a vanishing error variance can be interpreted as a fixed currency regime, while a small or large error variance can indicate a pegged or floating regime respectively. This is illustrating that the error variance is an important quantity when looking for changes in the exchange rate regime. As such changes are often caused by policy interventions, tests for sudden breaks (rather than smooth transitions) are of reasonable interest.

We consider the exchange rates of the Chinese Yuan Renminbi (CNY) regressed on the exchange rates of the US Dollar (USD). The reason to do so is that China decided to give up on a fixed exchange rate to the US dollar in 2005. More precisely, we consider 251 data points which are the daily log-difference returns from July 26nd, 2005 to July 25nd, 2006 of the CNY and USD each with respect to the Swiss franc (CHF) as numeraire currency. This is the first year of observations of a data set considered by Zeileis et al. (2010) as well as Kirch and Weber (2018). Both studies use a linear regression model and a basket of four currencies as regressors, namely the USD, Japanese yen (JPY), Euro (EUR) and the British Pound (GBP). However, the results of Zeileis et al. (2010) indicate nearly vanishing regression coefficients for the JPY, the EUR and the GBP over the whole investigated time period from July 26nd, 2005 to July 31st, 2009.

We first apply the bootstrap test by Mohr and Neumeyer (2019) to test for changes in the unknown regression function. With a p-value of $90\%$ it suggests a stable regression function.

Secondly, we apply our test based on $T_{n1}$ using the $95\%$ -quantile of the limiting distribution $T$ as critical values. The test clearly rejects the null with a p-value smaller than $0.001\%$ , indicating a change in the conditional variance function. The possible change point can be estimated by $\mbox{argmax}_{s\in[0,1]}(\sup_{z\in\mathbb{R}}|\hat{T}_{n}(s,z)|)$ and suggests a change of the exchange rate regime in March 3rd, 2006 which is consistent with the results of Zeileis et al. (2010). Figure 1 shows the cumulative sum, $\sup_{z\in\mathbb{R}}|\hat{T}_{n}(\cdot,z)|$ (top plot), as well as the exchange rates of the CNY plotted against the time (bottom plot). The green dashed line is indicating the critical value while the red dashed line corresponds to the estimated change point.

Note that applying the tests to the full data set, no change in the regression function is detected (p-value $16\%$ ), but a change in the variance is clearly detected (p-value smaller than $0.001\%$ ). However, as the data set is rather large and from the findings of Zeileis et al. (2010) we expect more than one change in the variance when looking at the full set of observations, which makes the estimation of possible changes more complicated (see also section 5).

5 Concluding remarks

This paper closes a gap in the change point testing theory for nonparametric time series models. Assume that one already has accepted that there is no change in the (nonparametric) regression function, but one suspects a change in the (nonparametric) volatility function. In such a case the new test gives a valid procedure. To the best knowledge of the authors the new test is the first that can be applied to (nonparametric) autoregressive models (no assumption of bounded support of the covariates) and is consistent against general alternatives of a change point in the variance function.

Under the assumption that only one change occurs, an estimator for the change point is given by $\mbox{argmax}_{s\in[0,1]}(\sup_{\bm{z}\in\mathbb{R}^{d}}|\hat{T}_{n}(s,\bm{z})|)$ . Asymptotic properties of this estimator will be considered in future research. If more than one change occurs it might be necessary to modify this estimator. For instance Fryzlewicz (2014) proposes a wild binary segmentation procedure for the estimation of multiple changes in a simple piecewise-constant signal model, which possibly can be adapted to our setting.

For our theoretical result Theorem 3.1 we need stationarity under the null. However, if there are no changes in both regression function $m$ and variance function $\sigma^{2}$ , there still could be a change in the error distribution of $\varepsilon_{t}$ . In this case, a bootstrap test similar to the wild bootstrap proposal of Mohr and Neumeyer (2019) can be conducted that is sensible to changes in the variance function but not to changes in the error distribution. If both tests of Mohr and Neumeyer (2019) and the bootstrap version of the test at hand do not indicate a change in the regression and variance function respectively, the procedure of Selk and Neumeyer (2013) can be used to detect changes in the error distribution.

Appendix A Assumptions

(G)

Let $(Y_{t},\bm{X}_{t})_{t\in\mathbb{Z}}$ be strictly stationary and $\alpha$ -mixing with mixing coefficient $\alpha(\cdot)$ such that $\alpha(t)=O(a^{-t})$ for some $a\in(1,\infty)$ . 2. ( $\bm{\xi}$ )

For $\xi_{t}:=U_{t}^{2}-\sigma^{2}(\bm{X}_{t})$ let there exist some $\gamma>0$ and some even $Q>(d+1)(2+\gamma)$ such that $E[\xi_{t}|\mathcal{F}^{t}]=0$ , where $\mathcal{F}^{t}=\sigma(U_{j-1},\bm{X}_{j}:j\leq t)$ , $E[\xi_{t}^{2}|\bm{X}_{t}]=\tau^{2}(\bm{X}_{t})$ and $E[|\xi_{t}|^{Q\frac{2+\gamma}{2}}|\bm{X}_{t}]\leq c(\bm{X}_{t})^{Q}$ a.s. for all $t\in\mathbb{Z}$ , for some functions $c,\tau^{2}:\mathbb{R}^{d}\to\mathbb{R}$ with $\int\bar{c}(\bm{u})f(\bm{u})d(\bm{u})\leq M_{1}$ for some $M_{1}<\infty$ and $\bar{c}(\bm{u})=\max\left\{\tau^{2}(\bm{u}),c(\bm{u})^{2},\dots,c(\bm{u})^{Q}\right\}$ . 3. ( $\bm{\sigma}$ )

For $Q$ , $\gamma$ from assumption ( $\bm{\xi}$ ) let $\int|\sigma^{2}(\bm{u})|^{Q\frac{2+\gamma}{2}}f(\bm{u})d(\bm{u})\leq M_{2}$ for some $M_{2}<\infty$ . 4. (M)

For some $b>2$ let $E[|Y_{1}|^{2b}]<\infty$ and let $\bm{X}_{1}$ be absolutely continuous with density function $f:\mathbb{R}^{d}\to\mathbb{R}$ that satisfies $\sup_{\bm{x}\in\mathbb{R}^{d}}E[|Y_{1}|^{2b}|\bm{X}_{0}=\bm{x}]f(\bm{x})<\infty$ and $\sup_{\bm{x}\in\mathbb{R}^{d}}f(\bm{x})<\infty$ . Let there exist some $j^{*}<\infty$ such that $\sup_{\bm{x}_{1},\bm{x}_{j}}E[Y_{1}^{2}Y_{j}^{2}|\bm{X}_{1}=\bm{x}_{1},\bm{X}_{j}=\bm{x}_{j}]f_{1j}(\bm{x}_{1},\bm{x}_{j})<\infty$ for all $j\geq j^{*}$ , where $f_{1j}$ is the density function of $(\bm{X}_{1},\bm{X}_{j})$ . 5. (J)

Let $(c_{n})_{n\in\mathbb{N}}$ be a positive sequence of real numbers satisfying $c_{n}\to\infty$ and $c_{n}=O((\log{n})^{1/d})$ and let $\bm{J}_{n}=[-c_{n},c_{n}]^{d}$ . 6. (F1)

For some $C<\infty$ and $c_{n}$ from assumption (J) let $\bm{I}_{n}=[-c_{n}-Ch_{n},c_{n}+Ch_{n}]^{d}$ , where $h_{n}$ is from assumption (B1) and (B2) and let $\delta_{n}^{-1}=\inf_{\bm{x}\in\bm{J}_{n}}f(\bm{x})>0$ for all $n\in\mathbb{N}$ . Further, let for some $r,l\in\mathbb{N}$ and for all $n\in\mathbb{N}$

[TABLE]

where $|\bm{i}|=\sum_{j=1}^{d}i_{j}$ and $D^{\bm{i}}=\frac{\partial^{|\bm{i}|}}{\partial x_{1}^{i_{1}}\dots\partial x_{d}^{i_{d}}}$ for $\bm{i}=(i_{1},\dots,i_{d})\in\mathbb{N}_{0}^{d}$ . 7. (F2)

For $q_{n}$ from assumption (F1), $c_{n}$ from assumption (J) and $C$ from assumption (K), let for all $\bm{k}\in\mathbb{N}_{0}^{d}$ with $|\bm{k}|=2$ ,

[TABLE] 8. (K)

Let $K:\mathbb{R}^{d}\to\mathbb{R}$ be symmetric in each component, $l+1$ times differentiable with $\int_{\mathbb{R}^{d}}K(\bm{z})d\bm{z}=1$ and compact support $[-C,C]^{d}$ . Additionally, let $r\geq 2$ and $\int_{\mathbb{R}^{d}}K(\bm{z})\bm{z}^{\bm{k}}d\bm{z}=0$ for all $\bm{k}\in\mathbb{N}_{0}^{d}$ with $1\leq|\bm{k}|\leq r-1$ , where $\bm{z}^{\bm{k}}=z_{1}^{k_{1}}\cdots z_{d}^{k_{d}}$ . For all $L\in\{K\}\cup\{D^{\bm{k}}K:\bm{k}\in\mathbb{N}_{0}^{d}\text{ with }1\leq|\bm{k}|\leq l+1\}$ let $|L(\bm{u})|<\infty$ for all $\bm{u}\in\mathbb{R}^{d}$ and $|L(\bm{u})-L(\bm{u^{\prime}})|\leq\Lambda\|\bm{u}-\bm{u^{\prime}}\|$ for some $\Lambda<\infty$ and for all $\bm{u},\bm{u^{\prime}}\in\mathbb{R}^{d}$ . (Here, $r,l$ and $C$ are from assumption (F1).) 9. (B1)

For $\delta_{n},p_{n},q_{n}$ and $r,l$ from assumption (F1) let

[TABLE]

and for some $\eta\in(0,1)$ let

[TABLE] 10. (B2)

For $l,p_{n},q_{n},\delta_{n}$ from assumption (F1) and $\eta$ from assumption (B1) let

[TABLE]

and $\dfrac{(\log n)^{2+\frac{d}{l+\eta}}}{\sqrt{n^{1-\frac{1}{q}-\frac{d}{l+\eta}}}}q_{n}\delta_{n}=o(1)$ for $q=Q\frac{2+\gamma}{2}$ with $Q$ and $\gamma$ from assumption ( $\bm{\xi}$ ).

Remark A.1.

Assumption (G) is fulfilled by data following causal and stationary ARMA models as they have an MA( $\infty$ ) representation with coefficients that decay exponentially fast (see for instance Fan and Yao (2003) Subsection 2.6.1 (iii), p. 69). For more general nonlinear AR-ARCH processes both Lu (1998) and Liebscher (2005) give sufficient conditions on regression function, volatility function and the innovations under which the mixing condition in (G) holds. In the linear model

[TABLE]

where $(\varepsilon_{t})_{t}\overset{\text{i.i.d.}}{\sim}\mathcal{N}(0,1)$ , the condition in Lu (1998) simplifies to $(\sum_{i=1}^{d}|a_{i}|)^{2}+\sum_{i=1}^{d}b_{i}<1$ .

Remark A.2.

In order to satisfy the first bandwidth assumption in (B2), a necessary condition is $l+\eta>d$ , hence for higher dimensional covariate $\bm{X}_{t}$ , the existence of higher order partial derivatives of $f$ and $m$ is needed. In order to satisfy both the first and third bandwidth assumption in (B2) at the same time, depending on the dimension $d$ and the smoothness parameters $l$ and $\eta$ , the order of the kernel $r$ needs to be chosen such that $r>\frac{d}{2}\frac{l+\eta}{l+\eta-d}$ holds. As a rule of thumb, one can choose $h_{n}=O(n^{-k})$ for some $0<k<\frac{1}{d}-\frac{1}{l+\eta}$ and a kernel, such that $r>\frac{1}{2k}$ . That choice satisfies the assumptions given negligible rates for $q_{n}$ and $\delta_{n}$ .

Further note that the last constraint in (B2) is merely a trade off between existence of moments of $\xi_{t}$ , dimension $d$ and smoothness parameters $l$ and $\eta$ . It is satisfied if $q>\frac{l+\eta}{l+\eta-d}$ (given negligible rates for $q_{n}$ and $\delta_{n}$ ).

Appendix B Proofs

Lemma B.1.

Under the assumptions of Theorem 3.1 and under $H_{0}$ the following rates of convergence can be obtained for the kernel estimators $\hat{m}_{n}$ and $\hat{\sigma}^{2}_{n}$ ,

(i)
(a)

$\sup\limits_{\bm{x}\in\bm{J}_{n}}\left|\hat{m}_{n}(\bm{x})-m(\bm{x})\right|=O_{P}\left(\left(\sqrt{\frac{\log{n}}{nh_{n}^{d}}}+h_{n}^{r}p_{n}\right)q_{n}\delta_{n}\right)$ , 2. (b)

$\sup\limits_{\bm{x}\in\bm{J}_{n}}\left|D^{\bm{k}}\left(\hat{m}_{n}(\bm{x})-m(\bm{x})\right)\right|=O_{P}\left(\left(\sqrt{\frac{\log{n}}{nh_{n}^{d+2|\bm{k}|}}}+h_{n}^{r}p_{n}\right)p_{n}^{|\bm{k}|}q_{n}\delta_{n}^{|\bm{k}|+1}\right)$ * for all $1\leq|\bm{k}|\leq l+1$ ,* 3. (c)

$\displaystyle\sup\limits_{\begin{subarray}{c}\bm{x},\bm{y}\in\bm{J}_{n}\\ \bm{x}\neq\bm{y}\end{subarray}}\frac{\left|D^{\bm{k}}\left(\hat{m}_{n}(\bm{x})-m(\bm{x})\right)-D^{\bm{k}}\left(\hat{m}_{n}(\bm{y})-m(\bm{y})\right)\right|}{\|\bm{x}-\bm{y}\|^{\eta}}=o_{P}(1)$ * for all $|\bm{k}|=l$ , * 2. (ii)

(a)

$\sup\limits_{\bm{x}\in\bm{J}_{n}}\left|\hat{\sigma}^{2}_{n}(\bm{x})-\sigma^{2}(\bm{x})\right|=O_{P}\left(\left(\sqrt{\frac{\log{n}}{nh_{n}^{d}}}+h_{n}^{r}p_{n}\right)q_{n}^{2}\delta_{n}\right)$ , 2. (b)

$\sup\limits_{\bm{x}\in\bm{J}_{n}}\left|D^{\bm{k}}\left(\hat{\sigma}^{2}_{n}(\bm{x})-\sigma^{2}(\bm{x})\right)\right|=O_{P}\left(\left(\sqrt{\frac{\log{n}}{nh_{n}^{d+2|\bm{k}|}}}+h_{n}^{r}p_{n}\right)p_{n}^{|\bm{k}|}q_{n}^{2}\delta_{n}^{|\bm{k}|+1}\right)$ * for all $1\leq|\bm{k}|\leq l+1$ ,* 3. (c)

$\displaystyle\sup\limits_{\begin{subarray}{c}\bm{x},\bm{y}\in\bm{J}_{n}\\ \bm{x}\neq\bm{y}\end{subarray}}\frac{\left|D^{\bm{k}}\left(\hat{\sigma}^{2}_{n}(\bm{x})-\sigma^{2}(\bm{x})\right)-D^{\bm{k}}\left(\hat{\sigma}^{2}_{n}(\bm{y})-\sigma^{2}(\bm{y})\right)\right|}{\|\bm{x}-\bm{y}\|^{\eta}}=o_{P}(1)$ * for all $|\bm{k}|=l$ . *

Note that the results for the Nadaraya-Watson estimator $\hat{m}_{n}$ in (i) are also stated in Lemma A.1 in Mohr and Neumeyer (2019). The proof of Lemma B.1 is similar to the proof of Theorem 8 in Hansen (2008) and omitted for the sake of brevity.

Lemma B.2.

Under the assumptions of Theorem 3.1 and under $H_{0}$ we have uniformly in $s\in[0,1]$ and $\bm{z}\in\mathbb{R}^{d}$

[TABLE]

Proof.

For some $l$ -times differentiable function $h:\bm{J}_{n}\to\mathbb{R}$ define the norm

[TABLE]

and the function class $\mathcal{H}:=\mathcal{C}_{1,n}^{l+\eta}(\bm{J}_{n}):=\{h:\bm{J}_{n}\to\mathbb{R}:\|h\|_{l+\eta}\leq 1,\sup_{\bm{x}\in\bm{J}_{n}}\left|h(\bm{x})\right|\leq z_{n}(\log n)^{1/2}\}$ with $z_{n}:=q_{n}\delta_{n}((\log n)/(nh_{n}^{d}))^{1/2}$ . The third bandwidth condition in (B2) implies

[TABLE]

and thus Lemma B.1 (i) implies that $P(\hat{h}_{n}\in\mathcal{C}_{1,n}^{l+\eta}(\bm{J}_{n}))\to 1$ as $n\to\infty$ holds for $\hat{h}_{n}(\bm{x})=(m(\bm{x})-\hat{m}_{n}(\bm{x}))\omega_{n}(\bm{x})$ . It is then sufficient to consider $n^{-1/2}\sum_{i=1}^{\left\lfloor ns\right\rfloor}h(\bm{X}_{i})U_{i}I\{\bm{X}_{i}\leq\bm{z}\}$ for $s\in[0,1]$ , $\bm{z}\in\mathbb{R}^{d}$ and $h\in\mathcal{H}$ . Furthermore, using ( $\bm{\xi}$ ) and ( $\bm{\sigma}$ ) it can be shown that for $q:=Q\frac{2+\gamma}{2}>2$

[TABLE]

holds uniformly in $s\in[0,1]$ and $\bm{z}\in\mathbb{R}^{d}$ . Defining the function class $\mathcal{F}:=\{(u,\bm{x})\mapsto uI\{|u|\leq n^{1/q}\}I\{\bm{x}\leq\bm{z}\}:\bm{z}\in\mathbb{R}^{d}\}$ and imposing $(U_{1},\bm{X}_{1})\sim P$ , the assertion then follows if we show

[TABLE]

To this end let $\varepsilon_{n1}=n^{-1/2}n^{-1/q}$ , $\varepsilon_{n2}=n^{-1/2}$ and $\varepsilon_{n3}=n^{-1/2}/(\log n)$ and let further $0=s_{1}<\dots<s_{K_{n}}=1$ partition $[0,1]$ in intervals of length $2\varepsilon_{n1}$ such that $K_{n}=O(\varepsilon_{n1}^{-1})$ . Furthermore, we use the bracketing numbers $J_{n}:=N_{[~{}]}\left(\varepsilon_{n2},\mathcal{F},\|\cdot\|_{L_{2}(P)}\right)$ and $M_{n}:=N_{[~{}]}\left(\varepsilon_{n3},\mathcal{H},\|\cdot\|_{\infty}\right)$ , where $\|\cdot\|_{\infty}$ is the supremum norm on $\bm{J}_{n}$ . Let $[\varphi_{1}^{l},\varphi_{1}^{u}],\dots,[\varphi_{J_{n}}^{l},\varphi_{J_{n}}^{u}]$ denote the brackets needed to cover $\mathcal{F}$ . Let furthermore $[h_{1}^{l},h_{1}^{u}],\dots,[h_{M_{n}}^{l},h_{M_{n}}^{u}]$ define the brackets needed to cover $\mathcal{H}$ . It can be shown that $J_{n}=O\left(\varepsilon_{n2}^{-2d}\right)$ and $M_{n}=O(\exp(c_{n}^{d}\varepsilon_{n3}^{-d/(l+\eta)}))$ and further

[TABLE]

In what follows we only consider the first line on the right hand side, while the other ones can be treated similarly. We apply Theorem 2.1 of Liebscher (1996) to the random variable (for $m,j,k$ fixed)

[TABLE]

The mixing coefficient of $\{Z_{t}:1\leq t\leq n\}$ can be bounded by the mixing coefficient of $\{(U_{t},\bm{X}_{t}):t\in\mathbb{Z}\}$ due to Bradley (1985), Section 2, remark (iv). Further, the variables are centered and have a bound of order $O(z_{n}(\log n)^{1/2}n^{1/q})$ . Applying Theorem 2.1 to $\sum_{i=1}^{n}Z_{i}$ yields for all $\epsilon>0$ and $n\in\mathbb{N}$ large enough

[TABLE]

where the first, second and last bandwidth constraint in (B2) were used in the last equality. Details are omitted for the sake of brevity.

∎

Lemma B.3.

Under the assumptions of Theorem 3.1 and under $H_{0}$ we have uniformly in $s\in[0,1]$ and $\bm{z}\in\mathbb{R}^{d}$

[TABLE]

Note that the proof of Lemma B.3 is similar to the proof of Theorem 3.1 (i) in Mohr and Neumeyer (2019). It will only be sketched for the sake of brevity.

Proof.

Using $\xi_{t}=U_{t}^{2}-\sigma^{2}(\bm{X}_{t})$ under $H_{0}$ , it holds that

[TABLE]

By strict stationarity of $\{(\xi_{t},\bm{X}_{t}):t\in\mathbb{Z}\}$ and the moment constraints from ( $\bm{\xi}$ ) we deduce that uniformly in $s\in[0,1]$ and $\bm{z}\in\mathbb{R}^{d}$

[TABLE]

Making use of the uniform convergence rates of $\hat{\sigma}^{2}_{n}$ stated in Lemma B.1 (ii) we furthermore obtain

[TABLE]

uniformly in $s\in[0,1]$ and $\bm{z}\in\mathbb{R}^{d}$ . Continuing by inserting the definition of $\hat{\sigma}^{2}_{n}$ , using $Y_{i}=m(\bm{X}_{i})+U_{i}$ and finally $\xi_{i}=U_{i}^{2}-\sigma^{2}(\bm{X}_{i})$ under $H_{0}$ , it holds that

[TABLE]

Concerning (B.1) and (B.2), it can be shown that

[TABLE]

and

[TABLE]

uniformly in $\bm{z}\in\mathbb{R}^{d}$ respectively. Using the uniform rates of convergences of $\hat{m}_{n}$ from Lemma B.1 (i) (a), which also hold on the slightly larger set $\bm{I}_{n}=[-c_{n}-Ch_{n},c_{n}+Ch_{n}]^{d}$ , it can be shown that the term (B.3) is negligible uniformly in $\bm{z}\in\mathbb{R}^{d}$ . Finally, using similar methods as for the proof of Lemma B.2, it can be shown that the term (B.4) is as well negligible uniformly in $\bm{z}\in\mathbb{R}^{d}$ . Putting the results together, the assertion of the lemma follows.

∎

Proof of Theorem 3.1.

The assertion (i) follows by Lemma B.2 and Lemma B.3 and by Lemma B.1 (i) (a) together with the bandwidth constraints as

[TABLE]

For (ii) note that $\{(\xi_{t},\bm{X}_{t}):t\in\mathbb{Z}\}$ is strictly stationary and strongly mixing under $H_{0}$ and assumption (G). Denote by $P$ the marginal distribution of $(\xi_{1},\bm{X}_{1})$ . The assertion then follows by an application of Corollary 2.7 in Mohr (2019) to the sequential empirical process $\{n^{-1/2}\sum_{i=1}^{\left\lfloor ns\right\rfloor}(\varphi(\xi_{i},\bm{X}_{i})-\int\varphi dP):s\in[0,1],\varphi\in\mathcal{F}\}$ indexed in the function class $\mathcal{F}:=\{(\xi,\bm{x})\mapsto\xi I\{\bm{x}\leq\bm{z}\}:\bm{z}\in\mathbb{R}^{d}\}$ . The conditions that are needed for the asymptotic equicontinuity of the process are implied by assumptions (G) and ( $\bm{\xi}$ ). The convergence of the finite dimensional distributions can be shown by applying Corollary 1 in Rio (1995), which is a central limit theorem for strongly mixing triangular arrays.

∎

Bibliography32

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Berkes et al. (2004) Berkes, I., Horváth, L., and Kokoszka, P. (2004). Testing for parameter constancy in GARCH(p,q) models. Stat. Probab. Lett. , 70:182–195.
2Bradley (1985) Bradley, R. C. (1985). Basic properties of strong mixing conditions. In Eberlein, E. and Taqqu, M. S., editors, Dependence in Probability and Statistics , pages 165–192. Birkhäuser, Boston.
3Chen et al. (2005) Chen, G., Choi, Y. K., and Zhou, Y. (2005). Nonparametric estimation of structural change points in volatility models for time series. J. Econom. , 126:79–114.
4Chen and Tian (2014) Chen, Z. and Tian, Z. (2014). Ratio tests for variance change in nonparametric regression. Statistics , 48:1–16.
5Fan and Yao (2003) Fan, J. and Yao, Q. (2003). Nonlinear Time Series: Nonparametric and Parametric Methods . Springer, New York.
6Fryzlewicz (2014) Fryzlewicz, P. (2014). Wild binary segmentation for multiple change-point detection. Ann. Statist. , 42:2243–2281.
7Gao (2007) Gao, J. (2007). Nonlinear Time Series: Semiparametric and Nonparametric Methods . Chapman & Hall/CRC, Boca Raton.
8Hansen (2008) Hansen, B. E. (2008). Uniform convergence rates for kernel estimation with dependent data. Econom. Theory , 24:726–748.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Nonparametric volatility change detection

Abstract

1 Introduction

2 The model and test statistic

Remark 2.1**.**

3 Asymptotic results

Theorem 3.1**.**

Corollary 3.2**.**

Remark 3.3**.**

Remark 3.4**.**

4 Finite sample properties

4.1 Simulations

4.2 Data example

5 Concluding remarks

Appendix A Assumptions

Remark A.1**.**

Remark A.2**.**

Appendix B Proofs

Lemma B.1**.**

Lemma B.2**.**

Proof.

Lemma B.3**.**

Proof.

Proof of Theorem 3.1.

Remark 2.1.

Theorem 3.1.

Corollary 3.2.

Remark 3.3.

Remark 3.4.

Remark A.1.

Remark A.2.

Lemma B.1.

Lemma B.2.

Lemma B.3.