Robust test for dispersion parameter change in discretely observed   diffusion processes

Junmo Song

arXiv:1906.12208·math.ST·July 1, 2019·Comput. Stat. Data Anal.

Robust test for dispersion parameter change in discretely observed diffusion processes

Junmo Song

PDF

Open Access

TL;DR

This paper introduces a robust CUSUM test for detecting changes in dispersion parameters of discretely observed diffusion processes, effectively handling outliers and outperforming traditional methods in simulations and real data applications.

Contribution

It proposes a novel trimmed-residual based CUSUM test that is robust to outliers and demonstrates its effectiveness through theoretical convergence and empirical evaluations.

Findings

01

The test converges weakly to a Brownian bridge under the null hypothesis.

02

Simulations show strong robustness against outliers.

03

Application to KOSPI200 data identifies change points missed by naive tests.

Abstract

This paper deals with the problem of testing for dispersion parameter change in discretely observed diffusion processes when the observations are contaminated by outliers. To lessen the impact of outliers, we first calculate residuals using a robust estimate and then propose a trimmed-residual based CUSUM test. The proposed test is shown to converge weakly to a function of the Brownian bridge under the null hypothesis of no parameter change. We conduct simulations to evaluate performances of the proposed test in the presence of outliers. Numerical results confirm that the proposed test posses a strong robust property against outliers. In real data analysis, we fit the Ornstein-Uhlenbeck process to KOSPI200 volatility index data and locate some change points that are not detected by a naive CUSUM test.

Tables9

Table 1. Table 1: Empirical sizes of T n subscript 𝑇 𝑛 T_{n} , T 1 , n α superscript subscript 𝑇 1 𝑛 𝛼 T_{1,n}^{\alpha} and T 2 , n α superscript subscript 𝑇 2 𝑛 𝛼 T_{2,n}^{\alpha} in the case of h n = n − 0.75 subscript ℎ 𝑛 superscript 𝑛 0.75 h_{n}=n^{-0.75} and no contamination

			$M = z_{0.005}^{2}$					$M = z_{0.025}^{2}$
$n$	$T_{n}$	$T_{j, n}^{α}$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1$
200	0.039	$T_{1, n}^{α}$	0.037	0.037	0.037	0.037	0.037	0.042	0.041	0.041	0.039	0.039
		$T_{2, n}^{α}$	0.038	0.039	0.038	0.037	0.038	0.042	0.041	0.041	0.041	0.040
500	0.045	$T_{1, n}^{α}$	0.046	0.046	0.046	0.047	0.047	0.047	0.047	0.048	0.048	0.048
		$T_{2, n}^{α}$	0.048	0.049	0.048	0.048	0.049	0.051	0.051	0.051	0.049	0.049
1000	0.046	$T_{1, n}^{α}$	0.044	0.045	0.045	0.045	0.046	0.047	0.048	0.048	0.048	0.048
		$T_{2, n}^{α}$	0.045	0.044	0.044	0.045	0.045	0.047	0.047	0.047	0.047	0.047
3000	0.049	$T_{1, n}^{α}$	0.047	0.047	0.047	0.048	0.048	0.047	0.046	0.046	0.046	0.046
		$T_{2, n}^{α}$	0.047	0.048	0.048	0.048	0.048	0.048	0.048	0.048	0.048	0.047

Table 2. Table 2: Empirical powers of T n subscript 𝑇 𝑛 T_{n} , T 1 , n α superscript subscript 𝑇 1 𝑛 𝛼 T_{1,n}^{\alpha} and T 2 , n α superscript subscript 𝑇 2 𝑛 𝛼 T_{2,n}^{\alpha} without outliers when ( θ 0 , σ 0 ) subscript 𝜃 0 subscript 𝜎 0 (\theta_{0},\sigma_{0}) changes from (1,1) to ( θ 1 , σ 1 ) subscript 𝜃 1 subscript 𝜎 1 (\theta_{1},\sigma_{1})

				$M = z_{0.005}^{2}$					$M = z_{0.025}^{2}$
$(θ_{1}, σ_{1})$	$n$	$T_{n}$	$T_{j, n}^{α}$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1.0$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1.0$
(1,1.2)	200	0.297	$T_{1, n}^{α}$	0.303	0.302	0.302	0.301	0.301	0.288	0.287	0.287	0.287	0.288
			$T_{2, n}^{α}$	0.280	0.279	0.279	0.275	0.272	0.191	0.189	0.186	0.184	0.180
	500	0.708	$T_{1, n}^{α}$	0.699	0.700	0.699	0.699	0.698	0.664	0.662	0.662	0.660	0.658
			$T_{2, n}^{α}$	0.664	0.662	0.659	0.657	0.653	0.451	0.448	0.444	0.441	0.433
	1000	0.957	$T_{1, n}^{α}$	0.953	0.953	0.953	0.953	0.953	0.936	0.936	0.935	0.934	0.933
			$T_{2, n}^{α}$	0.934	0.933	0.934	0.932	0.930	0.755	0.751	0.747	0.744	0.739
(1,1.5)	200	0.922	$T_{1, n}^{α}$	0.929	0.929	0.928	0.927	0.925	0.914	0.912	0.912	0.911	0.909
			$T_{2, n}^{α}$	0.904	0.901	0.898	0.894	0.889	0.746	0.734	0.718	0.693	0.662
	500	1.000	$T_{1, n}^{α}$	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000
			$T_{2, n}^{α}$	1.000	1.000	1.000	1.000	1.000	0.990	0.988	0.985	0.978	0.971
	1000	1.000	$T_{1, n}^{α}$	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000
			$T_{2, n}^{α}$	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000
(5,1)	200	0.036	$T_{1, n}^{α}$	0.039	0.039	0.039	0.040	0.041	0.040	0.039	0.039	0.039	0.041
			$T_{2, n}^{α}$	0.039	0.038	0.037	0.036	0.036	0.039	0.040	0.039	0.038	0.037
	500	0.048	$T_{1, n}^{α}$	0.049	0.049	0.049	0.049	0.049	0.046	0.046	0.047	0.046	0.048
			$T_{2, n}^{α}$	0.045	0.045	0.045	0.046	0.047	0.041	0.042	0.042	0.042	0.042
	1000	0.043	$T_{1, n}^{α}$	0.045	0.045	0.045	0.045	0.044	0.046	0.046	0.045	0.044	0.044
			$T_{2, n}^{α}$	0.045	0.045	0.045	0.045	0.045	0.044	0.044	0.043	0.043	0.042
(5,1.2)	200	0.210	$T_{1, n}^{α}$	0.215	0.216	0.216	0.216	0.217	0.203	0.204	0.205	0.206	0.207
			$T_{2, n}^{α}$	0.201	0.202	0.201	0.201	0.198	0.133	0.132	0.131	0.130	0.125
	500	0.628	$T_{1, n}^{α}$	0.629	0.629	0.629	0.629	0.627	0.588	0.589	0.589	0.589	0.588
			$T_{2, n}^{α}$	0.583	0.583	0.582	0.582	0.579	0.393	0.393	0.390	0.388	0.382
	1000	0.940	$T_{1, n}^{α}$	0.939	0.938	0.938	0.938	0.937	0.910	0.910	0.910	0.910	0.909
			$T_{2, n}^{α}$	0.908	0.907	0.907	0.906	0.907	0.719	0.717	0.716	0.714	0.709

Table 3. Table 3: Empirical sizes of T n subscript 𝑇 𝑛 T_{n} , T 1 , n α superscript subscript 𝑇 1 𝑛 𝛼 T_{1,n}^{\alpha} and T 2 , n α superscript subscript 𝑇 2 𝑛 𝛼 T_{2,n}^{\alpha} under the contamination with p = 0.5 % 𝑝 percent 0.5 p=0.5\% and σ v 2 = 1 superscript subscript 𝜎 𝑣 2 1 \sigma_{v}^{2}=1

			$M = z_{0.005}^{2}$					$M = z_{0.025}^{2}$
$n$	$T_{n}$	$T_{j, n}^{α}$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1.0$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1.0$
200	0.051	$T_{1, n}^{α}$	0.054	0.051	0.051	0.051	0.052	0.051	0.049	0.048	0.048	0.049
		$T_{2, n}^{α}$	0.039	0.039	0.039	0.039	0.039	0.041	0.040	0.041	0.041	0.040
500	0.100	$T_{1, n}^{α}$	0.061	0.061	0.061	0.061	0.062	0.055	0.055	0.055	0.055	0.056
		$T_{2, n}^{α}$	0.046	0.046	0.046	0.046	0.047	0.047	0.047	0.047	0.049	0.049
1000	0.157	$T_{1, n}^{α}$	0.068	0.067	0.067	0.068	0.069	0.053	0.053	0.053	0.053	0.053
		$T_{2, n}^{α}$	0.038	0.037	0.038	0.037	0.037	0.044	0.044	0.044	0.045	0.046
3000	0.213	$T_{1, n}^{α}$	0.078	0.078	0.078	0.077	0.078	0.062	0.062	0.062	0.062	0.062
		$T_{2, n}^{α}$	0.048	0.048	0.047	0.047	0.047	0.049	0.048	0.048	0.048	0.048

Table 4. Table 4: Empirical powers of T n subscript 𝑇 𝑛 T_{n} , T 1 , n α superscript subscript 𝑇 1 𝑛 𝛼 T_{1,n}^{\alpha} and T 2 , n α superscript subscript 𝑇 2 𝑛 𝛼 T_{2,n}^{\alpha} in the case of h n = n − 0.75 subscript ℎ 𝑛 superscript 𝑛 0.75 h_{n}=n^{-0.75} , p = 0.5 % 𝑝 percent 0.5 p=0.5\% and σ v 2 = 1 superscript subscript 𝜎 𝑣 2 1 \sigma_{v}^{2}=1 when ( θ 0 , σ 0 ) subscript 𝜃 0 subscript 𝜎 0 (\theta_{0},\sigma_{0}) changes from (1,1) to ( θ 1 , σ 1 ) subscript 𝜃 1 subscript 𝜎 1 (\theta_{1},\sigma_{1})

				$M = z_{0.005}^{2}$					$M = z_{0.025}^{2}$
$(θ_{1}, σ_{1})$	$n$	$T_{n}$	$T_{j, n}^{α}$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1.0$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1.0$
(1,1.2)	200	0.201	$T_{1, n}^{α}$	0.294	0.294	0.292	0.289	0.290	0.285	0.282	0.281	0.280	0.277
			$T_{2, n}^{α}$	0.274	0.272	0.269	0.267	0.262	0.191	0.186	0.182	0.177	0.173
	500	0.274	$T_{1, n}^{α}$	0.634	0.635	0.637	0.638	0.639	0.631	0.629	0.628	0.628	0.625
			$T_{2, n}^{α}$	0.645	0.640	0.637	0.634	0.632	0.456	0.442	0.438	0.431	0.425
	1000	0.224	$T_{1, n}^{α}$	0.909	0.909	0.909	0.910	0.910	0.914	0.914	0.913	0.914	0.913
			$T_{2, n}^{α}$	0.927	0.926	0.925	0.924	0.924	0.770	0.757	0.752	0.745	0.740
(1,1.5)	200	0.646	$T_{1, n}^{α}$	0.873	0.878	0.880	0.880	0.881	0.887	0.885	0.885	0.883	0.880
			$T_{2, n}^{α}$	0.896	0.892	0.888	0.882	0.875	0.739	0.718	0.700	0.677	0.644
	500	0.507	$T_{1, n}^{α}$	0.999	0.999	0.999	0.999	0.999	1.000	1.000	1.000	1.000	1.000
			$T_{2, n}^{α}$	1.000	1.000	1.000	1.000	1.000	0.990	0.987	0.984	0.981	0.974
	1000	0.377	$T_{1, n}^{α}$	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000
			$T_{2, n}^{α}$	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000
(5,1)	200	0.050	$T_{1, n}^{α}$	0.052	0.051	0.051	0.052	0.054	0.045	0.045	0.045	0.044	0.045
			$T_{2, n}^{α}$	0.038	0.037	0.037	0.036	0.039	0.041	0.040	0.040	0.039	0.037
	500	0.092	$T_{1, n}^{α}$	0.067	0.066	0.066	0.066	0.067	0.058	0.057	0.057	0.057	0.056
			$T_{2, n}^{α}$	0.050	0.050	0.050	0.050	0.051	0.044	0.044	0.043	0.043	0.043
	1000	0.150	$T_{1, n}^{α}$	0.074	0.073	0.073	0.073	0.073	0.061	0.060	0.061	0.061	0.061
			$T_{2, n}^{α}$	0.048	0.048	0.048	0.048	0.048	0.048	0.047	0.047	0.047	0.047
(5,1.2)	200	0.158	$T_{1, n}^{α}$	0.225	0.225	0.225	0.226	0.226	0.211	0.212	0.214	0.213	0.211
			$T_{2, n}^{α}$	0.200	0.202	0.201	0.201	0.201	0.136	0.135	0.134	0.132	0.131
	500	0.256	$T_{1, n}^{α}$	0.561	0.567	0.567	0.569	0.570	0.566	0.566	0.565	0.564	0.563
			$T_{2, n}^{α}$	0.570	0.569	0.569	0.569	0.567	0.400	0.391	0.389	0.389	0.388
	1000	0.219	$T_{1, n}^{α}$	0.877	0.879	0.880	0.881	0.880	0.882	0.884	0.884	0.885	0.883
			$T_{2, n}^{α}$	0.899	0.897	0.896	0.895	0.895	0.723	0.716	0.712	0.709	0.706

Table 5. Table 5: Empirical sizes of T n subscript 𝑇 𝑛 T_{n} , T 1 , n α superscript subscript 𝑇 1 𝑛 𝛼 T_{1,n}^{\alpha} and T 2 , n α superscript subscript 𝑇 2 𝑛 𝛼 T_{2,n}^{\alpha} in the case of h n = n − 0.75 subscript ℎ 𝑛 superscript 𝑛 0.75 h_{n}=n^{-0.75} , p = 5 % 𝑝 percent 5 p=5\% and σ v 2 = 1 superscript subscript 𝜎 𝑣 2 1 \sigma_{v}^{2}=1

			$M = z_{0.005}^{2}$					$M = z_{0.025}^{2}$
$n$	$T_{n}$	$T_{j, n}^{α}$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1.0$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1.0$
200	0.137	$T_{1, n}^{α}$	0.167	0.148	0.140	0.138	0.139	0.140	0.111	0.106	0.105	0.106
		$T_{2, n}^{α}$	0.075	0.055	0.053	0.053	0.053	0.058	0.048	0.047	0.047	0.047
500	0.196	$T_{1, n}^{α}$	0.198	0.170	0.165	0.165	0.166	0.149	0.123	0.116	0.117	0.120
		$T_{2, n}^{α}$	0.065	0.054	0.054	0.053	0.053	0.059	0.050	0.050	0.049	0.049
1000	0.217	$T_{1, n}^{α}$	0.203	0.181	0.178	0.176	0.181	0.152	0.127	0.125	0.124	0.128
		$T_{2, n}^{α}$	0.068	0.058	0.057	0.056	0.057	0.054	0.052	0.053	0.053	0.052
3000	0.254	$T_{1, n}^{α}$	0.210	0.196	0.194	0.195	0.202	0.145	0.132	0.129	0.130	0.135
		$T_{2, n}^{α}$	0.064	0.062	0.061	0.062	0.061	0.058	0.059	0.060	0.060	0.058

Table 6. Table 6: Empirical powers of T n subscript 𝑇 𝑛 T_{n} , T 1 , n α superscript subscript 𝑇 1 𝑛 𝛼 T_{1,n}^{\alpha} and T 2 , n α superscript subscript 𝑇 2 𝑛 𝛼 T_{2,n}^{\alpha} in the case of h n = n − 0.75 subscript ℎ 𝑛 superscript 𝑛 0.75 h_{n}=n^{-0.75} , p = 5 % 𝑝 percent 5 p=5\% and σ v 2 = 1 superscript subscript 𝜎 𝑣 2 1 \sigma_{v}^{2}=1 when ( θ 0 , σ 0 ) subscript 𝜃 0 subscript 𝜎 0 (\theta_{0},\sigma_{0}) changes from (1,1) to ( θ 1 , σ 1 ) subscript 𝜃 1 subscript 𝜎 1 (\theta_{1},\sigma_{1})

				$M = z_{0.005}^{2}$					$M = z_{0.025}^{2}$
$(θ_{1}, σ_{1})$	n	$T_{n}$	$T_{j, n}$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1.0$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.5$	$α = 1.0$
(1,1.2)	200	0.151	$T_{1, n}^{α}$	0.233	0.240	0.242	0.246	0.246	0.251	0.254	0.252	0.252	0.250
			$T_{2, n}^{α}$	0.211	0.222	0.225	0.224	0.225	0.222	0.199	0.187	0.179	0.177
	500	0.197	$T_{1, n}^{α}$	0.353	0.394	0.400	0.404	0.401	0.425	0.472	0.477	0.478	0.476
			$T_{2, n}^{α}$	0.484	0.535	0.540	0.538	0.540	0.512	0.452	0.429	0.419	0.425
	1000	0.231	$T_{1, n}^{α}$	0.491	0.570	0.582	0.589	0.582	0.641	0.698	0.705	0.707	0.703
			$T_{2, n}^{α}$	0.815	0.855	0.856	0.857	0.859	0.825	0.761	0.736	0.727	0.740
(1,1.5)	200	0.190	$T_{1, n}^{α}$	0.444	0.518	0.548	0.568	0.574	0.574	0.647	0.668	0.681	0.685
			$T_{2, n}^{α}$	0.612	0.717	0.744	0.754	0.752	0.721	0.708	0.682	0.657	0.634
	500	0.225	$T_{1, n}^{α}$	0.671	0.822	0.849	0.866	0.867	0.861	0.945	0.953	0.958	0.958
			$T_{2, n}^{α}$	0.947	0.994	0.996	0.996	0.996	0.993	0.990	0.982	0.977	0.975
	1000	0.244	$T_{1, n}^{α}$	0.887	0.969	0.978	0.981	0.980	0.985	0.999	0.999	0.999	0.999
			$T_{2, n}^{α}$	0.999	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000
(5,1)	200	0.123	$T_{1, n}^{α}$	0.175	0.152	0.146	0.144	0.144	0.148	0.116	0.110	0.107	0.110
			$T_{2, n}^{α}$	0.082	0.060	0.056	0.053	0.054	0.077	0.047	0.044	0.042	0.042
	500	0.171	$T_{1, n}^{α}$	0.206	0.177	0.173	0.170	0.174	0.156	0.128	0.120	0.120	0.122
			$T_{2, n}^{α}$	0.077	0.061	0.060	0.059	0.060	0.065	0.059	0.060	0.061	0.060
	1000	0.206	$T_{1, n}^{α}$	0.208	0.187	0.182	0.182	0.187	0.158	0.135	0.131	0.131	0.134
			$T_{2, n}^{α}$	0.072	0.060	0.058	0.058	0.058	0.057	0.055	0.056	0.056	0.056
(5,1.2)	200	0.123	$T_{1, n}^{α}$	0.208	0.216	0.221	0.223	0.224	0.206	0.216	0.217	0.217	0.221
			$T_{2, n}^{α}$	0.135	0.152	0.163	0.165	0.169	0.125	0.136	0.134	0.134	0.134
	500	0.186	$T_{1, n}^{α}$	0.314	0.355	0.363	0.367	0.366	0.353	0.406	0.415	0.420	0.418
			$T_{2, n}^{α}$	0.378	0.460	0.465	0.467	0.470	0.400	0.375	0.365	0.361	0.369
	1000	0.201	$T_{1, n}^{α}$	0.474	0.552	0.567	0.571	0.564	0.603	0.668	0.681	0.685	0.680
			$T_{2, n}^{α}$	0.753	0.804	0.807	0.809	0.811	0.757	0.705	0.684	0.676	0.693

Table 7. Table 7: Test statistics [p-values] of T n subscript 𝑇 𝑛 T_{n} , T 1 , n α superscript subscript 𝑇 1 𝑛 𝛼 T_{1,n}^{\alpha} , and T 2 , n α superscript subscript 𝑇 2 𝑛 𝛼 T_{2,n}^{\alpha}

$T_{n}$	$T_{j, n}^{α}$	$α = 0.1$	$α = 0.2$	$α = 0.3$	$α = 0.4$	$α = 0.5$	$α = 1$
0.957	$T_{1, n}^{α}$	1.863	1.843	1.807	1.769	1.789	1.835
$[0.319]$		[0.002]	[0.002]	[0.003]	[0.004]	[0.003]	[0.002]
	$T_{2, n}^{α}$	1.460	1.369	1.625	1.648	1.613	1.566
		[0.028]	[0.047]	[0.010]	[0.009]	[0.011]	[0.015]

Table 8. Table 8: ML and MDPD estimates for each sub-period

Period			MDPDE	$\hat{λ}$	$\hat{μ}$	$\hat{σ}$
Jan 2, 2015	$\sim$	Aug 25, 2015	MLE	14.93	14.23	16.57
			$α = 0.1$	14.23	13.05	7.88
			$α = 0.2$	13.99	12.87	7.53
			$α = 0.3$	13.53	12.76	7.37
			$α = 0.4$	13.11	12.68	7.28
			$α = 0.5$	12.72	12.62	7.23
			$α = 1.0$	11.38	12.33	7.24
Aug 26, 2015	$\sim$	Sep 22, 2015	MLE	177.02	20.52	25.80
			$α = 0.1$	180.32	20.55	26.23
			$α = 0.2$	183.21	20.58	26.58
			$α = 0.3$	185.77	20.61	26.84
			$α = 0.4$	188.06	20.63	27.03
			$α = 0.5$	190.14	20.66	27.13
			$α = 1.0$	198.43	20.78	26.62
Sep 23, 2015	$\sim$	Dec 10, 2015	MLE	31.76	14.71	10.73
			$α = 0.1$	30.56	14.63	10.60
			$α = 0.2$	28.97	14.56	10.41
			$α = 0.3$	26.83	14.49	10.18
			$α = 0.4$	23.99	14.44	9.88
			$α = 0.5$	20.61	14.40	9.51
			$α = 1.0$	11.66	14.86	8.09
Dec 11, 2015	$\sim$	Mar 2, 2016	MLE	43.43	17.88	23.28
			$α = 0.1$	44.94	17.55	22.56
			$α = 0.2$	46.46	17.28	21.83
			$α = 0.3$	47.81	17.08	21.24
			$α = 0.4$	48.97	16.93	20.88
			$α = 0.5$	50.06	16.83	20.70
			$α = 1.0$	60.70	16.51	20.20
Mar 3, 2016	$\sim$	Aug 31, 2017	MLE	30.34	12.78	13.20
			$α = 0.1$	32.43	12.32	10.31
			$α = 0.2$	33.74	12.05	8.55
			$α = 0.3$	33.38	11.92	7.68
			$α = 0.4$	32.45	11.84	7.30
			$α = 0.5$	31.79	11.79	7.13
			$α = 1.0$	30.64	11.64	6.99
Jan 2, 2015	$\sim$	Aug 31, 2017	MLE	17.85	13.75	15.87
			$α = 0.1$	20.30	13.01	12.24
			$α = 0.2$	20.21	12.60	10.45
			$α = 0.3$	20.25	12.35	9.28
			$α = 0.4$	20.01	12.20	8.61
			$α = 0.5$	19.68	12.12	8.26
			$α = 1.0$	19.65	11.93	7.93

Table 9. Table 9: 1-step-ahead forecasting errors for the models without and with change-points

OU process without parameter change				OU process with parameter changes
MDPDE	RMSE	RMSPE	#	MDPDE	RMSE	RMSPE	#
MLE	0.6171	0.0489	78	MLE	0.6035	0.0475	76
$α = 0.1$	0.6092	0.0480	76	$α = 0.1$	0.6014	0.0469	76
$α = 0.2$	0.6072	0.0477	76	$α = 0.2$	0.6028	0.0468	75
$α = 0.3$	0.6068	0.0475	74	$α = 0.3$	0.6042	0.0468	73
$α = 0.4$	0.6070	0.0475	73	$α = 0.4$	0.6051	0.0468	71
$α = 0.5$	0.6074	0.0475	73	$α = 0.5$	0.6057	0.0469	70
$α = 1.0$	0.6081	0.0475	73	$α = 1.0$	0.6075	0.0470	70
# denotes the number of observations included in 95% prediction intervals.

Equations117

d X_{t} = a (X_{t}, θ) d t + σ d W_{t}, X_{0} = x_{0},

d X_{t} = a (X_{t}, θ) d t + σ d W_{t}, X_{0} = x_{0},

H_{0} : σ does not change over 0 \leq t \leq n h_{n} . vs . H_{1} : not H_{0} .

H_{0} : σ does not change over 0 \leq t \leq n h_{n} . vs . H_{1} : not H_{0} .

\hat{Z}_{i} := \frac{X _{t_{i}} - X _{t_{i - 1}} - a ( X _{t_{i - 1}} , θ ^ _{n} ) h _{n}}{h _{n} σ ^ _{n}}, i = 1, \dots, n,

\hat{Z}_{i} := \frac{X _{t_{i}} - X _{t_{i - 1}} - a ( X _{t_{i - 1}} , θ ^ _{n} ) h _{n}}{h _{n} σ ^ _{n}}, i = 1, \dots, n,

X_{t_{i}} \approx X_{t_{i - 1}} + a (X_{t_{i - 1}}, θ) h_{n} + σ h_{n} \frac{W _{t_{i}} - W _{t_{i - 1}}}{h _{n}} .

X_{t_{i}} \approx X_{t_{i - 1}} + a (X_{t_{i - 1}}, θ) h_{n} + σ h_{n} \frac{W _{t_{i}} - W _{t_{i - 1}}}{h _{n}} .

\displaystyle T_{n}:=\frac{1}{\sqrt{n}\hat{\tau}_{n}}\max_{1\leq k\leq n}\Big{|}\sum_{i=1}^{k}\hat{Z}^{2}_{i}-\frac{k}{n}\sum_{i=1}^{n}\hat{Z}_{i}^{2}\Big{|}

\displaystyle T_{n}:=\frac{1}{\sqrt{n}\hat{\tau}_{n}}\max_{1\leq k\leq n}\Big{|}\sum_{i=1}^{k}\hat{Z}^{2}_{i}-\frac{k}{n}\sum_{i=1}^{n}\hat{Z}_{i}^{2}\Big{|}

T_{n} \to d 0 \leq t \leq 1 sup ∣ W_{t}^{o} ∣ a s n \to \infty,

T_{n} \to d 0 \leq t \leq 1 sup ∣ W_{t}^{o} ∣ a s n \to \infty,

(\hat{θ}_{n}^{α}, \overset{σ}{^}_{n}^{α}) = (θ, σ) \in Θ argmin \frac{1}{n} i = 1 \sum n H_{i}^{α} (θ, σ),

(\hat{θ}_{n}^{α}, \overset{σ}{^}_{n}^{α}) = (θ, σ) \in Θ argmin \frac{1}{n} i = 1 \sum n H_{i}^{α} (θ, σ),

H_{i}^{α} (θ, σ)

H_{i}^{α} (θ, σ)

\displaystyle=\!\left\{\begin{array}[]{ll}\displaystyle\frac{1}{\sigma^{\alpha}}\Big{[}\frac{1}{\sqrt{1+\alpha}}-\Big{(}1+\frac{1}{\alpha}\Big{)}\,\exp\Big{\{}-\frac{\alpha}{2\sigma^{2}h_{n}}(X_{t_{i}}-X_{t_{i-1}}-a(X_{t_{i-1}},\theta)h_{n})^{2}\Big{\}}\Big{]}&,\alpha>0\,,\\ \\ \displaystyle\frac{1}{\sigma^{2}h_{n}}\big{(}X_{t_{i}}-X_{t_{i-1}}-a(X_{t_{i-1}},\theta)h_{n}\big{)}^{2}+\log\sigma^{2}&,\alpha=0\,.\end{array}\right.

f_{1, M} (x)

f_{1, M} (x)

f_{2, M} (x)

\displaystyle T_{j,n}^{\alpha}:=\frac{1}{\sqrt{n}\hat{\tau}_{j,n}}\max_{1\leq k\leq n}\Big{|}\sum_{i=1}^{k}f_{j,M}(\hat{Z}^{2}_{\alpha,i})-\frac{k}{n}\sum_{i=1}^{n}f_{j,M}(\hat{Z}^{2}_{\alpha,i})\Big{|},

\displaystyle T_{j,n}^{\alpha}:=\frac{1}{\sqrt{n}\hat{\tau}_{j,n}}\max_{1\leq k\leq n}\Big{|}\sum_{i=1}^{k}f_{j,M}(\hat{Z}^{2}_{\alpha,i})-\frac{k}{n}\sum_{i=1}^{n}f_{j,M}(\hat{Z}^{2}_{\alpha,i})\Big{|},

T_{j, n}^{α} \to d 0 \leq t \leq 1 sup ∣ W_{t}^{o} ∣ a s n \to \infty, f or j = 1, 2.

T_{j, n}^{α} \to d 0 \leq t \leq 1 sup ∣ W_{t}^{o} ∣ a s n \to \infty, f or j = 1, 2.

\hat{Z}_{α, i} = \frac{X _{t_{i}} - X _{t_{i - 1}}}{h _{n} σ ^ _{n}^{α}} + a (X_{t_{i - 1}}, \hat{θ}_{n}^{α}) O_{p} (h_{n}) .

\hat{Z}_{α, i} = \frac{X _{t_{i}} - X _{t_{i - 1}}}{h _{n} σ ^ _{n}^{α}} + a (X_{t_{i - 1}}, \hat{θ}_{n}^{α}) O_{p} (h_{n}) .

\displaystyle\tilde{T}_{j,n}^{\alpha}:=\frac{1}{\sqrt{n}\hat{\tau}_{g,j}}\max_{1\leq k\leq n}\Big{|}\sum_{i=1}^{k}g_{j,M}^{2}(\hat{Z}_{i}^{\alpha})-\frac{k}{n}\sum_{i=1}^{n}g_{j,M}^{2}(\hat{Z}_{i}^{\alpha})\Big{|},

\displaystyle\tilde{T}_{j,n}^{\alpha}:=\frac{1}{\sqrt{n}\hat{\tau}_{g,j}}\max_{1\leq k\leq n}\Big{|}\sum_{i=1}^{k}g_{j,M}^{2}(\hat{Z}_{i}^{\alpha})-\frac{k}{n}\sum_{i=1}^{n}g_{j,M}^{2}(\hat{Z}_{i}^{\alpha})\Big{|},

d X_{t} = - θ X_{t} d t + σ d W_{t}, X_{0} = 0.

d X_{t} = - θ X_{t} d t + σ d W_{t}, X_{0} = 0.

P {0 \leq t \leq 1 sup ∣ W_{t}^{o} ∣ \leq u} = k = - \infty \sum \infty (- 1)^{k} exp (- 2 k^{2} u^{2}) .

P {0 \leq t \leq 1 sup ∣ W_{t}^{o} ∣ \leq u} = k = - \infty \sum \infty (- 1)^{k} exp (- 2 k^{2} u^{2}) .

d X_{t} = λ (μ - X_{t}) d t + σ d W_{t},

d X_{t} = λ (μ - X_{t}) d t + σ d W_{t},

\hat{X}_{t_{s + 1}} = X_{t_{s}} + \hat{λ}_{t_{s}} (X_{t_{s}} - \overset{μ}{^}_{t_{s}}) h_{n},

\hat{X}_{t_{s + 1}} = X_{t_{s}} + \hat{λ}_{t_{s}} (X_{t_{s}} - \overset{μ}{^}_{t_{s}}) h_{n},

\text{RMSE}=\sqrt{\frac{1}{78}\sum_{s=660}^{737}(X_{t_{s}}-\hat{X}_{t_{s}})^{2}}\qquad\text{and}\qquad\text{RMSPE}=\sqrt{\frac{1}{78}\sum_{s=660}^{737}\bigg{(}\frac{X_{t_{s}}-\hat{X}_{t_{s}}}{X_{t_{s}}}\bigg{)}^{2}},

\text{RMSE}=\sqrt{\frac{1}{78}\sum_{s=660}^{737}(X_{t_{s}}-\hat{X}_{t_{s}})^{2}}\qquad\text{and}\qquad\text{RMSPE}=\sqrt{\frac{1}{78}\sum_{s=660}^{737}\bigg{(}\frac{X_{t_{s}}-\hat{X}_{t_{s}}}{X_{t_{s}}}\bigg{)}^{2}},

\displaystyle Z_{i}=\frac{1}{\sqrt{h_{n}}}\big{(}W_{t_{i}}-W_{t_{i-1}})\quad and\quad\Delta_{i}=\int_{t_{i-1}}^{t_{i}}\big{\{}a(X_{s},\theta_{0})-a(X_{t_{i-1}},\theta_{0})\big{\}}ds.

\displaystyle Z_{i}=\frac{1}{\sqrt{h_{n}}}\big{(}W_{t_{i}}-W_{t_{i-1}})\quad and\quad\Delta_{i}=\int_{t_{i-1}}^{t_{i}}\big{\{}a(X_{s},\theta_{0})-a(X_{t_{i-1}},\theta_{0})\big{\}}ds.

1 \leq i \leq n max E (∣ Δ_{i} ∣^{k}) ≲ h_{n}^{1.5 k} for any k \in N .

1 \leq i \leq n max E (∣ Δ_{i} ∣^{k}) ≲ h_{n}^{1.5 k} for any k \in N .

\displaystyle n^{q}h_{n}^{m}\max_{i\leq n}\big{|}(1+|X_{t_{i-1}}|)^{C}\,Z_{i}^{k}\,\Delta_{i}^{l}\big{|}=o(1)\quad\textit{a.s.},

\displaystyle n^{q}h_{n}^{m}\max_{i\leq n}\big{|}(1+|X_{t_{i-1}}|)^{C}\,Z_{i}^{k}\,\Delta_{i}^{l}\big{|}=o(1)\quad\textit{a.s.},

\displaystyle\sum_{n=1}^{\infty}P\Big{(}n^{q}h_{n}^{m}\max_{i\leq n}\big{|}(1+|X_{t_{i-1}}|)^{C}Z_{i}^{k}\Delta_{i}^{l}\big{|}>\epsilon\Big{)}

\displaystyle\sum_{n=1}^{\infty}P\Big{(}n^{q}h_{n}^{m}\max_{i\leq n}\big{|}(1+|X_{t_{i-1}}|)^{C}Z_{i}^{k}\Delta_{i}^{l}\big{|}>\epsilon\Big{)}

\frac{1}{n} i = 1 \sum n g_{n} (Z_{i}) f (X_{t_{i - 1}}) - E g (Z) \int f (x) μ_{0} (d x) = o (1) a . s ..

\frac{1}{n} i = 1 \sum n g_{n} (Z_{i}) f (X_{t_{i - 1}}) - E g (Z) \int f (x) μ_{0} (d x) = o (1) a . s ..

\frac{1}{n h _{n}} \int_{0}^{n h_{n}} f (X_{s}) d s ⟶ a . s . \int f (x) μ_{0} (d x) as n \to \infty.

\frac{1}{n h _{n}} \int_{0}^{n h_{n}} f (X_{s}) d s ⟶ a . s . \int f (x) μ_{0} (d x) as n \to \infty.

\displaystyle E\Big{|}\frac{1}{n}\sum_{i=1}^{n}\frac{1}{h_{n}}\int_{t_{i-1}}^{t_{i}}\big{\{}f(X_{t_{i-1}})-f(X_{s})\big{\}}ds\Big{|}^{2r}

\displaystyle E\Big{|}\frac{1}{n}\sum_{i=1}^{n}\frac{1}{h_{n}}\int_{t_{i-1}}^{t_{i}}\big{\{}f(X_{t_{i-1}})-f(X_{s})\big{\}}ds\Big{|}^{2r}

\displaystyle\leq\frac{1}{nh_{n}}\sum_{i=1}^{n}\int_{t_{i-1}}^{t_{i}}\big{\{}E(X_{t_{i-1}}-X_{s})^{4r}\big{\}}^{\frac{1}{2}}\Big{[}\int_{0}^{1}E\big{\{}\partial_{x}f(X_{s}+u(X_{t_{i-1}}-X_{s}))\big{\}}^{4r}du\Big{]}^{\frac{1}{2}}ds

= O (h_{n}^{r}) = o (n^{- r /2}) .

\displaystyle\sum_{n=1}^{\infty}P\Big{(}\Big{|}\frac{1}{n}\sum_{i=1}^{n}\frac{1}{h_{n}}\int_{t_{i-1}}^{t_{i}}\big{\{}f(X_{t_{i-1}})-f(X_{s})\big{\}}ds\Big{|}>\epsilon\Big{)}\lesssim\sum_{n=1}^{\infty}o(n^{-r/2})<\infty,

\displaystyle\sum_{n=1}^{\infty}P\Big{(}\Big{|}\frac{1}{n}\sum_{i=1}^{n}\frac{1}{h_{n}}\int_{t_{i-1}}^{t_{i}}\big{\{}f(X_{t_{i-1}})-f(X_{s})\big{\}}ds\Big{|}>\epsilon\Big{)}\lesssim\sum_{n=1}^{\infty}o(n^{-r/2})<\infty,

\frac{1}{n} i = 1 \sum n f (X_{t_{i - 1}}) ⟶ a . s . \int f (x) μ_{0} (d x) as n \to \infty.

\frac{1}{n} i = 1 \sum n f (X_{t_{i - 1}}) ⟶ a . s . \int f (x) μ_{0} (d x) as n \to \infty.

S_{i}:=\big{\{}g_{n}(Z_{i})-E[g_{n}(Z_{1})]\big{\}}f(X_{t_{i-1}}).

S_{i}:=\big{\{}g_{n}(Z_{i})-E[g_{n}(Z_{1})]\big{\}}f(X_{t_{i-1}}).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Financial Risk and Volatility Modeling · Stochastic processes and financial applications

Full text

Robust test for dispersion parameter change

in discretely observed diffusion processes

Junmo Song Department of Statistics, Kyungpook National University, 80 Daehakro, Bukgu, Daegu, 41566, Korea. Email: [email protected] Department of Statistics, Kyungpook National University

Abstract

This paper deals with the problem of testing for dispersion parameter change in discretely observed diffusion processes when the observations are contaminated by outliers. To lessen the impact of outliers, we first calculate residuals using a robust estimate and then propose a trimmed-residual based CUSUM test. The proposed test is shown to converge weakly to a function of the Brownian bridge under the null hypothesis of no parameter change. We conduct simulations to evaluate performances of the proposed test in the presence of outliers. Numerical results confirm that the proposed test posses a strong robust property against outliers. In real data analysis, we fit the Ornstein-Uhlenbeck process to KOSPI200 volatility index data and locate some change points that are not detected by a naive CUSUM test.

Key words and phrases: Diffusion processes, parameter change test, dispersion parameter, outliers, CUSUM of squares test, robust test.

1 Introduction

Diffusion processes are usually expressed as solutions to stochastic differential equations (SDEs). Since SDEs are useful in describing stochastic phenomena, diffusion processes have long been popular in various fields. In the field of finance, for example, the processes have been widely used to model the prices of underlying assets and instantaneous interest rates. Naturally, the need for statistical inference on diffusion processes has increased. In particular, estimation of discretely observed diffusion processes has attracted much attention. See Dacunha-Castelle and Florens-Zmirou (1986), Kessler (1997), Aït-Sahalia (2002), and Beskos et al (2009). Statistical testings such as parameter change test and specification test have also investigated by some authors. See, for example, Iacus and Yoshida (2012) and Chen et al. (2008).

In this study, we are concerned with change point problem in diffusion processes. It is well known that ignoring changes can lead to false inference. Hence, change point problem has received a great deal of attention from researchers and practitioners. See the recent review paper by Horváth and Rice (2015). For diffusion processes, Gregorio and Iacus (2008), Song and Lee (2009), Lee (2011), and Iacus and Yoshida (2012) investigated the problem of testing for dispersion parameter constancy in discretely observed diffusion processes. Since the dispersion parameter is closely related to the volatility of underlying assets and the volatility plays a crucial role in pricing financial derivatives, the exact inference on the dispersion parameter is particularly important in financial applications. In the cases where a continuous observation is assumed to be obtained, detection of drift parameter change is typically considered because dispersion coefficient can be exactly estimated in this framework. See, for example, Negri and Nishiyama (2012) and Tsukuda (2017).

This paper focuses on the problem of detecting the dispersion parameter change, particularly when a data set includes deviating observations. In the literature, deviating observations are commonly treated as jumps or outliers. In the former cases, stochastic models with jump terms, usually induced by Poisson processes, have been proposed to describe spiky observations. See, for example, Kou (2002). In the latter cases, on the other hand, various robust methods for reducing the effect of outliers have been developed. For an overview on this area, we refer the reader to Maronna et al. (2006). In this study, we deal with the outlying observations from the latter point of view.

As is widely recognized, statistical inference such as estimation and testing are unduly influenced by outliers. Recently, Lee and Song (2013) and Song (2017) addressed that estimation of diffusion processes tends to be severely damaged by a small portion of outliers, particularly when sampling interval is short, as in high-frequency sampling cases. This is largely due to the fact that the transition distribution of the diffusion process approaches Gaussian distribution as the sampling interval gets shorter. It should be noted that tests constructed using an estimator sensitive to outliers are likely to lead to false conclusions. Furthermore, such events that can cause deviating observations or parameter changes in fitted model are often observed in actual practice. In finance, changes of monetary policy and critical social events can be examples. When outlying observations are included in a data set being suspected of having parameter changes, it is not easy to determine whether the testing results are due to genuine changes or not. These technical and empirical reasons motivate us to consider the problem.

The objective of this paper is to propose a parameter change test that is robust against outliers. We introduce a very intuitive and easy-to-implement test procedure: to lessen the impact of outliers on the procedure, (i) we first calculate residuals using a robust estimate and truncate the squares of the obtained residuals; (ii) then, we construct a CUSUM test using the trimmed ones. As a robust estimator, we employ the minimum density power divergence estimator (MDPDE) for diffusion processes introduced by Lee and Song (2013). Our simulation study below shows that the proposed test has strong robustness against outliers, whereas a naive CUSUM test without any robust procedure is seriously compromised by outliers. Further, our real data application demonstrates that analysis incorporating the proposed test can improve forecasting performance.

The rest of the paper is organized as follows. In Section 2, we introduce the residual-based CUSUM test and the MDPDE for diffusion processes. Then, we propose a robust CUSUM test for parameter change and derive its asymptotic null distribution. In Section 3, we conduct a simulation study to investigate the finite sample performance. Section 4 illustrates a real data application to KOSPI200 volatility index. Section 5 concludes and technical proofs are given in Section 6.

2 Main Result

Let us consider the following time-homogeneous diffusion process $\{X_{t}|t\geq 0\}$ defined by

[TABLE]

where $(\theta,\sigma)\in\mathbb{R}^{p}\times\mathbb{R}^{+}$ is unknown parameter and $\{W_{t}|t\geq 0\}$ denotes the standard Wiener process. The real valued function $a$ is assumed to be known apart from $\theta$ and smooth enough to admit a unique solution. We assume that a sample $\{X_{t_{i}}|0\leq i\leq n\}$ is discretely observed, where $t_{i}=ih_{n}$ and $\{h_{n}\}$ is a sequence of positive numbers with $h_{n}\rightarrow 0$ and $nh_{n}\rightarrow\infty$ . It is noteworthy that the diffusion processes of the form $dX_{t}=a(X_{t},\theta)dt+\sigma\,b(X_{t})dW_{t}$ can be reduced to (1) by using the Lamperti transformation and Ito’s lemma.

2.1 Naive CUSUM test for diffusion processes

Based on the discrete observations, we now wish to test the following hypotheses:

[TABLE]

For this, we employ the CUSUM of squares test based on residuals as in Lee (2011). Residuals for the diffusion process (1) are defined as follows:

[TABLE]

where $(\hat{\theta}_{n},\hat{\sigma}_{n})$ is an estimate of $(\theta,\sigma)$ . The above is deduced from the following Euler approximation of (1)

[TABLE]

Using the residuals, we first introduce the CUSUM of squares statistics:

[TABLE]

where $\displaystyle\hat{\tau}_{n}^{2}=\frac{1}{n}\sum_{i=1}^{n}\hat{Z}_{i}^{4}-\Big{(}\frac{1}{n}\sum_{i=1}^{n}\hat{Z}_{i}^{2}\Big{)}^{2}$ .

In order to establish the limiting null distribution of $T_{n}$ , the following regularity conditions are required. We assume that the true parameter $(\theta_{0},\sigma_{0})$ belongs to the parameter space $\Theta$ , which is a bounded subset of $\mathbb{R}^{p}\times[c,\infty)$ for some $c>0$ .

A0.

The estimator $(\hat{\theta}_{n},\hat{\sigma}_{n})$ satisfies that $\sqrt{nh_{n}}||\hat{\theta}_{n}-\theta_{0}||=O_{P}(1)$ and $\sqrt{n}|\hat{\sigma}_{n}-\sigma_{0}|=O_{P}(1)$ . 2. A1.

There exists a constant $C>0$ such that $|a(x,\theta_{0})-a(y,\theta_{0})|\leq C|x-y|$ for any $x,y\in\mathbb{R}$ . 3. A2.

The process $X$ from (1) is ergodic with its invariant measure $\mu_{0}$ such that $\int x^{k}\mathrm{d}\mu_{0}(x)<\infty$ for all $k\geq 0$ . 4. A3.

$\sup_{t}E|X_{t}|^{k}<\infty$ for all $k\geq 0$ . 5. A4.

The function $a$ is continuously differentiable with respect to $x$ for all $\theta$ and the derivatives belong to $\mathcal{P}:=\{f(x,\theta)\big{|}\ |f|\leq C(1+|x|^{C})\ \textrm{for some }\ C>0\},$ where $C$ does not depend on the parameter.

Then, from Lemma 6.3 with $S_{n}=\mathbb{R}$ , we can obtain the following result.

Theorem 2.1.

Assume that A0 – A4 hold. If $nh_{n}^{2}\rightarrow 0$ , then under $H_{0}$ ,

[TABLE]

where $\{W_{t}^{o}|t\geq 0\}$ denotes a standard Brownian bridge.

2.2 Robust CUSUM test for diffusion processes

Now, we consider the situation where the observations are contaminated by outliers. It is well known that estimators using a Gaussian quasi-likelihood are strongly influenced by outliers. Also, it should be recalled that many estimation methods for diffusion processes employ the maximum likelihood (ML) technique based on an approximated transition density; see, for example, Li (2013) and the papers therein. As aforementioned in the Introduction, since the transition distributions of the diffusion processes get close to the normal distribution as the sampling interval approaches zero, such estimators using ML methods are likely to produce biased estimates in the presence of outliers. Therefore, it can be naturally surmised that the residuals calculated from those biased estimates will not behave like ideal residuals, that is, i.i.d. random variables, and subsequently lead to a distortion of $T_{n}$ .

In order to remedy the problem, a robust estimator is first used to lessen the effect of outliers on parameter estimation. Next, note that even if the parameters are properly estimated by a robust estimation method, the residuals corresponding to outliers still deviate from normal range. Thus, they need to be truncated to prevent damaging the test procedure. In this study, we employ the MDPDE as a robust estimator. Further, in order to avoid some technical problems in the proofs, we use the trimmed ones of the squared residuals to construct test statistics in stead of using the squares of the truncated residuals.

Lee and Song (2013) introduced a robust estimator for diffusion processes (1) using the density power divergence by Basu et al. (1998), and demonstrated in the simulation study that the estimator has a strong robust property with little loss in asymptotic efficiency relative to the ML estimator (MLE). The MDPDE for (1) is defined as

[TABLE]

where

[TABLE]

Here, the tuning parameter $\alpha$ controls the trade-off between robustness and efficiency in the estimation procedure. Note that the estimator with $\alpha=0$ becomes the Gaussian quasi-MLE. For more details on the MDPDE and its properties, see Basu et al. (1998).

We consider the following functions to truncate the residuals: for a given positive number $M$ and $x\geq 0$ ,

[TABLE]

where $1_{A}(x)$ is the indicator function of the set $A$ . Using the trimming function, we now propose a CUSUM test as follows: for $j=1,2$ ,

[TABLE]

where $\hat{Z}_{\alpha,i}$ ’s are the ones calculated from (2) using the MDPD estimate $(\hat{\theta}_{n}^{\alpha},\hat{\sigma}_{n}^{\alpha})$ and $\displaystyle\hat{\tau}_{j,n}^{2}=\frac{1}{n}\sum_{i=1}^{n}f_{j,M}^{2}(\hat{Z}_{\alpha,i}^{2})-\Big{(}\frac{1}{n}\sum_{i=1}^{n}f_{j,M}(\hat{Z}_{\alpha,i}^{2})\Big{)}^{2}$ .

To derive the asymptotic null distribution of $T_{j,n}^{\alpha}$ , $\sqrt{nh_{n}}||\hat{\theta}_{n}^{\alpha}-\theta_{0}||=O_{P}(1)$ and $\sqrt{n}|\hat{\sigma}_{n}^{\alpha}-\sigma_{0}|=O_{P}(1)$ are required. For this, we additionally assume the following conditions to ensure the stochastic boundedness (cf. see Lee and Song (2013)).

A5.

$\Theta$ is convex compact and $(\theta_{0},\sigma_{0})$ lies in the interior of $\Theta$ . 2. A6.

The function $a$ and all its $x$ -derivatives are three times differentiable with respect to $\theta$ for all $x$ . Moreover, these derivatives up to the third order with respect to $\theta$ belong to $\mathcal{P}$ . 3. A7.

If $\mu_{0}(a(x,\theta)=a(x,\theta_{0}))=1$ , then $\theta=\theta_{0}$ . 4. A8.

$\int\partial_{\theta}a(x,\theta_{0})\,\partial_{\theta^{T}}a(x,\theta_{0})d\mu_{0}(x)$ is positive definite, where $\partial_{\theta}a=\partial a/\partial\theta$ .

The following theorem is the main result of this paper. The proposed test has the same limiting null distribution as $T_{n}$ .

Theorem 2.2.

Assume that A1-A8 hold. For each $\alpha\geq 0$ , if $nh_{n}^{2}\rightarrow 0$ , then under $H_{0}$ ,

[TABLE]

Remark 1. For the selection of the tuning constant $M$ in $f_{j,M}$ , we rely on the fact that under $H_{0}$ , the distribution of $Z_{\alpha,i}$ is approximated to $N(0,1)$ as $h_{n}$ goes to 0. Depending on the extent of contamination, one can choose $M$ as the squared number of a proper quantile of the standard normal distribution. For example, if it seems that contamination is low or it is not certain of contamination, one can use the 99.5% quantile, i.e., $M=2.576^{2}$ . Although it is not easy to assess the degree of contamination, we propose to use $M$ in $[z_{0.025}^{2},z_{0.005}^{2}]=[3.84,6.63]$ based on our simulation results.

Remark 2. The proposed test is not suitable for detecting changes in the drift parameter. The main reason seems to be that a change in $\theta$ could not make a significant effect on $\hat{Z}_{\alpha,i}$ when $h_{n}$ is small. To see this, note that

[TABLE]

Roughly speaking, whatever the estimated value of $\theta$ is, the influence of $a(X_{t_{i-1}},\hat{\theta}_{n}^{\alpha})$ on $\hat{Z}_{\alpha,i}$ become reduced when $h_{n}$ is small, whereas the estimate of $\sigma$ can make great differences in the residuals. This is why the residual-based CUSUM test is not sensitive to the drift parameter change but is sensitive to the dispersion parameter change. Such tendency that the residual-based test misses a change of certain parameter has been reported, for example, in Lee (2011) for diffusion process and Song and Kang (2018) for ARMA-GARCH models. As will be seen in the following section, in the cases that only the drift parameter is changed, all tests considered produce empirical powers close to significance level. This means that the change of the drift parameter does not affect the performance of the tests. When the tests reject the null hypothesis, one can therefore conclude that the dispersion parameter has changed.

Remark 3. Any other robust estimators satisfying $\sqrt{nh_{n}}||\hat{\theta}_{n}-\theta_{0}||=O_{P}(1)$ and $\sqrt{n}|\hat{\sigma}_{n}-\sigma_{0}|=O_{P}(1)$ can be used in the test procedure.

Remark 4. Other types of functions can be employed to trim the squared residuals. For example, Hampel’s function in Andrews et al. (1972), which indeed is an intermediate form between $f_{1,M}$ and $f_{2,M}$ , can be used. The performance of the test may be different depending on the trimming functions and the tuning constant $M$ . For the Ornstein-Uhlenbeck process, $f_{2,M}$ with $M=z_{0.005}^{2}$ showed best performance, see the simulation study below.

Remark 5. One may consider the CUSUM of squares test based on the trimmed residuals, that is,

[TABLE]

where $g_{j,M}(x)=\textrm{sign}(x)f_{j,M}(|x|)$ and $\hat{\tau}_{g,j}$ is the sample variance of $\{g_{j,M}^{2}(\hat{Z}_{i}^{\alpha})\}$ . In this case, the tuning constant $M$ is chosen as the just quantile of $N(0,1)$ . According to our simulation results (not reported), the performances of $T_{j,n}^{\alpha}$ and $\tilde{T}_{j,n}^{\alpha}$ are almost similar. As mentioned above, we present $T_{j,n}^{\alpha}$ since it is easier to handle in deriving its asymptotic distribution.

3 Simulation study

In the present simulation, we compare the performances of the naive test $T_{n}$ and the proposed tests $T_{1,n}^{\alpha}$ and $T_{2,n}^{\alpha}$ . For this task, we consider the following Ornstein-Uhlenbeck (OU) process:

[TABLE]

The sample $\{X_{t_{i}}\}_{i=0}^{n}$ is obtained with the sampling interval of $h_{n}=n^{-0.75}$ , where the path of $X$ is generated via the Euler scheme with the generating interval of $h_{n}/20$ . For the tuning constant $M$ , we use $z_{0.005}^{2}$ (=6.63) and $z_{0.025}^{2}$ (=3.84). To evaluate the empirical sizes, we generate paths with $(\theta,\sigma)$ =(1,1). For the powers, we change the parameter $(\theta,\sigma)$ from (1,1) to (1,1.2), (1,1.5), (5,1), and (5,1.2) at the midpoint $t=nh_{n}/2$ . Empirical sizes and powers are calculated at 5% significance level, based on 5,000 repetitions. The corresponding critical value is 1.358, which is obtained from the following well-known formula:

[TABLE]

We first examine the case where the data is not contaminated by outliers. The empirical sizes and powers are presented in Tables 1 and 2, respectively. One can see that $T_{n}$ , $T_{1,n}^{\alpha}$ , and $T_{2,n}^{\alpha}$ show no size distortions and produce reasonably good powers against the change of the dispersion parameter $\sigma$ , regardless of whether the drift parameter $\theta$ changes or not. It is, however, observed that all the tests can not detect the change of $\theta$ as mentioned in Remark 2. They produces empirical powers very close to the significance level. $T_{n}$ and $T_{1,n}^{\alpha}$ with $M=z_{0.005}^{2}$ perform similarly, and $T_{1,n}^{\alpha}$ with $M=z_{0.025}^{2}$ and $T_{2,n}$ with $M=z_{0.005}^{2}$ are found to yield slightly smaller powers. $T_{2,n}^{\alpha}$ with $M=z_{0.025}^{2}$ is comparatively less powerful, but its power approaches 1 as the sample size increases. Interestingly, MDPDE’s tuning parameter $\alpha$ does not make a significant difference in the performance. As will be seen in the contaminated cases below, the performance of the proposed tests are also not significantly different depending on the value of $\alpha$ , so the choice of $\alpha$ does not seem to be critical in the testing procedure. Although not reported here, we can see that the powers tend to decrease with a decrease in $M$ but no size distortions are found.

Next, to explore the cases where outliers are involved in the data, we generate contaminated sample $\{X_{t_{i}}^{c}\}_{i=0}^{n}$ by the following scheme: $X_{t_{i}}^{c}\,=X_{t_{i}}\,+\,p_{i}\,|V_{i}|\times sign(X_{t_{i}})\,$ , where $\{p_{i}\}_{i=0}^{n}$ and $\{V_{i}\}_{i=0}^{n}$ are sequences of i.i.d. random variables from Bernoulli distribution with success probability $p$ and normal distribution with mean 0 and variance $\sigma_{v}^{2}$ , respectively; $\{X_{t_{i}}\}$ , $\{p_{i}\}$ , and $\{V_{i}\}$ are assumed to be all independent. We consider the cases of $p=0.5\%$ and $5\%$ to describe a low degree of contamination and more severely contaminated situation, respectively, and $\sigma_{v}^{2}$ is set to 1. The empirical sizes and powers are provided in Tables 3 - 6. We first note that $T_{n}$ exhibits size distortions and significant power losses whereas $T_{2,n}$ dramatically eliminates the impact of outliers. $T_{2,n}$ with $M=z_{0.005}^{2}$ outperforms other tests in all cases considered. On the other hand, $T_{1,n}$ shows relatively good performance in the low contaminated case, i.e., $p=0.5\%$ , but are observed to be somewhat affected by outliers when $p=5\%$ . In this case, $T_{1,n}$ with the smaller $M$ (= $z_{0.025}^{2}$ ) shows more robust behavior than $T_{1,n}$ with $z_{0.005}^{2}$ .

Our findings can evidently be seen in Figures 2 and 2. Figure 2 presents the plots of the empirical sizes of $T_{n}$ and $T_{j,n}^{\alpha=0.2}$ versus the degree of contamination $p\in\{0\%,0.05\%,1\%,\cdots,5\%\}$ . The upper and lower panels depict the results in the cases of $\sigma_{v}^{2}=1$ and $\sigma_{v}^{2}=2$ , respectively. One can clearly see the severe size distortions of $T_{n}$ in the first column in Figure 2. As can be seen in Table 3, $T_{n}$ is damaged even by the small portion of contamination, i.e., $p=0.5\%$ . It should also be noted that the distortion gets worse as $n$ increases, indicating that particular attention should be paid when dealing with high-frequency data. This is because the closer the transition distribution gets to normal distribution, the more affected it is by outliers. Upward trends in the second and third columns suggest that $T_{1,n}^{\alpha}$ may not be suitable for high-contaminated cases. The last two columns show strong robustness of $T_{2,n}^{\alpha}$ yielding no size distortions in all cases. Figure 2 displays the power curves when $\sigma$ changes from 1 to $\sigma_{1}$ at midpoint. The upper and lower panels presents for the case of $p=0.5\%,\sigma_{v}^{2}=1$ (low contaminated case) and $p=5\%,\sigma_{v}^{2}=2$ (severely contaminated case), respectively. Unlike $T_{n}$ showing power losses, $T_{1,n}^{\alpha}$ and $T_{1,n}^{\alpha}$ yield reasonable powers in both cases. In particular, $T_{2,n}^{\alpha}$ with $M=z^{2}_{0.005}$ is observed to perform best.

Overall, the results above support the validity of the proposed test. In this simulation section, we see that our proposed test keeps good sizes and powers in the presence of outliers, while the naive test $T_{n}$ shows severe size distortions and significant power loses. Therefore, our test can be a functional tool to test for parameter change when outliers are speculated to contaminate data.

4 Real data analysis

We analyse daily time series of KOSPI200 volatility (VKSOPI200) index. Like the VIX index (the Chicago Board Options Exchange volatility index), VKOPSI200 index is designed to measure 30-day expected volatility of KOSPI200 index. The data analyzed here is depicted in Figure 3, covering 737 trading days from Jan 2, 2015 to Dec 28, 2017. As a key characteristic, the plot clearly shows a volatile and mean-reverting behaviour. One can also see a number of deviating observations. To capture the mean-reversion, we employ the following OU process:

[TABLE]

where $X_{t}$ is the value of VKOSPI200 index at time $t$ . One may consider the OU process with jump component to accommodate the spiky observations, but in the present analysis, we regard these observations as outliers and examine whether or not there were parameter changes in the fitted model. We conduct the naive test $T_{n}$ and the proposed test $T_{1,n}^{\alpha}$ and $T_{2,n}^{\alpha}$ to the data until Aug 31, 2017. Relying on the results in the simulation study, our decision is, however, made based on $T_{n,2}^{\alpha}$ with $M=z_{0.005}^{2}$ . The remaining data set after Sep 1, 2017 is used to compare the performances of the models without break and with breaks obtained by $T_{2,n}^{\alpha}$ .

Table 7 presents the test statistics and p-values of $T_{n}$ , $T_{1,n}^{\alpha}$ and $T_{2,n}^{\alpha}$ . We first note that the p-value of $T_{n}$ is obtained to be 0.319 whereas all the p-values of $T_{1,n}^{\alpha}$ and $T_{2,n}^{\alpha}$ are less then 0.05. As observed in the simulation study, this indicates that the outlying observations are highly likely to have hindered $T_{n}$ from detecting a significant parameter change. To find further changes, we use the binary segmentation procedure (cf. Aue and Horváth (2013)) and locate three more breaks. The sub-periods divided by the estimated change-points and the ML and MDPD estimates for each sub-period are reported in Table 8. As mentioned in Remark 2, the tests are difficult to detect changes in drift parameter and therefore it should be interpreted that the obtained sub-periods are due to the changes in the dispersion parameter. In Table 8, one can see evident changes in $\hat{\sigma}$ . Although the period is divided by the changes in $\sigma$ , other parameters $\lambda$ and $\mu$ are also estimated differently in each sub-period. Here, it is noteworthy that the difference between the ML and the MDPD estimates of $\sigma$ is comparatively large in the first and the last sub-periods. Since MLE and MDPDE tend to yield similar estimates when the portion of outliers is small, we can surmise that the first and the last periods include some outliers that may affect the ML estimates. The estimated change-points and $\hat{\mu}$ are displayed in Figure 5, where the red and blue dashed lines stand for $\hat{\mu}$ by MLE and MDPDE with $\alpha=0.1$ , respectively.

Finally, we compare the OU processes without and with parameter changes in terms of the superiority in forecasting performance. Hereafter, we denote the processes with and without changes by OUchg process and OUno.chg process, respectively. Forecasting using the process with changes means that predicted values are obtained using the data from the last change point, i.e., Mar 3, 2016 in the present analysis. We use the Euler approximation in (3) to calculate one-step-ahead forecasts for the last four months, total 78 observations, as follows:

[TABLE]

where ( $\hat{\lambda}_{t_{s}},\hat{\mu}_{t_{s}}$ ) is an estimate based on the data up to $X_{t_{s}}$ . 95% prediction interval (PI) is given by $\hat{X}_{t_{s+1}}\pm 2\hat{\sigma}_{t_{s}}\sqrt{h_{n}}$ . The following root mean squared error (RMSE) and the root mean squared percentage error (RMSPE) are employed to evaluate forecasting performance:

[TABLE]

where $X_{t_{660}}$ is the index at Sep 1, 2017.

The forecasting errors and the number of the observations included in 95% PIs are presented in Table 9. The values in the left sub-table are obtained using the OUno.chg process and the data from Jan 2, 2015. On the other hand, as aforementioned, the one-step-ahead forecasts for the right sub-table are calculated using the data after Mar 2, 2016. The results in Table 9 show that the OUchg process outperforms the OUno.chg process. All the values of RMSE and RMSPE in the right sub-table are less than the corresponding values in the left sub-table. The OUno.chg process estimated by the MLE is shown to yield worst performance and the OUchg process estimated by the MDPDE with $\alpha=0.1$ and $\alpha=0.2$ show best performances in terms for RMSE and RMSPE, respectively. It is important to note that the OUchg process estimated by the MLE is superior to the OUno.chg process by the MDPDE, implying that the improvement of the forecasting performance by considering parameter changes is greater than by just using the robust estimator. Even though the predicted values are calculated depending only on the drift parameter estimate $(\hat{\lambda},\hat{\mu})$ , the forecasting results above strongly indicate that the OUchg process is better fitted to the data.

Figure 5 displays the predicted values and 95% PIs of the OUno.chg process with the ML estimates (left) and the OUchg process with the MDPD estimates by $\alpha=0.1$ (right). Although the PI of the OUno.chg process includes all the observations, it produces comparatively longer intervals. The average lengths of 95% PIs in the left and right sub-figures are 3.99 and 2.63, respectively. 75 observations (97.4%) are included in the PIs of the OUchg process, indicating that the process with changes produces reasonable PIs.

Our empirical findings support that the series are partitioned validly by the proposed test. The estimates in each sub-period are significantly different and the forecasting based on the last sub-period shows better performances. Political and economical events or crises often cause deviating observations in financial data and can also lead to structural changes in underlying models. Our analysis as well as simulation study demonstrates that in such situation where data includes seemingly outliers, the proposed test can effectively detect parameter changes that the existing tests may miss, hence improving forecasting performance.

5 Concluding remarks

We have proposed a robust test for dispersion parameter constancy in discretely observed diffusion processes. The idea used to construct the test is simple and easy to implement: the residuals are calculated using a robust estimate and then a CUSUM test statistics are constructed based on the truncated ones of squared residuals. The limiting null distribution of the proposed test is established and a simulation study demonstrates the promising performance of our test in the presence of outliers. The proposed test possesses a strong robust property against outliers, whereas the naive CUSUM test is observed to be severely damaged particularly when the sampling interval is short. Given the situations that high-frequency data have often been obtained, our test will be a good alternative to test for parameter change in such cases.

The extension to general diffusion processes such as $dX_{t}=a(X_{t},\theta)dt+b(X_{t},\sigma)dW_{t}$ is of natural interest. Our results are focused on diffusion processes, but we anticipate that our procedure can be applied to other time series models such as ARMA models and GARCH-type models. Once a robust estimator is given for each model, the same procedure can be adapted to construct test statistics. If one prove the results corresponding to Lemmas 6.3 and 6.4 below, the same asymptotic result in Theorem 2.2 will be obtained under other time series models. We leave these issues as possible topics of future research.

6 Proofs

Hereafter, we shall use the relation $A_{n}\lesssim B_{n}$ , where $A_{n}$ and $B_{n}$ are nonnegative, to mean that $A_{n}\leq CB_{n}$ for some constant $C>0$ and drop the $\alpha$ in $(\hat{\theta}_{n}^{\alpha},\hat{\sigma}_{n}^{\alpha})$ and $\hat{Z}_{\alpha,i}$ for notational simplicity. Further, we denote

[TABLE]

Then, $Z_{i}$ ’s are i.i.d. random variables from $N(0,1)$ and, according to Lemma 1 in Lee and Song (2013), we have

[TABLE]

Lemma 6.1.

Suppose that A1 and A3 hold. If $nh_{n}^{2}\rightarrow 0$ , then

[TABLE]

where $k,l\in\{0,1,2,\cdots\}$ and $m>-1.5l+2q$ .

Proof.

In view of A3 and (8), we have that for any $\epsilon,d>0$ ,

[TABLE]

Since $1.5l+m-2q>0$ , the lemma is yielded by choosing $d$ such that $1-(1.5l+m-2q)d/2<-1$ . ∎

Lemma 6.2.

Suppose that A1 - A3 hold and let $f(x)$ and $g_{n}(x)$ belong to $\mathcal{P}$ . Assume further that $\partial_{x}f$ exists and belongs to $\mathcal{P}$ ; $g_{n}$ converges almost surely to $g\in\mathcal{P}$ and is dominated by some function in $\mathcal{P}$ . If $nh_{n}^{2}\rightarrow 0$ , then

[TABLE]

Proof.

In view of ergodic property, we get

[TABLE]

Using Jensen’s inequality, Cauchy’s inequality and $E\big{|}X_{t}-X_{t_{i-1}}\big{|}^{k}\lesssim h_{n}^{k/2}$ (cf. Kessler (1997)), we have that for $r>0$

[TABLE]

Hence, we have for any $\epsilon>0$ and $r>2$ ,

[TABLE]

which together with (9) asserts

[TABLE]

Next, let

[TABLE]

Then, $\{S_{i}\}_{i=1}^{n}$ forms a martingale difference with respect to $\mathcal{G}_{i}=\sigma\{W_{s}:s\leq t_{i}\}$ . Thus, it follows from Burkholder’s inequality and Jensen’s inequality that for any $r>1$

[TABLE]

and consequently one can see that

[TABLE]

Since $E[g_{n}(Z_{1})]$ converges to $E[g(Z_{1})]$ by the dominated convergence theorem, the lemma is asserted from (10). ∎

Lemma 6.3.

Suppose that A0-A4 hold. For a subset $S\subset\mathbb{R}$ and any monotone sequence of subsets $\{S_{n}\}$ with $\lim_{n}S_{n}=S$ , if $nh_{n}^{2}\rightarrow 0$ , then

[TABLE]

where $1_{A}$ denotes the indicator function.

Proof.

Since

[TABLE]

we can express that

[TABLE]

where

[TABLE]

First, note that

[TABLE]

Since $\frac{1}{n}\sum_{i=1}^{n}1_{S_{n}}(Z_{i}^{2})Z_{i}^{2}$ converges almost surely by Lemma 6.2, we can obtain that

[TABLE]

which together with $\sqrt{n}|\hat{\sigma}_{n}-\sigma_{0}|=O_{P}(1)$ yields that(6) is $o_{P}(1)$ . In a similar fashion, by using $\sqrt{nh_{n}}||\hat{\theta}_{n}-\theta_{0}||=O_{P}(1)$ , one can show that

[TABLE]

Next, to show that the remaining terms in (11) are negligible, we note that

[TABLE]

Using $\sqrt{nh_{n}}||\hat{\theta}_{n}-\theta_{0}||=O_{P}(1)$ and Lemma 6.1, we have

[TABLE]

Also, it follows from (8) that

[TABLE]

which implies

[TABLE]

This completes the proof. ∎

Lemma 6.4.

Suppose that A0-A4 hold. For any $q<0.5$ , if $nh_{n}^{2}\rightarrow 0$ , then

[TABLE]

Proof.

Following the similar arguments in the proof of (13), the lemma can be obtained and thus we omit its proof. ∎

Lemma 6.5.

Suppose that A0-A4 hold. If $nh_{n}^{2}\rightarrow 0$ , then

[TABLE]

Proof.

Note that $|f_{j,M}(x)-f_{j,M}(y)|\leq|x-y|$ for $\forall x,y>0$ . Then, we have

[TABLE]

which together with Lemma 6.2 and 6.4 yields the lemma. ∎

**Proof of Theorem 2.2

**In this proof, we will only deal with the case of $f_{2,M}$ because the case of $f_{1,M}$ can be verified following essentially the same arguments below.

Since $f_{2,M}(Z_{1}^{2}),\cdots,f_{2,M}(Z_{n}^{2})$ are i.i.d. random variables, it follows from the invariance principle and the mapping theorem that

[TABLE]

where $\tau$ denotes the variance of $f_{2,M}(Z_{1}^{2})$ . It is therefore sufficient to show that

[TABLE]

where $H_{i,M}=f_{2,M}(Z_{i}^{2})-f_{2,M}(\hat{Z}_{i}^{2})$ .

Let $\epsilon_{n}=\max_{1\leq k\leq n}|Z_{i}^{2}-\hat{Z}_{i}^{2}|$ , $S_{n}=[0,2M+1/n^{q}]$ for some $q\in(0,0.5)$ and

[TABLE]

Then, due to Lemma 6.3 and 6.4, we have that for any $\epsilon>0$ ,

[TABLE]

Observe that on $(n^{q}\epsilon_{n}<1)$ ,

[TABLE]

Then, we have that on $(n^{q}\epsilon_{n}<1)$ ,

[TABLE]

Since the first term of the RHS above converges to zero in probability by Lemma 6.3, the theorem is established if we verify that

[TABLE]

To prove (16), denote by $n_{B}$ the number of $Z_{i}^{2}$ ’s belonging to $\Lambda_{3,n}$ and let $\delta_{n}=P(Z_{1}^{2}\in\Lambda_{3,n})$ . Then, $n_{B}$ becomes a random variable from $B(n,\delta_{n})$ . Using the fact that $\sqrt{a+x}-\sqrt{a-x}=O(x/\sqrt{a})$ as $x\rightarrow 0$ , it can be readily seen that

[TABLE]

Now, take a triangular array of random varibles $\{X_{ni}|n\geq 1,1\leq i\leq n\}$ defined on a new probability space $(\Omega^{\prime},\mathcal{F}^{\prime},P^{\prime})$ such that $X_{n1},X_{n2},\cdots,X_{nn}$ are i.i.d. Bernoulli random variables with success probability $\delta_{n}$ , which is possible due to Theorem 5.3 of Billingsley (1995). Letting $r_{n}=n_{B}/n-\delta_{n}$ , it follows from the law of the iterated logarithm that

[TABLE]

and thus, by (17), we have

[TABLE]

Hence, in view of Lemma 6.4, we have that

[TABLE]

which establish (16), and thus the proof is completed. $\Box$

**Acknowledgments

**This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and Future Planning (NRF-2016R1C1B1015963).

Bibliography21

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Ait-Sahalia (2002) Aït-Sahalia, Y. , 2002. Maximum-likelihood estimation of discretely-sampled diffusions: a closed-form approximation approach. Econometrica 70 , 223–262.
2Andrews et al. (1972) Andrews, D. F., Bickel, P. J., Hampel, F. R., Huber, P. J., Rogers, W. H. and Tukey, J. W., 1972. Robust Estimates of Location: Survey and Advances . Princeton Univ. Press.
3Basu et al. (1998) Basu, A., Harris, I. R., Hjort, N. L. and Jones, M. C., 1998. Robust and efficient estimation by minimizing a density power divergence. Biometrika 85 , 549–559.
4Beskos et al. (2009) Beskos, A., Papaspiliopoulos, O. and G Roberts, G., 2009. Monte Carlo maximum likelihood estimation for discretely observed diffusion processes. The Annals of Statistics 37 , 223–245.
5Billingsley (1995) Billingsley, P., 1995. Probability and Measure . Wiley, New York.
6Chen et al. (2008) Chen, S. X., Gao, J. and Tang, C. Y., 2008. A test for model specification of diffusion processes. The Annals of Statistics 36 , 167–198.
7Dacunha-Castelle and Florens-Zmirou (1986) Dacunha-Castelle, D. and Florens-Zmirou, D., 1986. Estimation of the coefficient of a diffusion from discrete observations. Stochastics 19 , 263–284.
8Gregorio and Iacus (2008) Gregorio, A. and Iacus, S. M., 2008. Least squares volatility change point estimation for partially observed diffusion processes. Communications in Statistics, Theory and Methods 37 , 2342–2357.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Robust test for dispersion parameter change

Abstract

1 Introduction

2 Main Result

2.1 Naive CUSUM test for diffusion processes

Theorem 2.1**.**

2.2 Robust CUSUM test for diffusion processes

Theorem 2.2**.**

3 Simulation study

4 Real data analysis

5 Concluding remarks

6 Proofs

Lemma 6.1**.**

Proof.

Lemma 6.2**.**

Proof.

Lemma 6.3**.**

Proof.

Lemma 6.4**.**

Proof.

Lemma 6.5**.**

Proof.

Theorem 2.1.

Theorem 2.2.

Lemma 6.1.

Lemma 6.2.

Lemma 6.3.

Lemma 6.4.

Lemma 6.5.