A copula approach for dependence modeling in multivariate nonparametric time series
Natalie Neumeyer, Marek Omelka, Sarka Hudecova

TL;DR
This paper introduces a copula-based method for modeling dependence in multivariate nonparametric time series, accounting for covariates affecting mean and variance but assuming stable innovation distributions.
Contribution
It develops nonparametric and semiparametric estimators for the copula of innovations, demonstrating their asymptotic equivalence to unobserved innovation-based estimators.
Findings
Copula estimators are asymptotically consistent.
Method performs well in simulations.
Application to real data illustrates practical utility.
Abstract
This paper is concerned with modeling the dependence structure of two (or more) time-series in the presence of a (possible multivariate) covariate which may include past values of the time series. We assume that the covariate influences only the conditional mean and the conditional variance of each of the time series but the distribution of the standardized innovations is not influenced by the covariate and is stable in time. The joint distribution of the time series is then determined by the conditional means, the conditional variances and the marginal distributions of the innovations, which we estimate nonparametrically, and the copula of the innovations, which represents the dependency structure. We consider a nonparametric as well as a semiparametric estimator based on the estimated residuals. We show that under suitable assumptions these copula estimators are asymptotically…
| Model | estim | bias | SD | RMSE | bias | SD | RMSE | bias | SD | RMSE | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Known innovations | 0.25 | -0.03 | 3.25 | 3.25 | 0.10 | 2.47 | 2.47 | 0.12 | 1.86 | 1.87 | |
| 0.25 | 0.51 | 3.00 | 3.04 | 0.35 | 2.15 | 2.18 | 0.24 | 1.69 | 1.70 | ||
| 0.50 | 0.01 | 2.64 | 2.64 | 0.06 | 2.03 | 2.03 | 0.07 | 1.52 | 1.52 | ||
| 0.50 | 0.09 | 2.47 | 2.47 | 0.08 | 1.84 | 1.85 | 0.04 | 1.39 | 1.39 | ||
| 0.75 | 0.01 | 1.58 | 1.58 | 0.05 | 1.19 | 1.19 | 0.02 | 0.89 | 0.89 | ||
| 0.75 | -0.28 | 1.48 | 1.50 | -0.17 | 1.10 | 1.11 | -0.12 | 0.80 | 0.81 | ||
| 1 | 0.25 | -0.08 | 4.66 | 4.66 | -0.22 | 2.97 | 2.97 | -0.16 | 2.06 | 2.06 | |
| 0.25 | 0.62 | 4.15 | 4.19 | 0.07 | 2.62 | 2.62 | -0.02 | 1.82 | 1.82 | ||
| 0.50 | -0.46 | 3.94 | 3.97 | -0.41 | 2.48 | 2.51 | -0.25 | 1.74 | 1.76 | ||
| 0.50 | -0.90 | 3.59 | 3.70 | -0.81 | 2.25 | 2.39 | -0.55 | 1.60 | 1.69 | ||
| 0.75 | -1.04 | 2.45 | 2.66 | -0.85 | 1.55 | 1.77 | -0.59 | 1.07 | 1.22 | ||
| 0.75 | -3.00 | 2.66 | 4.01 | -2.23 | 1.59 | 2.74 | -1.57 | 1.15 | 1.94 | ||
| 2 | 0.25 | -0.43 | 4.78 | 4.79 | -0.05 | 2.93 | 2.92 | 0.07 | 2.08 | 2.08 | |
| 0.25 | 0.26 | 4.30 | 4.31 | 0.25 | 2.58 | 2.59 | 0.15 | 1.90 | 1.90 | ||
| 0.50 | -0.91 | 3.93 | 4.03 | -0.24 | 2.40 | 2.41 | -0.09 | 1.71 | 1.72 | ||
| 0.50 | -1.50 | 3.62 | 3.92 | -0.57 | 2.21 | 2.29 | -0.36 | 1.60 | 1.64 | ||
| 0.75 | -1.96 | 2.63 | 3.27 | -0.70 | 1.52 | 1.68 | -0.39 | 1.05 | 1.12 | ||
| 0.75 | -4.63 | 3.19 | 5.62 | -2.14 | 1.84 | 2.82 | -1.27 | 1.16 | 1.72 | ||
| 3 | 0.25 | -0.43 | 4.81 | 4.83 | -0.09 | 2.91 | 2.91 | 0.03 | 2.09 | 2.09 | |
| 0.25 | 0.24 | 4.37 | 4.38 | 0.19 | 2.56 | 2.57 | 0.11 | 1.90 | 1.90 | ||
| 0.50 | -0.93 | 3.97 | 4.07 | -0.32 | 2.41 | 2.43 | -0.16 | 1.72 | 1.72 | ||
| 0.50 | -1.52 | 3.70 | 4.00 | -0.66 | 2.20 | 2.30 | -0.46 | 1.61 | 1.67 | ||
| 0.75 | -1.85 | 2.61 | 3.20 | -0.82 | 1.53 | 1.73 | -0.53 | 1.04 | 1.16 | ||
| 0.75 | -4.39 | 3.05 | 5.35 | -2.25 | 1.78 | 2.86 | -1.46 | 1.14 | 1.85 | ||
| 4 | 0.25 | -0.49 | 4.85 | 4.87 | -0.09 | 2.93 | 2.93 | 0.02 | 2.10 | 2.10 | |
| 0.25 | 0.13 | 4.37 | 4.37 | 0.14 | 2.58 | 2.59 | 0.06 | 1.90 | 1.90 | ||
| 0.50 | -0.82 | 3.99 | 4.07 | -0.25 | 2.40 | 2.41 | -0.12 | 1.73 | 1.73 | ||
| 0.50 | -1.54 | 3.70 | 4.01 | -0.80 | 2.22 | 2.36 | -0.53 | 1.60 | 1.69 | ||
| 0.75 | -1.22 | 2.57 | 2.84 | -0.49 | 1.48 | 1.56 | -0.28 | 1.04 | 1.08 | ||
| 0.75 | -3.43 | 2.76 | 4.40 | -1.93 | 1.65 | 2.54 | -1.20 | 1.10 | 1.62 | ||
| Model | estim | bias | SD | RMSE | bias | SD | RMSE | bias | SD | RMSE | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Known innovations | 0.25 | -0.01 | 3.16 | 3.16 | -0.05 | 2.33 | 2.33 | -0.14 | 1.70 | 1.71 | |
| 0.25 | 0.04 | 3.16 | 3.15 | -0.03 | 2.32 | 2.32 | -0.12 | 1.70 | 1.70 | ||
| 0.50 | -0.02 | 2.37 | 2.37 | -0.01 | 1.73 | 1.73 | -0.09 | 1.28 | 1.28 | ||
| 0.50 | 0.00 | 2.34 | 2.34 | -0.02 | 1.72 | 1.72 | -0.08 | 1.27 | 1.27 | ||
| 0.75 | -0.02 | 1.18 | 1.18 | 0.00 | 0.87 | 0.87 | -0.03 | 0.64 | 0.64 | ||
| 0.75 | -0.13 | 1.17 | 1.17 | -0.07 | 0.87 | 0.87 | -0.07 | 0.64 | 0.64 | ||
| 1 | 0.25 | -0.23 | 4.54 | 4.54 | -0.11 | 2.82 | 2.82 | -0.05 | 1.92 | 1.92 | |
| 0.25 | -0.12 | 4.52 | 4.52 | -0.05 | 2.81 | 2.81 | -0.03 | 1.90 | 1.90 | ||
| 0.50 | -0.49 | 3.46 | 3.50 | -0.32 | 2.18 | 2.20 | -0.22 | 1.43 | 1.44 | ||
| 0.50 | -0.47 | 3.40 | 3.43 | -0.30 | 2.15 | 2.17 | -0.21 | 1.42 | 1.43 | ||
| 0.75 | -0.97 | 1.87 | 2.11 | -0.69 | 1.15 | 1.34 | -0.53 | 0.74 | 0.91 | ||
| 0.75 | -1.22 | 1.84 | 2.21 | -0.81 | 1.16 | 1.41 | -0.60 | 0.75 | 0.96 | ||
| 2 | 0.25 | -0.28 | 4.48 | 4.49 | -0.15 | 2.78 | 2.79 | -0.21 | 1.88 | 1.89 | |
| 0.25 | -0.17 | 4.47 | 4.47 | -0.12 | 2.77 | 2.77 | -0.19 | 1.88 | 1.89 | ||
| 0.50 | -0.77 | 3.44 | 3.53 | -0.29 | 2.13 | 2.14 | -0.24 | 1.41 | 1.43 | ||
| 0.50 | -0.75 | 3.40 | 3.48 | -0.31 | 2.10 | 2.12 | -0.24 | 1.40 | 1.42 | ||
| 0.75 | -1.65 | 2.20 | 2.75 | -0.66 | 1.18 | 1.35 | -0.38 | 0.75 | 0.84 | ||
| 0.75 | -1.90 | 2.20 | 2.91 | -0.78 | 1.18 | 1.41 | -0.43 | 0.75 | 0.87 | ||
| 3 | 0.25 | -0.33 | 4.53 | 4.54 | -0.17 | 2.77 | 2.77 | -0.24 | 1.89 | 1.91 | |
| 0.25 | -0.23 | 4.53 | 4.53 | -0.14 | 2.75 | 2.75 | -0.22 | 1.89 | 1.90 | ||
| 0.50 | -0.83 | 3.48 | 3.58 | -0.37 | 2.09 | 2.12 | -0.32 | 1.42 | 1.45 | ||
| 0.50 | -0.81 | 3.44 | 3.53 | -0.38 | 2.06 | 2.10 | -0.32 | 1.41 | 1.44 | ||
| 0.75 | -1.62 | 2.15 | 2.70 | -0.77 | 1.14 | 1.37 | -0.51 | 0.76 | 0.92 | ||
| 0.75 | -1.86 | 2.14 | 2.84 | -0.89 | 1.14 | 1.44 | -0.57 | 0.77 | 0.96 | ||
| 4 | 0.25 | -0.37 | 4.56 | 4.57 | -0.16 | 2.79 | 2.80 | -0.22 | 1.90 | 1.91 | |
| 0.25 | -0.26 | 4.54 | 4.54 | -0.13 | 2.79 | 2.79 | -0.20 | 1.90 | 1.91 | ||
| 0.50 | -0.76 | 3.48 | 3.56 | -0.30 | 2.13 | 2.15 | -0.25 | 1.43 | 1.46 | ||
| 0.50 | -0.73 | 3.43 | 3.50 | -0.31 | 2.11 | 2.13 | -0.25 | 1.42 | 1.44 | ||
| 0.75 | -1.11 | 2.05 | 2.34 | -0.48 | 1.15 | 1.24 | -0.30 | 0.76 | 0.81 | ||
| 0.75 | -1.33 | 2.01 | 2.41 | -0.58 | 1.14 | 1.28 | -0.35 | 0.76 | 0.83 | ||
| Model | estim | bias | SD | RMSE | bias | SD | RMSE | bias | SD | RMSE | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Known innovations | 0.25 | 0.01 | 3.19 | 3.19 | 0.13 | 2.43 | 2.44 | 0.08 | 1.88 | 1.88 | |
| 0.25 | 0.44 | 3.01 | 3.04 | 0.38 | 2.37 | 2.40 | 0.24 | 1.81 | 1.82 | ||
| 0.50 | 0.02 | 2.58 | 2.58 | 0.11 | 1.96 | 1.97 | 0.02 | 1.49 | 1.49 | ||
| 0.50 | 0.24 | 2.42 | 2.43 | 0.27 | 1.89 | 1.91 | 0.12 | 1.44 | 1.44 | ||
| 0.75 | 0.02 | 1.48 | 1.48 | 0.06 | 1.12 | 1.12 | 0.00 | 0.84 | 0.84 | ||
| 0.75 | -0.06 | 1.35 | 1.36 | 0.02 | 1.05 | 1.05 | -0.03 | 0.78 | 0.78 | ||
| 1 | 0.25 | -0.36 | 4.76 | 4.78 | 0.06 | 3.06 | 3.05 | -0.09 | 2.06 | 2.06 | |
| 0.25 | 0.24 | 4.68 | 4.68 | 0.37 | 2.92 | 2.94 | 0.08 | 2.01 | 2.01 | ||
| 0.50 | -0.56 | 3.92 | 3.96 | -0.17 | 2.45 | 2.46 | -0.22 | 1.69 | 1.70 | ||
| 0.50 | -0.36 | 3.83 | 3.84 | -0.10 | 2.35 | 2.35 | -0.20 | 1.65 | 1.66 | ||
| 0.75 | -0.85 | 2.36 | 2.50 | -0.52 | 1.42 | 1.51 | -0.49 | 1.01 | 1.12 | ||
| 0.75 | -1.35 | 2.32 | 2.69 | -0.84 | 1.36 | 1.60 | -0.73 | 0.99 | 1.22 | ||
| 2 | 0.25 | -0.16 | 4.58 | 4.58 | 0.02 | 2.91 | 2.91 | 0.04 | 2.10 | 2.10 | |
| 0.25 | 0.49 | 4.42 | 4.45 | 0.32 | 2.86 | 2.88 | 0.20 | 2.03 | 2.04 | ||
| 0.50 | -0.66 | 3.77 | 3.82 | -0.14 | 2.36 | 2.36 | -0.09 | 1.67 | 1.68 | ||
| 0.50 | -0.50 | 3.61 | 3.64 | -0.09 | 2.30 | 2.30 | -0.05 | 1.62 | 1.62 | ||
| 0.75 | -1.61 | 2.50 | 2.97 | -0.52 | 1.43 | 1.52 | -0.32 | 0.99 | 1.04 | ||
| 0.75 | -2.37 | 2.52 | 3.46 | -0.95 | 1.45 | 1.73 | -0.55 | 0.98 | 1.13 | ||
| 3 | 0.25 | -0.18 | 4.57 | 4.57 | 0.01 | 2.93 | 2.92 | 0.02 | 2.11 | 2.11 | |
| 0.25 | 0.46 | 4.41 | 4.43 | 0.31 | 2.87 | 2.88 | 0.18 | 2.03 | 2.03 | ||
| 0.50 | -0.66 | 3.73 | 3.78 | -0.18 | 2.36 | 2.37 | -0.16 | 1.69 | 1.70 | ||
| 0.50 | -0.50 | 3.59 | 3.62 | -0.13 | 2.31 | 2.32 | -0.13 | 1.63 | 1.64 | ||
| 0.75 | -1.52 | 2.48 | 2.90 | -0.58 | 1.41 | 1.53 | -0.42 | 0.98 | 1.07 | ||
| 0.75 | -2.20 | 2.44 | 3.29 | -0.98 | 1.40 | 1.71 | -0.64 | 0.96 | 1.15 | ||
| 4 | 0.25 | -0.26 | 4.60 | 4.60 | -0.06 | 2.97 | 2.97 | 0.04 | 2.12 | 2.12 | |
| 0.25 | 0.30 | 4.47 | 4.47 | 0.19 | 2.89 | 2.89 | 0.18 | 2.04 | 2.05 | ||
| 0.50 | -0.63 | 3.79 | 3.84 | -0.13 | 2.36 | 2.37 | -0.11 | 1.69 | 1.69 | ||
| 0.50 | -0.56 | 3.63 | 3.67 | -0.16 | 2.31 | 2.32 | -0.13 | 1.65 | 1.65 | ||
| 0.75 | -0.83 | 2.38 | 2.52 | -0.29 | 1.40 | 1.43 | -0.21 | 0.97 | 0.99 | ||
| 0.75 | -1.51 | 2.35 | 2.79 | -0.71 | 1.41 | 1.57 | -0.45 | 0.95 | 1.05 | ||
| Model | estim | bias | SD | RMSE | bias | SD | RMSE | bias | SD | RMSE | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Known innovations | 0.25 | -0.02 | 3.13 | 3.13 | -0.05 | 2.32 | 2.31 | -0.03 | 1.78 | 1.77 | |
| 0.25 | 0.38 | 2.99 | 3.02 | 0.22 | 2.19 | 2.20 | 0.13 | 1.66 | 1.67 | ||
| 0.50 | -0.01 | 2.44 | 2.44 | -0.04 | 1.81 | 1.81 | -0.02 | 1.39 | 1.39 | ||
| 0.50 | 0.32 | 2.26 | 2.28 | 0.19 | 1.67 | 1.68 | 0.12 | 1.27 | 1.27 | ||
| 0.75 | -0.01 | 1.36 | 1.36 | -0.02 | 1.01 | 1.01 | -0.01 | 0.77 | 0.77 | ||
| 0.75 | -0.04 | 1.23 | 1.23 | -0.03 | 0.91 | 0.91 | -0.01 | 0.69 | 0.69 | ||
| 1 | 0.25 | -0.29 | 4.65 | 4.66 | -0.07 | 2.83 | 2.83 | -0.15 | 1.99 | 2.00 | |
| 0.25 | 0.35 | 4.49 | 4.50 | 0.19 | 2.72 | 2.72 | 0.02 | 1.89 | 1.89 | ||
| 0.50 | -0.48 | 3.67 | 3.70 | -0.23 | 2.22 | 2.23 | -0.25 | 1.56 | 1.58 | ||
| 0.50 | 0.00 | 3.40 | 3.40 | -0.05 | 2.08 | 2.08 | -0.13 | 1.44 | 1.44 | ||
| 0.75 | -0.78 | 2.17 | 2.30 | -0.53 | 1.27 | 1.38 | -0.47 | 0.88 | 1.00 | ||
| 0.75 | -0.94 | 2.02 | 2.23 | -0.64 | 1.19 | 1.35 | -0.52 | 0.81 | 0.96 | ||
| 2 | 0.25 | -0.34 | 4.39 | 4.40 | -0.12 | 2.80 | 2.80 | -0.10 | 1.94 | 1.94 | |
| 0.25 | 0.38 | 4.21 | 4.22 | 0.22 | 2.72 | 2.72 | 0.10 | 1.83 | 1.83 | ||
| 0.50 | -0.70 | 3.47 | 3.54 | -0.25 | 2.20 | 2.21 | -0.16 | 1.53 | 1.54 | ||
| 0.50 | -0.20 | 3.21 | 3.22 | -0.01 | 2.06 | 2.06 | -0.01 | 1.40 | 1.40 | ||
| 0.75 | -1.54 | 2.25 | 2.73 | -0.59 | 1.31 | 1.43 | -0.34 | 0.86 | 0.93 | ||
| 0.75 | -1.80 | 2.14 | 2.80 | -0.71 | 1.23 | 1.43 | -0.39 | 0.79 | 0.88 | ||
| 3 | 0.25 | -0.38 | 4.41 | 4.42 | -0.15 | 2.80 | 2.81 | -0.13 | 1.95 | 1.96 | |
| 0.25 | 0.33 | 4.23 | 4.24 | 0.18 | 2.72 | 2.73 | 0.06 | 1.83 | 1.83 | ||
| 0.50 | -0.71 | 3.48 | 3.55 | -0.32 | 2.19 | 2.21 | -0.22 | 1.52 | 1.53 | ||
| 0.50 | -0.21 | 3.20 | 3.21 | -0.08 | 2.05 | 2.06 | -0.07 | 1.39 | 1.39 | ||
| 0.75 | -1.45 | 2.19 | 2.63 | -0.70 | 1.29 | 1.46 | -0.43 | 0.87 | 0.97 | ||
| 0.75 | -1.70 | 2.07 | 2.67 | -0.81 | 1.21 | 1.46 | -0.48 | 0.79 | 0.93 | ||
| 4 | 0.25 | -0.34 | 4.40 | 4.41 | -0.15 | 2.81 | 2.81 | -0.11 | 1.96 | 1.97 | |
| 0.25 | 0.30 | 4.24 | 4.25 | 0.16 | 2.72 | 2.72 | 0.07 | 1.84 | 1.84 | ||
| 0.50 | -0.69 | 3.47 | 3.53 | -0.27 | 2.20 | 2.22 | -0.18 | 1.54 | 1.55 | ||
| 0.50 | -0.26 | 3.23 | 3.24 | -0.09 | 2.07 | 2.07 | -0.07 | 1.41 | 1.41 | ||
| 0.75 | -0.82 | 2.19 | 2.34 | -0.35 | 1.29 | 1.34 | -0.22 | 0.87 | 0.89 | ||
| 0.75 | -1.14 | 2.08 | 2.37 | -0.52 | 1.23 | 1.33 | -0.31 | 0.81 | 0.86 | ||
| Model | estim | bias | SD | RMSE | bias | SD | RMSE | bias | SD | RMSE | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Known innovations | 0.25 | -0.28 | 3.53 | 3.53 | 0.00 | 2.68 | 2.68 | 0.07 | 1.99 | 1.99 | |
| 0.25 | 0.03 | 3.48 | 3.48 | 0.23 | 2.61 | 2.62 | 0.21 | 1.97 | 1.98 | ||
| 0.50 | -0.19 | 2.81 | 2.82 | -0.01 | 2.14 | 2.14 | 0.05 | 1.60 | 1.60 | ||
| 0.50 | 0.05 | 2.66 | 2.66 | 0.19 | 2.00 | 2.00 | 0.17 | 1.51 | 1.52 | ||
| 0.75 | -0.10 | 1.62 | 1.62 | -0.01 | 1.23 | 1.23 | 0.02 | 0.93 | 0.93 | ||
| 0.75 | -0.16 | 1.46 | 1.47 | -0.02 | 1.09 | 1.09 | 0.02 | 0.83 | 0.83 | ||
| 1 | 0.25 | -0.25 | 4.93 | 4.93 | -0.18 | 3.30 | 3.30 | -0.11 | 2.28 | 2.28 | |
| 0.25 | 0.24 | 4.96 | 4.96 | 0.08 | 3.32 | 3.32 | 0.00 | 2.27 | 2.27 | ||
| 0.50 | -0.48 | 3.95 | 3.97 | -0.34 | 2.62 | 2.64 | -0.24 | 1.81 | 1.83 | ||
| 0.50 | -0.17 | 3.82 | 3.82 | -0.18 | 2.57 | 2.57 | -0.20 | 1.74 | 1.75 | ||
| 0.75 | -0.79 | 2.33 | 2.46 | -0.64 | 1.56 | 1.68 | -0.49 | 1.06 | 1.17 | ||
| 0.75 | -1.13 | 2.22 | 2.48 | -0.83 | 1.47 | 1.69 | -0.66 | 0.99 | 1.19 | ||
| 2 | 0.25 | -0.61 | 4.99 | 5.03 | -0.20 | 3.23 | 3.24 | 0.02 | 2.22 | 2.22 | |
| 0.25 | -0.21 | 4.98 | 4.98 | 0.03 | 3.18 | 3.18 | 0.15 | 2.19 | 2.20 | ||
| 0.50 | -0.89 | 4.01 | 4.11 | -0.35 | 2.62 | 2.64 | -0.08 | 1.79 | 1.79 | ||
| 0.50 | -0.80 | 3.86 | 3.94 | -0.24 | 2.45 | 2.46 | -0.01 | 1.69 | 1.69 | ||
| 0.75 | -1.66 | 2.57 | 3.06 | -0.70 | 1.55 | 1.70 | -0.30 | 1.06 | 1.10 | ||
| 0.75 | -2.37 | 2.48 | 3.42 | -0.99 | 1.44 | 1.75 | -0.46 | 0.97 | 1.07 | ||
| 3 | 0.25 | -0.59 | 5.01 | 5.05 | -0.24 | 3.22 | 3.23 | -0.01 | 2.23 | 2.23 | |
| 0.25 | -0.21 | 4.97 | 4.98 | -0.01 | 3.18 | 3.18 | 0.12 | 2.20 | 2.20 | ||
| 0.50 | -0.90 | 4.07 | 4.16 | -0.43 | 2.59 | 2.63 | -0.14 | 1.79 | 1.79 | ||
| 0.50 | -0.79 | 3.88 | 3.96 | -0.33 | 2.44 | 2.46 | -0.08 | 1.68 | 1.69 | ||
| 0.75 | -1.60 | 2.61 | 3.06 | -0.76 | 1.55 | 1.73 | -0.39 | 1.06 | 1.13 | ||
| 0.75 | -2.20 | 2.48 | 3.31 | -1.05 | 1.43 | 1.77 | -0.56 | 0.97 | 1.12 | ||
| 4 | 0.25 | -0.63 | 5.03 | 5.07 | -0.23 | 3.26 | 3.27 | 0.00 | 2.22 | 2.22 | |
| 0.25 | -0.28 | 4.97 | 4.97 | -0.01 | 3.22 | 3.22 | 0.12 | 2.19 | 2.20 | ||
| 0.50 | -0.91 | 4.06 | 4.16 | -0.38 | 2.61 | 2.64 | -0.09 | 1.78 | 1.78 | ||
| 0.50 | -0.81 | 3.82 | 3.91 | -0.32 | 2.45 | 2.47 | -0.07 | 1.68 | 1.68 | ||
| 0.75 | -0.97 | 2.51 | 2.69 | -0.42 | 1.55 | 1.61 | -0.17 | 1.05 | 1.06 | ||
| 0.75 | -1.47 | 2.34 | 2.76 | -0.70 | 1.44 | 1.60 | -0.33 | 0.96 | 1.02 | ||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A copula approach for dependence modeling in multivariate nonparametric time series
Natalie Neumeyera, Marek Omelkab, Šárka Hudecováb
Abstract.
This paper is concerned with modeling the dependence structure of two (or more) time-series in the presence of a (possibly multivariate) covariate which may include past values of the time series. We assume that the covariate influences only the conditional mean and the conditional variance of each of the time series but the distribution of the standardized innovations is not influenced by the covariate and is stable in time. The joint distribution of the time series is then determined by the conditional means, the conditional variances and the marginal distributions of the innovations, which we estimate nonparametrically, and the copula of the innovations, which represents the dependency structure. We consider a nonparametric as well as a semiparametric estimator based on the estimated residuals. We show that under suitable assumptions these copula estimators are asymptotically equivalent to estimators that would be based on the unobserved innovations. The theoretical results are illustrated by simulations and a real data example.
a Department of Mathematics, University of Hamburg, Bundesstrasse 55, 20146 Hamburg, Germany
b Department of Probability and Statistics, Faculty of Mathematics and Physics, Charles University, Sokolovská 83, 186 75 Praha 8, Czech Republic
Keywords and phrases: Asymptotic representation; CHARN model; empirical copula process; goodness-of-fit testing; nonparametric AR-ARCH model; nonparametric SCOMDY model; weak convergence.
NOTICE: This is the version of the manuscript accepted to Journal of Multivariate Analysis. DOI: 10.1016/j.jmva.2018.11.016
1. Introduction
Modeling the dependency of observed time series can be of utmost importance for applications, e. g. in risk management (for instance to model the dependence between several exchange rates). We will take the approach to model dependent nonparametric AR-ARCH time series
[TABLE]
where the covariate may include past values of the process, (), or other exogenous variables. Further the innovations , , are assumed to be independent and identically distributed random vectors and is independent of the past and present covariates , for all . For identifiability we assume , (), such that the functions and represent the conditional mean and volatility function of the th time series. Such models are also called multivariate nonparametric CHARN (conditional heteroscedastic autoregressive nonlinear) models and have gained much attention over the last decades, see Fan and Yao, (2005) and Gao, (2007) for extensive overviews.
Note that due to the structure of the model and Sklar’s theorem (see e.g., Nelsen,, 2006), for () one has
[TABLE]
where () denote the marginal distributions of the innovations and their copula. Thus the joint conditional distribution of the observations, given the covariate, is completely specified by the individual conditional mean and variance functions, the marginal distributions of the innovations, and their copula. The copula describes the dependence structure of the time series, conditional on the covariates, after removing influences of the conditional means and variances as well as marginal distributions.
We will model the conditional mean and variance function nonparametrically like Härdle et al., (1998), among others. Semiparametric estimation, e. g. with additive structure for and multiplicative structure for as in Yang et al., (1999), can be considered as well and all presented results remain valid under appropriate changes for the estimators and assumptions. Further we will model the marginal distributions of the innovations nonparametrically, whereas we will take two different approaches to estimate the copula : nonparametrically and parametrically. As the innovations are not observable, both estimators will be based on estimated residuals. We will show that the asymptotic distribution is not affected by the necessary pre-estimation of the mean and variance functions. This remarkable result is intrinsic for copula estimation and it was already observed in (semi)parametric estimation of copula (see the references in the next paragraph). It is worth noting that on the other hand the asymptotic distribution of empirical distribution functions is typically influenced by pre-estimation of mean and variance functions. Moreover, comparison of the nonparametric and parametric copula estimator gives us the possibility to test goodness-of-fit of a parametric class of copulas.
Our approach extends the following parametric and semiparametric approaches in time series contexts. Chen and Fan, (2006) introduced SCOMDY (semiparametric copula-based multivariate dynamic) models, which are very similar to the model considered here. However, the conditional mean and variance functions are modeled parametrically, while the marginal distributions of innovations are estimated nonparametrically and a parametric copula model is applied to model the dependence. See also Kim et al., (2007) for similar methods for some parametric time series models including nonlinear GARCH models, Rémillard et al., (2012), Kim et al., (2008) and the review by Patton, (2012). Chan et al., (2009) give (next to the parametric estimation of a copula) even a goodness-of fit test for the innovation copula in the GARCH context. Further, in an i.i.d. setting Gijbels et al., (2015) show that in nonparametric location-scale models the asymptotic distribution of the empirical copula is not influenced by pre-estimation of the mean and variance function. This results was further generalized by Portier and Segers, (2018) to a completely nonparametric model for the marginals.
The remainder of the paper is organized as follows. In Section 2 we define the estimators and state some regularity assumptions. In Subsection 2.1 we show the weak convergence of the copula process, while in Subsection 2.2 we show asymptotic normality of a parameter estimator when considering a parametric class of copulas. Subsection 2.3 is devoted to goodness-of-fit testing. In Section 3 we present simulation results and in Section 4 a real data example. All proofs are given in the Appendix.
2. Main results
For the ease of presentation we will focus on the case of two time series, i. e. , but all results can be extended to general in an obvious manner. Suppose we have observed for a section of the stationary stochastic process \big{\{}Y_{1i},Y_{2i},\boldsymbol{X}_{i}\big{\}}_{i\in\mathbb{Z}} that satisfies
[TABLE]
where is a -dimensional covariate and the innovations \big{\{}(\varepsilon_{1i},\varepsilon_{2i})\big{\}}_{i\in\mathbb{Z}} are independent identically distributed random vectors. Further is independent of the past and present covariates , , and , . If the marginal distribution functions and of the innovations are continuous, then the copula function of the innovations is unique and can be expressed as
[TABLE]
As the innovations are unobserved, the inference about the copula function is based on the estimated residuals
[TABLE]
where and are the estimates of the unknown functions and . In what follows we will consider the local polynomial estimators of order ; see Fan and Gijbels, (1996) or Masry, (1996), among others. Here, for a given , is defined as , the component of with multi-index , where is the solution to the minimization problem
[TABLE]
Here denotes the set of multi-indices with and \psi_{\mathbf{i},\mathbf{h}_{n}}(\mathbf{x})=\prod_{k=1}^{d}\big{(}\frac{x_{k}}{h_{n}^{(k)}}\big{)}^{i_{k}}\frac{1}{i_{k}!}. Further
[TABLE]
with being a kernel function and \mathbf{h}_{n}=\big{(}h_{n}^{(1)},\dotsc,h_{n}^{(d)}\big{)} the smoothing parameter.
Further is estimated as
[TABLE]
where is obtained in the same way as but with replaced with .
For any function defined on , interval in , define for , ,
[TABLE]
where and is the Euclidean norm on . Denote by the set of -times differentiable functions on , such that . Denote by the subset of of the functions that satisfy .
In what follows we are going to prove that under appropriate regularity assumptions using the estimated residuals (3) instead of the (true) unobserved innovations affects neither the asymptotic distribution of the empirical copula estimator nor the parametric estimator of a copula.
2.1. Empirical copula estimation
Mimicking the copula function can be estimated nonparametrically as
[TABLE]
where
[TABLE]
is the estimate of the joint distribution function and
[TABLE]
the corresponding marginal empirical cumulative distribution functions. Here we make use of a weight function and put as well as . For some real positive sequence we set .
Now let be the ‘oracle’ estimator based on the unobserved innovations, i.e.
[TABLE]
where \widehat{F}_{\varepsilon}(z_{1},z_{2})=\frac{1}{n}\sum_{i=1}^{n}\mathbbm{1}\big{\{}\varepsilon_{1i}\leq z_{1},\varepsilon_{2i}\leq z_{2}\big{\}} is the estimator of based on the unobserved innovations and the corresponding marginal empirical cumulative distribution functions.
Regularity assumptions
The process is strictly stationary and absolutely regular (-mixing) with the mixing coefficient that satisfies with .
The second-order partial derivatives , and of the joint cumulative distribution function , with , satisfy
[TABLE]
Further the innovation density satisfies
[TABLE]
The observations () have density that is bounded and differentiable with bounded uniformly continuous first order partial derivatives. Suppose that the sequence which is of order O\big{(}(\log n)^{1/d}\big{)} is chosen in such a way that converges to zero not faster than some negative power of .
For some with from assumption (), for , , the functions and are bounded and there are some , such that for all ,
[TABLE]
where denotes the joint density of and is bounded (for ).
Let, for and for each , and be elements of for some sequence that is either bounded or diverges to infinity not faster than some power of . Further, assume and that is either bounded away from zero or converges to zero not faster than a negative power of .
There exists a sequence such that , where , . Further, there exists some such that
[TABLE]
for all .
is a symmetric ()-times continuously differentiable probability density function supported on .
Remark 1*.*
Using F_{\varepsilon}(y_{1},y_{2})=C\big{(}F_{1\varepsilon}(y_{1}),F_{2\varepsilon}(y_{2})\big{)} assumption requires that
[TABLE]
where and stand for the first and second order partial derivatives of the copula function.
Thus provided that for some
[TABLE]
then we need that the functions f_{j\varepsilon}\big{(}F_{j\varepsilon}^{-1}(u)\big{)}(1+\big{|}F_{j\varepsilon}^{-1}(u)\big{|}) are of order and the functions f^{\prime}_{j\varepsilon}\big{(}F_{j\varepsilon}^{-1}(u_{j})\big{)}\big{(}1+|F_{j\varepsilon}^{-1}(u_{j})|\big{)}^{2} are bounded.
Remark 2*.*
Parts of our assumptions are reproduced from Hansen, (2008) because we apply his results about uniform rates of convergence for kernel estimators several times in our proofs. Note that in his Theorem 2 we set to simplify the assumptions. Further note that if beta mixing coefficients are diminishing exponentially fast then it is sufficient to assume in .
Remark 3*.*
Note that the bandwidth conditions (8) can be fulfilled iff , i.e. in view of assumption iff . Thus if , then for it is sufficient to take and for one can take . In general with increasing dimension higher smoothness of the unknown functions has to be assumed and higher order local polynomial estimators have to be used. This phenomenon is well known in the context of nonparametric inference.
So in general one can choose the bandwidth as , where . The problem is that if one wants to take as small as possible, the range of possible values of is rather short which makes the choice of rather delicate. To make the choice of more flexible in practice one can for instance assume that which (among others) includes models for beta mixing coefficients diminishing exponentially fast. Now for and one can take in the interval . See also the bandwidth choice in our simulation study in Section 3.
Remark 4*.*
The choice of is a delicate problem in practice. As far as we know even in analogous settings (see e.g. Müller et al.,, 2009; Dette et al.,, 2009; Koul and Zhu,, 2015, and the references therein) this problem has not been touched yet. Note that the weight function is chosen in the simplest possible form in order to simplify the presentation of the proof. On the other hand in practice it is of interest to use more general forms of . Further as the density is unknown, data-driven procedures to choose are of interest. In the simulation study in Section 3 we suggest a data-driven procedure for the choice of the weighting function in the case . Nevertheless the data driven choice of (in particular for general ) and its theoretical justification calls for further research.
Theorem 1**.**
Suppose that assumptions , , , , , , and are satisfied. Then
[TABLE]
Note that Theorem 1 together with the weak convergence of \sqrt{n}\,\big{[}C_{n}^{(or)}-C\big{]} (see e.g., Proposition 3.1 of Segers,, 2012) implies that that process \widetilde{\mathbb{C}}_{n}=\sqrt{n}\,\big{[}\widetilde{C}_{n}-C\big{]} weakly converges in the space of bounded functions to a centred Gaussian process , which can be written as
[TABLE]
where is a Brownian bridge on with covariance function
[TABLE]
Nevertheless when one uses this result in applications for statistical inference we recommend to replace the sample size with in the formulas. The thing is that the copula is estimated in fact only from observations and this should be reflected in order to improve the finite sample performance of asymptotic inference procedures.
2.2. Semiparametric copula estimation
The copula describes the dependency between the two time series of interest, given the covariate. For applications modeling this dependency structure parametrically is advantageous because a parametric model often gives easier access to interpretations. Goodness-of-fit testing will be considered in the next section.
Suppose that the joint distribution of is given by the copula function , where is an unknown parameter that belongs to a parametric space . In copula settings we are often interested in semiparametric estimation of the parameter , i.e. estimation of without making any parametric assumption on the marginal distributions and . The methods of semiparametric estimation for i.i.d. settings are summarized in Tsukahara, (2005). The question of interest is what happens if we use the estimated residuals (3) instead of the unobserved innovations . Generally speaking, thanks to Theorem 1 the answer is that using instead of does not change the asymptotic distribution provided that the parameter of interest can be written as a Hadamard differentiable functional of a copula.
2.2.1. Method-of-Moments using rank correlation
This method is in a general way described for instance in McNeil et al., (2005, Section 5.5.1). To illustrate the application of Theorem 1 for this method consider that the parameter is one-dimensional. Then the method of the inversion of Kendall’s tau is a very popular method of estimating the unknown parameter. For this method the estimator of is given by
[TABLE]
where
[TABLE]
is the theoretical Kendall’s tau and is an estimate of Kendall’s tau. In our settings the Kendall’s tau would be computed from the estimated residuals for which . By Theorem 1 and Hadamard differentiability of Kendall’s tau proved in Veraverbeke et al., (2011, Lemma 1), the estimators of Kendall’s tau based on or on are asymptotically equivalent. Thus provided that one gets that
[TABLE]
and
[TABLE]
Analogously one can show that working with residuals has asymptotically negligible effects also for the method of moments introduced in Brahimi and Necir, (2012).
2.2.2. Minimum distance estimation
Here one can follow for instance Tsukahara, (2005, Section 3.2). Note that thanks to Theorem 1 the proof of Theorem 3 of Tsukahara, (2005) does not change when is replaced with . Thus provided assumptions (B.1)-(B.5) of Tsukahara, (2005) are satisfied with , then the estimator defined as
[TABLE]
is asymptotically normal and satisfies
[TABLE]
where
[TABLE]
with
[TABLE]
2.2.3. M-estimator, rank approximate Z-estimators
To define a general -estimator let us introduce
[TABLE]
that can be viewed as estimates of the unobserved . Note that the multiplier is introduced in order to have both of the coordinates of the vector \big{(}\widetilde{U}_{1i},\widetilde{U}_{2i}\big{)} bounded away from zero and one. The -estimator of the parameter is now defined as
[TABLE]
where is a given loss function. This class of estimators includes among others the pseudo-maximum likelihood estimators (), for which , with being the copula density function.
Note that the estimator is usually searched for as a solution to the estimating equations
[TABLE]
where . In Tsukahara, (2005) the estimator defined as the solution of (11) is called a rank approximate -estimator.
In what follows we give general assumptions under which there exists a consistent root () of the estimating equations (11) that is asymptotically equivalent to the consistent root () of the ‘oracle’ estimating equations given by
[TABLE]
where
[TABLE]
are the standard pseudo-observations calculated from the unobserved innovations and their marginal empirical distribution functions .
Unfortunately, these general assumptions exclude some useful models (e.g. pseudo-maximum likelihood estimator in the Clayton family of copulas) for which the function viewed as a function of is unbounded. The reason is that for empirical distribution functions calculated from estimated residuals we lack some of the sophisticated results that are available for empirical distribution functions calculated from (true) innovations . For such copula families one can use for instance the Method-of-Moments using rank correlation (see Section 2.2.1) to stay on the safe side. Nevertheless the simulation study in Section 3 suggests that the pseudo-maximum likelihood estimation can be used also for the Clayton copula (and probably also for other families of copulas with non-zero tail dependence) provided that the dependence is not very strong.
Regularity assumptions
In what follows let stand for the true value of the parameter and for an open neighbourhood of .
is a unique minimizer of the function and is an inner point of .
There exists such that for each the functions and are uniformly continuous in uniformly in and of uniformly bounded Hardy-Kraus variation (see e.g., Berghaus et al.,, 2017).
There exists and a function such that for each
[TABLE]
and .
Each element of the (matrix) function is a continuous function on and the matrix is positively definite.
Theorem 2**.**
Suppose that the assumptions of Theorem 1 are satisfied and that also , , , and hold. Then with probability going to one there exists a consistent root of the estimating equations (11), which satisfies
[TABLE]
where
[TABLE]
The proof of Theorem 2 is given in Appendix B. Note that the asymptotic distribution of the estimator coincides with the distribution given in Section 4 of Genest et al., (1995) that corresponds to the consistent root of the estimating equations (12). Thus using the residuals instead of the true innovations has asymptotically negligible effect on the (first-order) asymptotic properties. In fact, it can be even shown that both and have the same asymptotic representations and thus
[TABLE]
2.3. Goodness-of-fit testing
When modeling multivariate data using copulas parametrically one needs to choose a suitable family of copulas. When choosing the copula family tests of goodness-of-fit are often a useful tool. Thus we are interested in testing , where is a given parametric family of copulas.
Many testing methods have been proposed (see e.g. Genest et al.,, 2009; Kojadinovic and Holmes,, 2009, and the references therein). The most standard ones are based on the comparison of nonparametric and parametric estimators of a copula. For instance the Cramér-von Mises statistic is given by
[TABLE]
where is an estimate of the unknown parameter . As the asymptotic distributions of and are the same as the asymptotic distribution of and we suggest that the significance of the test statistic can be assessed in the same way as in i.i.d. settings. Thus one can use for instance the parametric bootstrap by simply generating independent and identically distributed observations from the copula function . The test statistic is then simply recalculated from this observations in the same way as if we directly observed the innovations. The only difference is that instead of generating observations we recommend to generate only observations.
Similar remarks hold when testing other hypotheses about the copula such as symmetry, for instance. Note that testing provides a test for conditional independence of the two time series, given the covariate.
3. Simulation study
A small Monte Carlo study was conducted in order to compare the semiparametric estimators based on the residuals with the ‘oracle’ estimators based on (unobserved) innovations. The inversion of Kendall’s tau (IK) method and the maximum pseudo-likelihood (MPL) method were considered for the following five copula families: Clayton, Frank, Gumbel, normal, and Student with 4 degrees of freedom. The values of the parameters are chosen so that they correspond to the Kendall’s tau , and . The data were simulated from the following four models:
[TABLE]
where the innovations , , follow marginally the standard normal distribution, and is an exogenous variable following the AR model with being i.i.d. from a standard normal distribution. The simulations were conducted also for innovations , with Student marginals with 5 degrees of freedom, but the corresponding results are very similar. For brevity of the paper we do not present them here.
The nonparametric estimates and are constructed as local polynomial estimators of order with being the triweight kernel. The bandwidth is chosen for each estimation separately by the cross-validation method from the interval , where and for (cf. Remark 3) and is an estimate of the standard deviation of the explanatory variable (being or , depending on the model) given by , where stands for the sample standard deviation and is the interquartile range.
The weights are given by , where is the largest possible interval such that , where is the kernel density estimator of the marginal density of (with triweight kernel and the bandwidth chosen by the standard normal reference rule, see e.g. Fan and Yao,, 2005, p. 201).
For each setting, we compute the estimate of the copula parameter from the true (but unobserved) innovations using the inversion of Kendall’s tau method () and the maximum pseudo-likelihood method (). These oracle estimators are compared with their counterparts computed from the residuals () and (). To have more comparable results for different copula families the estimates of the parameters are done on the Kendall’s tau scale. That is we are in fact comparing nonparametric estimates of Kendall’s tau with parametric estimates, where the parameter is estimated with the help of maximum pseudo-likelihood method. The performance of the estimators is measured by the bias, standard deviation (SD), and the root mean square error (RMSE), which are estimated from the random samples for chosen sample sizes , and . Since the obtained quantities are of order and smaller, we report multiples of bias, SD and RMSE in Tables 1,2,3,4 and 5. As and are natural competitors, the bigger of the two corresponding performance measures (bias, SD, RMSE) is stressed by the bold font.
In agreement with the results of Genest et al., (1995) and Tsukahara, (2005) the results for the (oracle) estimates based on (unobserved) innovations are in favour of MPL method. This continues to hold also when working with estimated residuals provided that the dependence is not very strong (i.e. or ). But if the dependence is strong (i.e. ) then one should consider using the IK method. This seems to be true in particular for the Clayton copula and to some extent also for the Frank copula and the Gumbel copula. A closer inspection of the results reveals that while the standard deviation of MPL method is almost always slightly smaller than the standard deviation of the IK method, the bias can be substantially larger. On the other hand the results suggest that for the normal and the Student copula one can stay with MPL method even in case of a strong dependence.
Finally note that for large sample sizes the performance of the estimates based on residuals is usually almost as good as of the oracle estimates based on (unobserved) innovations. But there is still some price to pay even for the sample size and this price relatively increases with the level of dependence. The question for possible further research is how to explain the bad performance of PML method based on residuals for the Clayton copula with a strong dependence.
4. Application
To illustrate the proposed methods let us consider daily log returns of USD/CZK (US Dollar/Czech Koruna) and GBP/CZK (British Pound/Czech Koruna) exchange rates from 4th January 2010 to 31st December 2012. Note that we take only data till the end of 2012 (total of 758 observations for each series), because in November 2013 the Czech National Bank started its intervention aimed at CZK/EUR exchange rate.
Daily foreign exchange rates have been successfully modelled using the nonparametric autoregression, e.g., in Härdle et al., (1998) and Yang et al., (1999). Here, we apply a simple model of two separate nonparametric autoregressions of order 1 and search for a feasible copula for the innovations. The conditional means and variances are modelled using local polynomials with degree . The weights and the smoothing parameters are chosen as in Section 3. The fitted conditional means and standard deviations are plotted together with the data in Figure 1. It is visible that the conditional mean functions are rather flat and range around zero.
We use the goodness-of-fit test proposed in Section 2.3 in order to decide which copula should be used for modeling the innovations from the two autoregressions. The copula parameter is estimated using the inversion of Kendall’s tau method. The significance of the test statistics is assessed with the help of the bootstrap test based on bootstrap samples. We test Clayton, Frank, Gumbel, normal and Student copula with 4 degrees of freedom respectively and obtain -values , , , , . Hence, we conclude that the Student copula seems to be the best choice for the innovations. The normal copula is also not rejected on the 5% level, but the corresponding -value is rather borderline, so the Student copula seems to provide a better fit. The maximum pseudo-likelihood method estimates degrees of freedom and parameter . Figure 2 shows plot of pseudo-observations given by (10), together with contours of the fitted Student copula.
Acknowledgement
The authors thank the editor, an associate editor and two referees for their very valuable comments that led to a considerable improvement of the contribution. The second and the third authors gratefully acknowledge support from the grant GACR 15-04774Y.
Appendix A - Proof of Theorem 1
Recall the definition . Introduce
[TABLE]
and note that
[TABLE]
where and denote the marginals of . Further is a distribution function on with the marginals cdfs satisfying . Thus one can make use of the Hadamard differentiability of the ‘copula mapping’ proved in Theorem 2.4 of Bücher and Volgushev, (2013) provided we show that the process
[TABLE]
converges in distribution in the space to a process with continuous trajectories such that for each .
A1: Decomposition and weak convergence of \texorpdfstringGn
Denote
[TABLE]
Now one can decompose the process as
[TABLE]
where , and stand for the first, second and third term respectively on the right-hand side of the first equation in (A2).
In Section A2 it will be shown that the first term on the right-hand side of (A2) satisfies uniformly in ,
[TABLE]
where (in agreement with the last two conditions in ) for the first term on the right-hand side of (A3) is defined as zero and analogously for .
In Section A3, we will show the asymptotic negligibility of the second term on the right-hand side of (A2), i.e.
[TABLE]
Now combining (A2) with (A3) and (A4) yields that uniformly in
[TABLE]
where
[TABLE]
The asymptotic representation (A5) together with standard techniques yields the weak convergence of the process .
Now thanks to Hadamard differentiability of the copula functional and Theorem 3.9.4 of van der Vaart and Wellner, (1996),
[TABLE]
Note that for each
[TABLE]
Further combining (A7) with (A5), (A6) and (A8) gives
[TABLE]
Now the right-hand side of (A9) coincides with the asymptotic representation of the ‘oracle’ copula process \sqrt{n}\big{[}C_{n}^{(or)}-C\big{]}, which implies the statement of Theorem 1.
A2: Showing \texorpdfstring(A3)(A3)
Let us introduce the process
[TABLE]
that is indexed be the following set of functions
[TABLE]
where
[TABLE]
and for from assumption and some large enough such that
[TABLE]
Denote the centred process as
[TABLE]
and note that may be formally identified by . We will use the notation . Further in agreement with the notation used in van der Vaart and Wellner, (2007) by for random we understand the value of the mapping evaluated at .
Consider the semi-norm given by
[TABLE]
where
[TABLE]
From assumption one obtains that for some constant . Further denote
[TABLE]
As consists of indicator functions for one has . Thus one obtains for
[TABLE]
Starting with brackets of -length of the function classes , and it is then easy to construct brackets for with -length (compare with the proof of Lemma 1 in Dette et al.,, 2009). Thus one obtains
[TABLE]
where the rate follows from Lemma 2 in Appendix C. Further one bracket is sufficient for . Thus by (A15) and (A12),
[TABLE]
From Dedecker and Louhichi, (2002), Section 4.3, it follows that the centred process given by (A13) is asymptotically -equicontinuous.
To apply this result in order to prove (A3) note that
[TABLE]
and introduce the process
[TABLE]
with , , from Lemma 1 and Remark 5 in Appendix C. Then one obtains by monotonicity arguments applying Lemma 1(i) that, on an event with probability converging to one
[TABLE]
for some deterministic positive sequence . Here,
[TABLE]
We only consider the upper bound, the lower one can be handled completely analogously. First note that Z_{n}\big{(}f_{n}^{u}\big{)}-Z_{n}\big{(}g_{n})=\bar{Z}_{n}\big{(}f_{n}^{u}\big{)}-\bar{Z}_{n}\big{(}g_{n})+R_{n}, where with probability converging to one,
[TABLE]
where the last equality follows by a Taylor expansion, assumption and . Now introducing the notation ()
[TABLE]
one can show as in (A14) that for a sufficiently large
[TABLE]
and this can be bounded by times the bound on the right hand side of (A16) and thus converges to zero in probability uniformly in . Therefore there exists a deterministic sequence with \mathsf{P}\big{(}\sup_{u_{1},u_{2}}\|f_{n}^{u}-g_{n}\|_{2,\beta}\leq\delta_{n}\big{)}\to 1 as . Further by Lemma 1 and Remark 5 of Appendix C one has \mathsf{P}\big{(}f_{n}^{u}, , g_{n}\in\mathcal{F}\big{)}\to 1 as . Now from -equicontinuity of one obtains for every that
[TABLE]
and thus \big{|}\bar{Z}_{n}(f_{n}^{u})-\bar{Z}_{n}(g_{n})\big{|}=o_{P}(1) uniformly with respect to . In combination with (A16), analogous considerations for the lower bound Z_{n}\big{(}f_{n}^{\ell}\big{)}-Z_{n}\big{(}g_{n}) and the fact that
[TABLE]
we obtain
[TABLE]
Further thanks to (A18) it is sufficient to show that the process has the asymptotic representation given by the right-hand side of (A3).
Thus the remaining proof of (A3) is divided into two parts. First we prove that
[TABLE]
and then we calculate . Here, with slight abuse of notation, denotes expectation, considering the functions , as deterministic.
Showing (A19)
Note that we have
[TABLE]
where
[TABLE]
with [math] and standing for functions that are constantly equal to zero and one respectively. Similarly to before one can show that for a sufficiently large
[TABLE]
using notation (A17).
Now note that with Lemma 1 (iii) in Appendix C we obtain uniformly in . Finally with the help of (A18), (A20) and the asymptotic -equicontinuity of the process one can conclude (A19).
Calculating
To simplify the notation and to prevent the confusion let the random vector have the same distribution as . With the help of a second-order Taylor series expansion of the right-hand side one gets
[TABLE]
where
[TABLE]
and the point lies between the points F_{j\varepsilon}\big{(}F_{j\varepsilon}^{-1}(u_{j},\mathbf{x},0)\big{)} and . Now using Lemma 1 (iv) in Appendix C for
[TABLE]
uniformly in .
To conclude the proof of (A3) we need to show that ‘the second order terms’ in (A21) are asymptotically negligible. To show that note that by assumption and Lemma 1 (iii) there exists a finite constant such that with probability going to one
[TABLE]
uniformly in and .
Thus to prove
[TABLE]
it is sufficient to use once more Lemma 1 (iii).
A3: Showing \texorpdfstring(A4)(A4)
Recall that and decompose
[TABLE]
where stands for the first term on the right-hand side of the equation (A22) (except for the factor ) and for the second term. Further using standard techniques one can show that both and viewed as processes on are asymptotically equi-continuous. To this end, note that corresponds to the process as defined in Section A1 above with f\hat{=}\big{(}c_{n},F_{1\varepsilon}^{-1}(u_{1}),F_{2\varepsilon}^{-1}(u_{2}),0,1,0,1\big{)}. Alternatively, results by Bickel and Wichura, (1971) can be applied. Moreover as
[TABLE]
one can conclude that both processes \big{(}\tfrac{n}{W_{n}}-1\big{)}\,B_{n1}(u_{1},u_{2}) and are uniformly asymptotically negligible in probability, which together with (A22) implies (A4).
Appendix B - Proof of Theorem 2
Thanks to assumption , the estimator is a solution to the estimating equations (11).
In what follows, first we prove the existence of a consistent root of the estimating equations (11) and then we derive that this root satisfies
[TABLE]
where are introduced in (13). The statement of the theorem now follows for by Proposition A 1(ii) of Genest et al., (1995) and for by Theorem 1 of Gijbels et al., (2017).
Proving consistency
Put \widetilde{C}_{n}^{\prime}(u_{1},u_{2})=\frac{1}{W_{n}}\sum_{i=1}^{n}w_{ni}\,\mathbbm{1}\big{\{}\widetilde{U}_{1i}\leq u_{1},\widetilde{U}_{2i}\leq u_{2}\big{\}}, where pseudo-observations are defined in (10). Note that
[TABLE]
Fix . By Corollary A.7 of Berghaus et al., (2017) one gets
[TABLE]
Note that thanks to assumption (uniformly in )
[TABLE]
and analogously also
[TABLE]
Now combining (B3), (B4) and (B5) yields
[TABLE]
where
[TABLE]
Analogously one gets
[TABLE]
Now using (B2), (B6), (B8) and assumption gives that uniformly in
[TABLE]
where we have used Theorem 1 and assumption . The existence of a consistent root of estimating equations (11) now follows by assumptions and .
Analogously one can show the existence of a consistent root of estimating equations (12).
Showing \texorpdfstring(B1)(B1)
Let be a consistent root of the estimating equations (11). Then by the mean value theorem applied to each coordinate of the vector-valued function
[TABLE]
one gets
[TABLE]
where stands for and is between and . Note that as the mean value theorem is applied to a vector valued function there are in fact different points for each coordinate of the function but all of them are consistent so for simplicity of notation we do not distinguish them.
Thus to finish the proof of (B1) it is sufficient to show that
[TABLE]
and
[TABLE]
When proving (B9) one can mimic the proof of consistency of and show that there exists (an open neighbourhood of such that)
[TABLE]
Using the consistency of and assumption yields (B9).
Thus one can concentrate on proving (B10). Put C_{n}^{\prime(or)}(u_{1},u_{2})=\frac{1}{n}\sum_{i=1}^{n}\mathbbm{1}\big{\{}\widehat{U}_{1i}\leq u_{1},\widehat{U}_{2i}\leq u_{2}\big{\}}, where are defined in (13). Note that
[TABLE]
Analogously as (B6) one can also show that for
[TABLE]
where is given in (B7).
Now using (B2), (B6), (B11), (B12), Theorem 1 and one gets
[TABLE]
which verifies (B10) and finishes the proof of (B1).
Appendix C - Auxiliary results
Lemma 1**.**
Assume that , , , , , , and are satisfied. Then there exist random functions and on such that for
- (i)
\displaystyle\sup_{\mathbf{x}\in\mathbf{J}_{n}}\Big{|}\frac{\widehat{m}_{j}(\mathbf{x})-m_{j}(\mathbf{x})}{\sigma_{j}(\mathbf{x})}-\widehat{a}_{j}(\mathbf{x})\Big{|}=o_{P}\big{(}n^{-1/2}\big{)},\ \sup_{\mathbf{x}\in\mathbf{J}_{n}}\Big{|}\frac{\widehat{\sigma}_{j}(\mathbf{x})}{\sigma_{j}(\mathbf{x})}-\widehat{b}_{j}(\mathbf{x})\Big{|}=o_{P}\big{(}n^{-1/2}\big{)},** 2. (ii)
* for from assumption ,* 3. (iii)
\displaystyle\sup_{\mathbf{x}\in\mathbf{J}_{n}}\big{|}\widehat{a}_{j}(\mathbf{x})\big{|}=o_{P}\big{(}n^{-1/4}\big{)},\qquad\sup_{\mathbf{x}\in\mathbf{J}_{n}}\big{|}\widehat{b}_{j}(\mathbf{x})-1\big{|}=o_{P}\big{(}n^{-1/4}\big{)},** 4. (iv)
\displaystyle\int_{\mathbf{J}_{n}}\widehat{a}_{j}(\mathbf{x})f_{\boldsymbol{X}}(\mathbf{x})\,d\mathbf{x}=\frac{1}{n}\sum_{i=1}^{n}\varepsilon_{ji}+o_{P}\big{(}n^{-1/2}\big{)},
\displaystyle\int_{\mathbf{J}_{n}}\Big{(}\widehat{b}_{j}(\mathbf{x})-1\Big{)}\,f_{\boldsymbol{X}}(\mathbf{x})\,d\mathbf{x}=\frac{1}{2n}\sum_{i=1}^{n}\big{(}\varepsilon_{ji}^{2}-1\big{)}+o_{P}\big{(}n^{-1/2}\big{)}.**
Proof.
For ease of presentation we set and assume \mathbf{h}_{n}=\big{(}h_{n},\dotsc,h_{n}\big{)}. We will first prove the assertions for . The proof basically goes along the lines of the proof of Lemma 1 by Müller et al., (2009), but changes are necessary due to the dependency of observations in our model and because our covariate density is not assumed to be bounded away from zero on its support. Recall that denotes the set of multi-indices with and we set , where is the order of the polynomials used in the local polynomial estimation. Further introduce and note thanks to assumption
[TABLE]
as for all sufficiently large the set is a subset of . Finally define which is by assumption either bounded away from zero or converges to zero not faster than a negative power of .
Proof of assertion (i) for . Fix some and let denote the solution of the minimization problem (4). Then satisfies the normal equations
[TABLE]
where
[TABLE]
From Theorem 2 in Hansen, (2008) we obtain for \varrho_{n}=\big{(}\log n/(nh_{n}^{d})\big{)}^{1/2},
[TABLE]
where we define Q_{\mathbf{i}\mathbf{k}}(\mathbf{x})=\mathsf{E}\,\big{[}\widehat{Q}_{\mathbf{i}\mathbf{k}}(\mathbf{x})\big{]}, . Note that
[TABLE]
and for , consider the matrices with entries , . Analogously put for the matrix with entries , .
It follows from (C1) that for all vectors of unit Euclidean length, where is a sequence of positive real numbers of the same rate as in (C1). Thus has eigenvalues in the interval , and on the event
[TABLE]
one has for all of unit Euclidean length, such that the matrix is invertible as well. Here and throughout denotes the spectral norm of a matrix . Note that by (C2) and \varrho_{n}=o\big{(}\alpha_{n}^{(1)}\big{)}, which holds under assumption . For the remainder of the proof we assume that the event takes place because its complement does not matter for the assertions of the lemma. It follows from the normal equations that for ,
[TABLE]
where and and denote the vectors with components and , , respectively. Now define
[TABLE]
then we have the decomposition
[TABLE]
with remainder terms
[TABLE]
where is the vector with components , . From Theorem 2 in Hansen, (2008) we obtain
[TABLE]
For the treatment of the inverse matrices in we use Cramer’s rule and write
[TABLE]
where and denote the cofactor matrices of and , respectively. Due to the boundedness of the functions each element of can be absolutely bounded by by (C2) and the same rate is obtained for \big{|}\det\big{(}{\mathbf{Q}}(\mathbf{x})\big{)}-\det\big{(}\widehat{\mathbf{Q}}(\mathbf{x})\big{)}\big{|}, uniformly in . Using the lower bound for the determinant of , and assumption to bound gives the rate
[TABLE]
by assumption . In order to show negligibility of first note that the spectral norm of is given by the reciprocal of the square root of the smallest eigenvalue of . With
[TABLE]
(on ) for all with , we obtain the rate for \big{\|}\widehat{\mathbf{Q}}^{-1}(\mathbf{x})\big{\|}. Further, by Taylor expansion of of order in the definition of and using assumption we have
[TABLE]
where the kernel density estimator converges to uniformly in , see Theorem 6 by Hansen, (2008). Altogether we have
[TABLE]
using assumption .
Now assertion (i) for follows from (C3), (C4), (C6) and (C7).
Proof of assertion (ii) for . Note that and thus is ()-times partially differentiable and
[TABLE]
by the mean value theorem. Again by Theorem 2 of Hansen, (2008) we have
[TABLE]
Further note that
[TABLE]
and that the spectral norm of can be bounded by O\big{(}1/\alpha_{n}^{(1)}\big{)} with considerations as before. We apply the product rule for derivatives to obtain
[TABLE]
by assumption . Assertion (ii) for follows.
Proof of assertion (iii) for . From the definition of and (C5) we obtain that
[TABLE]
and thus (iii) follows for .
Proof of assertion (iv) for . To prove (iv) note that
[TABLE]
with
[TABLE]
From the support properties of the kernel function it follows that . Further, for note that
[TABLE]
because the expectation is zero and the variance is bounded by
[TABLE]
It remains to consider
[TABLE]
with , where
[TABLE]
Now, by applying the mean value theorem for , for , can be bounded absolutely by O\big{(}M_{n}\,h_{n}/(\alpha_{n}^{(1)}\alpha_{n}^{(2)})\big{)}=o(1). Thus analogously as when showing (C8) one can use Markov’s inequality to get
[TABLE]
To obtain the desired negligibility it remains to show \mathsf{E}\big{[}\big{(}\Delta_{n}^{(1)}(\boldsymbol{X}_{i})-1\big{)}^{2}\mathbbm{1}\{\boldsymbol{X}_{i}\in\mathbf{J}_{n}^{-}\}\big{]}\to 0. To this end we write
[TABLE]
where the matrix has entries
[TABLE]
Note that has the smallest eigenvalue of order . Thus we obtain the bound
[TABLE]
Now with bounds for the matrix norms similar to before, and inserting the definitions of and we obtain
[TABLE]
by the mean value theorem and assumptions and .
Proof of assertions (i)–(iv) for . Recall the definition , where is the local polynomial estimator based on , . With the notation we obtain
[TABLE]
where
[TABLE]
Put
[TABLE]
where denotes the vector with components
[TABLE]
Along the lines of the proof of (i) and (ii) for one can prove that
[TABLE]
and
[TABLE]
Now noticing that we obtain the rate
[TABLE]
and (i) follows for .
If we define , then (ii) and (iii) follow analogously to before. The only difference is an additional factor in the denominator that needs to be considered.
To show validity of (iv) note that the regression model holds with error term . From this one obtains analogously to the derivation of (iv) for that
[TABLE]
But the second sum is also the dominating term in , which is again shown analogously to the proof of (iv) for . Thus (iv) follows for . ∎
Remark 5*.*
Note that due to property (iii) of Lemma 1 and (C1) we have for ,
[TABLE]
for every . In the proof of Lemma 1, was only defined for . Now we define on in a way that if and , then defined in (A10). Then \mathsf{P}\big{(}\widehat{a}_{1}\in\mathcal{G}\big{)}\to 1 by Lemma 1. Analogously is defined on such that \mathsf{P}\big{(}\widehat{b}_{1}\in\widetilde{\mathcal{G}}\big{)}\to 1 for from (A11).
Lemma 2**.**
Let or denote one of the function classes defined in (A10) and (A11) (depending on and ), then we have
[TABLE]
for , and thus the same bound holds for .
Proof.
Let (the proof is similar for ) and let . Choose . Let denote the ball of radius around the origin. Let denote the centers of -balls with respect to the supremum norm that cover , that is . Then for each we have and thus there exists such that . Now define for , . Then
[TABLE]
because by definition of . We obtain and due to van der Vaart and Wellner, (1996), Theorem 2.7.1, we have for some universal
[TABLE]
where B^{1}=\big{\{}\mathbf{x}:\,\|\mathbf{x}-B\|<1\big{\}}
Thus the first assertion follows. The second assertion follows by van der Vaart and Wellner, (1996), proof of Cor. 2.7.2. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Berghaus et al., (2017) Berghaus, B., Bücher, A., and Volgushev, S. (2017). Weak convergence of the empirical copula process with respect to weighted metrics. Bernoulli , 23(1):743–772.
- 2Bickel and Wichura, (1971) Bickel, P. J. and Wichura, M. J. (1971). Convergence criteria for multiparameter stochastic processes and some applications. Ann. Math. Statist. , 42:1656–1670.
- 3Brahimi and Necir, (2012) Brahimi, B. and Necir, A. (2012). A semiparametric estimation of copula models based on the method of moments. Stat. Methodol. , 9(4):467–477.
- 4Bücher and Volgushev, (2013) Bücher, A. and Volgushev, S. (2013). Empirical and sequential empirical copula processes under serial dependence. J. Multivariate Anal. , 119:61–70.
- 5Chan et al., (2009) Chan, N.-H., Chen, J., Chen, X., Fan, Y., and Peng, L. (2009). Statistical inference for multivariate residual copula of GARCH models. Statist. Sinica , 19:53–70.
- 6Chen and Fan, (2006) Chen, X. and Fan, Y. (2006). Estimation and model selection of semiparametric copula-based multivariate dynamic models under copula misspecification. J. Econometrics , 135:125–154.
- 7Dedecker and Louhichi, (2002) Dedecker, J. and Louhichi, S. (2002). Maximal inequalities and empirical central limit theorems. In Dehling, H., Mikosch, T., and Sorensen, M., editors, Empirical process techniques for dependent data , pages 137–160. Birkhäuser Boston.
- 8Dette et al., (2009) Dette, H., Pardo-Fernández, J. C., and Keilegom, I. V. (2009). Goodness-of-fit tests for multiplicative models with dependent data. Scand. J. Statist. , 36(4):782–799.
