Fourier analysis of serial dependence measures

Ria van Hecke; Stanislav Volgushev; Holger Dette

arXiv:1703.04320·math.ST·March 14, 2017

Fourier analysis of serial dependence measures

Ria van Hecke, Stanislav Volgushev, Holger Dette

PDF

TL;DR

This paper explores new frequency domain methods for analyzing serial dependence using U-statistics-based measures like Kendall's tau, revealing unique asymptotic properties and behaviors.

Contribution

It introduces a novel spectral analysis approach replacing auto-covariances with U-statistics dependence measures, expanding spectral analysis tools.

Findings

01

Kendall's tau-based spectral density exhibits surprising limiting variance behavior

02

Asymptotic properties of new frequency domain methods are characterized

03

Alternative dependence measures can be effectively used in spectral analysis

Abstract

Classical spectral analysis is based on the discrete Fourier transform of the auto-covariances. In this paper we investigate the asymptotic properties of new frequency domain methods where the auto-covariances in the spectral density are replaced by alternative dependence measures which can be estimated by U-statistics. An interesting example is given by Kendall{'}s $τ$ , for which the limiting variance exhibits a surprising behavior.

Equations652

f_{ξ} (ω) = \frac{1}{2 π} k \in Z \sum ξ_{k} e^{- ik ω} (ω \in R),

f_{ξ} (ω) = \frac{1}{2 π} k \in Z \sum ξ_{k} e^{- ik ω} (ω \in R),

τ_{k} = 4 \int C_{k} (u) d C_{k} (u) - 1

τ_{k} = 4 \int C_{k} (u) d C_{k} (u) - 1

τ_{k} = 2 P [X_{1} < X_{2}, Y_{1} < Y_{2}] + 2 P [X_{2} < X_{1}, Y_{2} < Y_{1}] - 1

τ_{k} = 2 P [X_{1} < X_{2}, Y_{1} < Y_{2}] + 2 P [X_{2} < X_{1}, Y_{2} < Y_{1}] - 1

\displaystyle\xi_{k}=\textnormal{\mbox{I\negthinspace E}}\Big{[}h\Big{(}\begin{pmatrix}X_{0}^{(1)}\\ X_{k}^{(1)}\end{pmatrix},\dots,\begin{pmatrix}X_{0}^{(m)}\\ X_{k}^{(m)}\end{pmatrix}\Big{)}\Big{]}.

\displaystyle\xi_{k}=\textnormal{\mbox{I\negthinspace E}}\Big{[}h\Big{(}\begin{pmatrix}X_{0}^{(1)}\\ X_{k}^{(1)}\end{pmatrix},\dots,\begin{pmatrix}X_{0}^{(m)}\\ X_{k}^{(m)}\end{pmatrix}\Big{)}\Big{]}.

\hat{f}_{n, ξ} (ω) = \frac{1}{2 π} ∣ k ∣ < n \sum w_{n} (k) ξ_{n, k} e^{- ik ω},

\hat{f}_{n, ξ} (ω) = \frac{1}{2 π} ∣ k ∣ < n \sum w_{n} (k) ξ_{n, k} e^{- ik ω},

r_{k}=\textnormal{\mbox{I\negthinspace E}}\Big{[}\frac{1}{2}(X_{0}^{(1)}-X_{0}^{(2)})(X_{k}^{(1)}-X_{k}^{(2)})\Big{]}=\mathrm{Cov}(X_{0},X_{k})

r_{k}=\textnormal{\mbox{I\negthinspace E}}\Big{[}\frac{1}{2}(X_{0}^{(1)}-X_{0}^{(2)})(X_{k}^{(1)}-X_{k}^{(2)})\Big{]}=\mathrm{Cov}(X_{0},X_{k})

\displaystyle h\big{(}({x_{1}},{y_{2}})^{T},({x_{2}},{y_{2}})^{T}\big{)}=2I(x_{1}<x_{2},y_{1}<y_{2})+2I(x_{2}<x_{1},y_{2}<y_{1})-1,

\displaystyle h\big{(}({x_{1}},{y_{2}})^{T},({x_{2}},{y_{2}})^{T}\big{)}=2I(x_{1}<x_{2},y_{1}<y_{2})+2I(x_{2}<x_{1},y_{2}<y_{1})-1,

τ_{k} =

τ_{k} =

=

f_{τ} (ω) = \frac{1}{2 π} k \in Z \sum τ_{k} e^{- ik ω} .

f_{τ} (ω) = \frac{1}{2 π} k \in Z \sum τ_{k} e^{- ik ω} .

\displaystyle h\big{(}({x_{1}},{y_{1}})^{T},({x_{2}},{y_{2}})^{T}\big{)}=4\big{(}I(x_{1}<x_{2})-\frac{1}{2}\big{)}\big{(}I(y_{1}<y_{2})-\frac{1}{2}\big{)}

\displaystyle h\big{(}({x_{1}},{y_{1}})^{T},({x_{2}},{y_{2}})^{T}\big{)}=4\big{(}I(x_{1}<x_{2})-\frac{1}{2}\big{)}\big{(}I(y_{1}<y_{2})-\frac{1}{2}\big{)}

\displaystyle h\big{(}({x_{1}},{y_{1}})^{T},({x_{2}},{y_{2}})^{T},({x_{3}},{y_{3}})^{T}\big{)}=\frac{1}{6}\sum_{\gamma\in\Gamma\{1,2,3\}}[12I(x_{\gamma(1)}<x_{\gamma(2)},y_{\gamma(1)}<y_{\gamma(3)})-3].

\displaystyle h\big{(}({x_{1}},{y_{1}})^{T},({x_{2}},{y_{2}})^{T},({x_{3}},{y_{3}})^{T}\big{)}=\frac{1}{6}\sum_{\gamma\in\Gamma\{1,2,3\}}[12I(x_{\gamma(1)}<x_{\gamma(2)},y_{\gamma(1)}<y_{\gamma(3)})-3].

ρ_{k} = 3 (P [(X_{0}^{(1)} - X_{0}^{(2)}) (X_{k}^{(1)} - X_{k}^{(3)}) > 0] - P [(X_{0}^{(1)} - X_{0}^{(2)}) (X_{k}^{(1)} - X_{k}^{(3)}) < 0]),

ρ_{k} = 3 (P [(X_{0}^{(1)} - X_{0}^{(2)}) (X_{k}^{(1)} - X_{k}^{(3)}) > 0] - P [(X_{0}^{(1)} - X_{0}^{(2)}) (X_{k}^{(1)} - X_{k}^{(3)}) < 0]),

f_{ρ} (ω) = \frac{1}{2 π} k \in Z \sum ρ_{k} e^{- ik ω} .

f_{ρ} (ω) = \frac{1}{2 π} k \in Z \sum ρ_{k} e^{- ik ω} .

\displaystyle h\big{(}({x_{1}},{y_{1}})^{T},({x_{2}},{y_{2}})^{T},({x_{3}},{y_{3}})^{T}\big{)}=\sum_{\gamma\in\Gamma\{1,2,3\}}\big{[}2\big{(}I(x_{\gamma(1)}<x_{\gamma(2)})-\frac{1}{2}\big{)}\big{(}I(y_{\gamma(1)}<y_{\gamma(3)})-\frac{1}{2}\big{)}\big{]}

\displaystyle h\big{(}({x_{1}},{y_{1}})^{T},({x_{2}},{y_{2}})^{T},({x_{3}},{y_{3}})^{T}\big{)}=\sum_{\gamma\in\Gamma\{1,2,3\}}\big{[}2\big{(}I(x_{\gamma(1)}<x_{\gamma(2)})-\frac{1}{2}\big{)}\big{(}I(y_{\gamma(1)}<y_{\gamma(3)})-\frac{1}{2}\big{)}\big{]}

ξ_{n, k} =

ξ_{n, k} =

h_{c, k} (y_{1}, \dots, y_{c}) :=

h_{c, k} (y_{1}, \dots, y_{c}) :=

- j = 1 \sum c - 1 {ν_{1}, \dots, ν_{j}} \subset {1, \dots, c} ν_{1} < \dots < ν_{j} \sum h_{j, k} (y_{ν_{1}}, \dots, y_{ν_{j}}) - ξ_{k}

=

\displaystyle U_{n-|k|}^{(c)}(h_{c,k})=\frac{1}{\binom{n-|k|}{c}}\sum_{\begin{subarray}{c}t_{1},\dots,t_{c}\in\mathcal{T}_{k}\\ t_{1}<\dots<t_{c}\end{subarray}}h_{c,k}\Big{(}\begin{pmatrix}X_{t_{1}}\\ X_{t_{1}+k}\end{pmatrix},\dots,\begin{pmatrix}X_{t_{c}}\\ X_{t_{c}+k}\end{pmatrix}\Big{)}

\displaystyle U_{n-|k|}^{(c)}(h_{c,k})=\frac{1}{\binom{n-|k|}{c}}\sum_{\begin{subarray}{c}t_{1},\dots,t_{c}\in\mathcal{T}_{k}\\ t_{1}<\dots<t_{c}\end{subarray}}h_{c,k}\Big{(}\begin{pmatrix}X_{t_{1}}\\ X_{t_{1}+k}\end{pmatrix},\dots,\begin{pmatrix}X_{t_{c}}\\ X_{t_{c}+k}\end{pmatrix}\Big{)}

ξ_{n, k} - ξ_{k} = \frac{m}{n - ∣ k ∣} t \in T_{k} \sum h_{1, k} (X_{t} X_{t + k}) + c = 2 \sum m (c m) U_{n - ∣ k ∣}^{(c)} (h_{c, k}),

ξ_{n, k} - ξ_{k} = \frac{m}{n - ∣ k ∣} t \in T_{k} \sum h_{1, k} (X_{t} X_{t + k}) + c = 2 \sum m (c m) U_{n - ∣ k ∣}^{(c)} (h_{c, k}),

\displaystyle\max\Big{\{}\int_{\mathbb{R}}\dots\int_{\mathbb{R}}|h|^{2+\delta}dG,\int_{\mathbb{R}}\dots\int_{\mathbb{R}}|h|^{2+\delta}dG_{j}^{(1)}dG_{j}^{(2)}\Big{\}}\leq M_{0}<\infty,

\displaystyle\max\Big{\{}\int_{\mathbb{R}}\dots\int_{\mathbb{R}}|h|^{2+\delta}dG,\int_{\mathbb{R}}\dots\int_{\mathbb{R}}|h|^{2+\delta}dG_{j}^{(1)}dG_{j}^{(2)}\Big{\}}\leq M_{0}<\infty,

\hat{f}_{n, ξ} (ω) \to P f_{ξ} (ω), (n \to \infty) .

\hat{f}_{n, ξ} (ω) \to P f_{ξ} (ω), (n \to \infty) .

\displaystyle\tau_{n,k}\overset{a.s.}{=}\frac{1}{\binom{n-|k|}{2}}\underset{\begin{subarray}{c}t_{1},t_{2}\in\mathcal{T}_{k}\\ t_{1}<t_{2}\end{subarray}}{\sum}4\Big{(}I(X_{t_{1}}<X_{t_{2}})-\frac{1}{2}\Big{)}\Big{(}I(X_{t_{1}+k}<X_{t_{2}+k})-\frac{1}{2}\Big{)},

\displaystyle\tau_{n,k}\overset{a.s.}{=}\frac{1}{\binom{n-|k|}{2}}\underset{\begin{subarray}{c}t_{1},t_{2}\in\mathcal{T}_{k}\\ t_{1}<t_{2}\end{subarray}}{\sum}4\Big{(}I(X_{t_{1}}<X_{t_{2}})-\frac{1}{2}\Big{)}\Big{(}I(X_{t_{1}+k}<X_{t_{2}+k})-\frac{1}{2}\Big{)},

\displaystyle\rho_{n,k}\overset{a.s.}{=}\frac{1}{\binom{n-|k|}{2}}\underset{\begin{subarray}{c}t_{1},t_{2},t_{3}\in\mathcal{T}_{k}\\ t_{1}<t_{2}<t_{3}\end{subarray}}{\sum}\sum_{\gamma\in\Gamma\{1,2,3\}}2\Big{(}I(X_{t_{\gamma(1)}}<X_{t_{\gamma(2)}})-\frac{1}{2}\Big{)}\Big{(}I(X_{t_{\gamma(1)}+k}<X_{t_{\gamma(3)}+k})-\frac{1}{2}\Big{)},

\displaystyle\rho_{n,k}\overset{a.s.}{=}\frac{1}{\binom{n-|k|}{2}}\underset{\begin{subarray}{c}t_{1},t_{2},t_{3}\in\mathcal{T}_{k}\\ t_{1}<t_{2}<t_{3}\end{subarray}}{\sum}\sum_{\gamma\in\Gamma\{1,2,3\}}2\Big{(}I(X_{t_{\gamma(1)}}<X_{t_{\gamma(2)}})-\frac{1}{2}\Big{)}\Big{(}I(X_{t_{\gamma(1)}+k}<X_{t_{\gamma(3)}+k})-\frac{1}{2}\Big{)},

\hat{f}_{n,\xi}(\omega)=\frac{1}{2\pi}\sum_{|k|\leq\lfloor r_{n}\rfloor}w\Big{(}\frac{k}{r_{n}}\Big{)}\Big{\{}\xi_{k}+\frac{m}{n-|k|}\sum_{t\in\mathcal{T}_{k}}h_{1,k}^{\xi}\begin{pmatrix}X_{t}\\ X_{t+k}\end{pmatrix}+\sum_{c=2}^{m}\binom{m}{c}U_{n-|k|}^{(c)}(h_{c,k})\Big{\}}e^{-ik\omega},

\hat{f}_{n,\xi}(\omega)=\frac{1}{2\pi}\sum_{|k|\leq\lfloor r_{n}\rfloor}w\Big{(}\frac{k}{r_{n}}\Big{)}\Big{\{}\xi_{k}+\frac{m}{n-|k|}\sum_{t\in\mathcal{T}_{k}}h_{1,k}^{\xi}\begin{pmatrix}X_{t}\\ X_{t+k}\end{pmatrix}+\sum_{c=2}^{m}\binom{m}{c}U_{n-|k|}^{(c)}(h_{c,k})\Big{\}}e^{-ik\omega},

C_{w} (d) := u \to 0 lim \frac{1 - w ( u )}{∣ u ∣ ^{d}}

C_{w} (d) := u \to 0 lim \frac{1 - w ( u )}{∣ u ∣ ^{d}}

f_{ξ}^{[d]} (ω) := \frac{1}{2 π} k \in Z \sum ∣ k ∣^{d} ξ_{k} e^{- ik ω}

f_{ξ}^{[d]} (ω) := \frac{1}{2 π} k \in Z \sum ∣ k ∣^{d} ξ_{k} e^{- ik ω}

\displaystyle\sqrt{\frac{n}{r_{n}}}\Big{(}\hat{f}_{n,\rho}(\omega)-\boldsymbol{\mathfrak{f}}_{\rho}(\omega)-b_{\rho}(\omega)\Big{)}\overset{\mathcal{D}}{\longrightarrow}\mathcal{N}(0,\sigma_{\rho}^{2}(\omega)),

\displaystyle\sqrt{\frac{n}{r_{n}}}\Big{(}\hat{f}_{n,\rho}(\omega)-\boldsymbol{\mathfrak{f}}_{\rho}(\omega)-b_{\rho}(\omega)\Big{)}\overset{\mathcal{D}}{\longrightarrow}\mathcal{N}(0,\sigma_{\rho}^{2}(\omega)),

σ_{ρ}^{2} (ω) = (1 + I (ω \in {0, π})) f_{ρ}^{2} (ω) \int_{- 1}^{1} w^{2} (x) d x .

σ_{ρ}^{2} (ω) = (1 + I (ω \in {0, π})) f_{ρ}^{2} (ω) \int_{- 1}^{1} w^{2} (x) d x .

b_{ρ} (ω) := \mbox I E [\hat{f}_{n, ρ} (ω)] - f_{ρ} (ω) = - C_{w} (d) r_{n}^{- d} f_{ρ}^{[d]} (ω) + o (r_{n}^{- d}) =: r_{n}^{- d} b_{ρ, ω} + o (r_{n}^{- d})

b_{ρ} (ω) := \mbox I E [\hat{f}_{n, ρ} (ω)] - f_{ρ} (ω) = - C_{w} (d) r_{n}^{- d} f_{ρ}^{[d]} (ω) + o (r_{n}^{- d}) =: r_{n}^{- d} b_{ρ, ω} + o (r_{n}^{- d})

\displaystyle\sqrt{\frac{n}{r_{n}}}\Big{(}\hat{f}_{n,\rho}(\omega)-\textnormal{\mbox{I\negthinspace E}}[\hat{f}_{n,\rho}(\omega)]\Big{)}\overset{\mathbb{P}}{\longrightarrow}0.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Fourier analysis of serial dependence measures

Ria Van Hecke

Ruhr-Universität Bochum

Fakultät für Mathematik

44780 Bochum

Germany

Stanislav Volgushev

University of Toronto

Department of Statistical Sciences

Toronto, Ontario M5S 3G3

Canada

Holger Dette

Ruhr-Universität Bochum

Fakultät für Mathematik

44780 Bochum

Germany

Abstract

Classical spectral analysis is based on the discrete Fourier transform of the auto-covariances. In this paper we investigate the asymptotic properties of new frequency domain methods where the auto-covariances in the spectral density are replaced by alternative dependence measures which can be estimated by U-statistics. An interesting example is given by Kendall’s $\tau$ , for which the limiting variance exhibits a surprising behavior.

Keywords and Phrases: Spectral theory, strictly stationary time series, $U$ -statistics

AMS subject classification: 62M15, 62G20

1 Introduction

Over the years spectral analysis has developed into a fundamental important tool kit in the analysis of data from a stationary time series $\{X_{t}\}_{t\in{\mathbb{Z}}}$ . The spectral density, defined as the discrete Fourier transform of the auto-covariances, provides a convenient way to characterize the second order properties of a stationary sequence. Estimation of the spectral density is usually performed by smoothing the periodogram, that is the discrete Fourier transform of empirical auto-covariances [see for example Chapters 4 and 10 of Brockwell and Davis, (1987)].

It is well known that this approach is not able to capture non-linear features of time series dynamics such as changes in skewness, kurtosis or dependence in the extremes. This motivated numerous authors to describe serial dependence by considering spectral densities corresponding to a family of transformations of the original time series [see Hong, (1999, 2000), Li, (2008, 2012), Hagemann, (2013), Dette et al., (2015), Birr et al., (2014), Davis et al., (2013), Kley et al., (2016)]. Roughly speaking, these authors suggest to define a family of spectral densities, say $\{f(\lambda,x,y)~{}|~{}x,y\}$ , where the auto-covariances (at lag $k$ ) are replaced by functionals of the lag $k$ -distributions $\mathbb{P}(X_{t}\leq x,X_{t+k}\leq y)$ . This approach is attractive as it allows a more complete description of the serial dependence. The price for this flexibility is the calculation of a family of spectral densities, in contrast to the classical approach, which uses only one spectral density calculated as the discrete Fourier transform of the auto-covariances.

In the present paper we investigate a class of alternative spectral densities, which keeps the simplicity of the classical spectral theory but eliminates some drawbacks arising from the use of auto-covariances in its definition. More precisely, we consider general spectral densities of the form

[TABLE]

where for each $k\in{\mathbb{Z}}$ the quantity $\xi_{k}$ denotes a dependence measure between the random variables $X_{t}$ and $X_{t+k}$ (in the classical case $\xi_{k}=r_{k}=$ Cov $(X_{t},X_{t+k})$ ) and we implicitly assumed that $\sum_{k\in{\mathbb{Z}}}|\xi_{k}|<\infty$ . Spectral densities of the form (1.1) have been considered by Ahdesmäki et al., (2005), Zhou, (2012) and Carcea and Serfling, (2015), who replaced the lag $k$ auto-covariance by other measures of dependence such as Kendall’s $\tau$ , distance correlation, or L-moments. A thorough theoretical analysis of this idea for dependence measures that can be represented as linear functionals of the empirical copula at lag $k$ was conducted in Kley et al., (2016). Their analysis includes dependence measures such as Spearman’s rank autocorrelation [see Wald and Wolfowitz, (1943)], Blomqvist’s beta [see Blomqvist, (1950)] and Gini’s rank association coefficient [see Schechtman and Yitzhaki, (1987)]. However, the theory depends crucially on the linearity of the corresponding functional and cannot be generalized to other dependence measures. A particularly interesting dependence measure that is not covered by the analysis of Kley et al., (2016) is Kendall’s tau which can be represented as by

[TABLE]

where $C_{k}$ denotes the copula corresponding to lag $k$ . Note that Kendall’s tau is a non-linear functional of the lag $k$ copula. A classical approach to the estimation of Kendall’s tau is based on the representation

[TABLE]

where $(X_{1},Y_{1}),(X_{2},Y_{2})$ are independent copies with the same distribution as $(X_{0},X_{k})$ . Motivated by this example we are interested in the statistical properties of estimators of spectral densities of the form (1.1) with a measure $\xi_{k}$ of lag $k$ dependence that can be represented as

[TABLE]

where $({X_{0}^{(1)}},{X_{k}^{(1)}})^{T},\ldots,({X_{0}^{(m)}},{X_{k}^{(m)}})^{T}$ are independent copies of $({X_{0}},{X_{k}})^{T}$ and $h$ is a symmetric kernel of order $m$ . The representation (1.3) motivates to estimate $\xi_{k}$ by a $U$ -statistic, say $\xi_{n,k}$ , and to form the corresponding $U$ -lag-window estimate

[TABLE]

where $\{w_{n}(k)\}_{k=-(n-1),\ldots,n-1}$ are given weights. In Section 2 we will introduce the necessary notation and illustrate the general approach by several examples. The main results of the paper can be found in Section 3, where we investigate the asymptotic properties of the new estimates. In particular we prove consistency of the estimate (1.4) for a broad class of kernels $h$ and establish its asymptotic normality for several important cases including Kendalls $\tau$ . Interestingly the asymptotic variance of the $U$ -lag-window estimate based on Kendall’s tau depends on the spectral density (1.1) where the quantities $\xi_{k}$ are the lag $k$ Spearman’s rho correlations. The proofs are very involved and will be deferred to Section 4, while more technical arguments can be found in Section 5.

2 Examples of U-lag-window spectral densities and their estimators

Throughout this paper let $\{X_{t}\}_{t\in{\mathbb{Z}}}$ be a real-valued process and denote by $F$ and $F_{k}$ the marginal distribution function of $X_{t}$ and the distribution function of the pair $(X_{t},X_{t+k})$ , respectively ( $k\in{\mathbb{Z}}$ ). Recall the definition of the spectral density $\boldsymbol{\mathfrak{f}}_{\xi}$ in (1.1), where the measure of dependence (at lag $k$ ) has the representation (1.3) for a given kernel $h$ of order $m$ . Throughout this paper, we will maintain the following assumption

(C0)

The process $\{X_{t}\}_{t\in{\mathbb{Z}}}$ is strictly stationary. The functions $F$ and $F_{k}$ are continuous (for all $k\in{\mathbb{Z}}$ ) and $\sum_{k\in{\mathbb{Z}}}|\xi_{k}|<\infty$ .

Let $X_{1},\dots,X_{n}$ be the finite stretch of this process representing the observed data and define for any $k\in\{-(n-1),\dots\,n-1\}$ the set $\mathcal{T}_{k}:=\big{\{}t|t,t+k\in\{1,\dots,n\}\big{\}}$ . In the following example we illustrate how different kernels yield different measures of dependence and as consequence different spectral densities.

Example 2.1.

(i)

If $m=2$ and $h\big{(}({x_{1}},{y_{2}})^{T},({x_{2}},{y_{2}})^{T}\big{)}=\frac{1}{2}(x_{1}-x_{2})(y_{1}-y_{2})$ , then the representation (1.3) gives the auto-covariance at lag $k$ , that is

[TABLE]

and we obtain the classical spectral density.

(ii)

If $m=2$ , $I(\cdot)$ denotes the indicator function and the kernel is defined by

[TABLE]

the representation (1.3) yields Kendall’s $\tau$ at lag $k$ , that is

[TABLE]

The corresponding spectral density will be denoted by

[TABLE]

As the distribution function $F$ and $F_{k}$ are assumed to be continuous, $\tau_{k}$ can also be represented in the form (1.3) using the kernel

[TABLE]

(iii)

If $m=3$ , $\Gamma\{i,j,k\}$ denotes the set of all permutations of $\{i,j,k\}$ and the kernel $h$ is defined by

[TABLE]

we obtain the (lag $k$ ) population version of Spearman’s $\rho$ , that is

[TABLE]

The corresponding spectral density will be denoted by

[TABLE]

Given continuity of $F$ and $F_{k}$ , $\rho_{k}$ can also be represented in the form (1.3) using the following kernel

[TABLE]

In the remaining part of the manuscript we estimate the dependence measures $\xi_{k}$ (at lag $k$ ) by a $U$ -statistic of order $m$ , that is

[TABLE]

Estimates of corresponding spectral densities are defined as in (1.4). The asymptotic properties of such spectral density estimates are investigated in the following section.

Before proceeding, we recall the Hoeffding decomposition for U-statistics. Recall that $h$ is a symmetric kernel of order $m$ and let $\boldsymbol{Y}^{(1)},\dots,\,\boldsymbol{Y}^{(m)}$ denote independent identically distributed copies of $\begin{pmatrix}X_{0}\\ X_{k}\end{pmatrix}\sim F_{k}$ . We now recursively define kernels $h_{c,k}$ by

[TABLE]

where $G_{\boldsymbol{y_{j}}}$ denotes the distribution of the Dirac measure at $\boldsymbol{y_{j}}$ . If

[TABLE]

is the U-statistic based on the kernel $h_{c,k}$ we obtain for the statistic in (2.7) the decomposition [see, for example Lee, (1990)]

[TABLE]

which will be an important tool in the asymptotic analysis of the following sections.

3 Asymptotic theory for U-lag-window estimates

3.1 Consistency of U-lag-window estimates

Our first main result shows that for a general class of symmetric kernels the statistic $\hat{f}_{n,\xi}$ consistently estimates the spectral density $\boldsymbol{\mathfrak{f}}_{\xi}$ defined in (1.1) if the following assumptions are satisfied.

(C1)

The lag window $w_{n}(\cdot)$ can be written in the form $w_{n}(k)=w\big{(}\frac{k}{r_{n}}\big{)}$ , where $w(\cdot)$ is a uniformly continuous function, supported on the interval $[-1,1]$ , satisfying $\|w\|_{\infty}\leq 1$ , $w(0)=1$ , $w(-x)=w(x)$ for all $x\in\mathbb{R}$ , and $r_{n}=n^{\frac{1}{2}-\nu}$ for some $\nu\in(0,\frac{1}{2})$ .

(C2)

There exist constants $\delta,M_{0}>0$ such that for all $t_{1},\dots,t_{m},k\in{\mathbb{Z}}$ , $1\leq j\leq 2m$ ,

[TABLE]

where $G$ , $G_{j}^{(1)}$ and $G_{j}^{(2)}$ denote the joint distributions of $(X_{t_{(1)}},\dots,X_{t_{(2m)}})$ , $(X_{t_{(1)}},\dots,X_{t_{(j)}})$ and $(X_{t_{(j+1)}},\dots,X_{t_{(2m)}})$ , respectively, and $t_{(1)}\leq\dots\leq t_{(2m)}$ is the order statistic of $\{t_{1},t_{1}+k,t_{2},t_{2}+k\,\dots,t_{m},t_{m}+k\}$ .

(C3)

The process $\{X_{t}\}_{t\in{\mathbb{Z}}}$ is $\beta$ -mixing and for some $\delta^{\prime}<\delta$ with $\beta$ -mixing coefficients satisfying $\beta(n)=O(n^{-\gamma}),$ where $\gamma=\frac{2+\delta^{\prime}}{\delta^{\prime}}$ .

Theorem 3.1.

If Assumptions (C0) – (C3) are satisfied, we have for any fixed $\omega\in\mathbb{R}$

[TABLE]

3.2 Asymptotic distribution of U-lag-window estimates

In this section we establish asymptotic normality of the spectral density estimators. Throughout this section we focus our attention on settings where $\xi$ is Kendall’s $\tau$ or Spearman’s $\rho$ . Recalling the discussion in Example 2.1 it follows that $\tau_{k}$ und $\rho_{k}$ can be estimated by the $U$ -statistics

[TABLE]

and

[TABLE]

respectively. Note that these $U$ -statistics have bounded kernels, satisfy Assumption (C2) and can be written as a product or sum of products of two centered functions of random variables. This special structure is crucial for obtaining the asymptotic distribution results given below. It is not clear if similar results hold without imposing this kind of structure on the kernel $h$ .

Throughout this section we write $\xi_{k}$ if assumptions or results are the same for both Kendall’s $\tau$ and Spearman’s $\rho$ . On the other hand we explicitly write $\tau$ or $\rho$ if the results or arguments are different. For example, from (2.9) we obtain for Kendall’s $\tau$ and Spearman’s $\rho$ the decomposition

[TABLE]

and therefore only $\xi_{k}$ appears in the formula. We will demonstrate that under suitable assumptions the term corresponding to the linear part converges to a normal distribution and that the term corresponding to the degenerate part is asymptotically negligible. In what follows we assume that (C0) – (C3) hold and impose the following additional conditions.

(N1)

The process $\{X_{t}\}_{t\in{\mathbb{Z}}}$ is $\alpha$ -mixing with corresponding $\alpha$ -mixing coefficients satisfying $\alpha(n)=O(n^{-\nu}),$ where $\nu>7$ .

(N2)

For the lag window generator $w$ there exists a ’characteristic exponent’ $d>0$ being the largest integer such that

[TABLE]

exists, is finite and non-zero. For this $d$ we have $\sum_{k\in{\mathbb{Z}}}|k|^{d}|\xi_{k}|<\infty$ .

(N3)

$r_{n}=o(n^{\theta})$ where $\theta=\min\Big{\{}\frac{2(\delta-\delta^{\prime})}{\delta^{\prime}(2+\delta)},1\Big{\}}$ and $\delta,\delta^{\prime}$ are from conditions (C2),(C3).

Remark 3.2.

The summability condition $\sum_{k\in{\mathbb{Z}}}|k|^{d}|\xi_{k}|<\infty$ in assumption (N2) implies the existence of the ’generalized $d^{th}$ derivative’ of $\boldsymbol{\mathfrak{f}}_{\xi}(\omega)$

[TABLE]

and can thus be interpreted as a smoothness condition; note that for even $d$ this coincides with the usual $d$ ’th order derivative. The other part of assumption (N2) places mild restrictions on the lag-window generator for which the rate of the scale parameter is limited by assumption (N3). Note that (N3) is satisfied for scale parameters leading to optimal asymptotic mean squared error rates (see Remark 3.5). * *

We begin by examining the asymptotic distribution of $\hat{f}_{n,\rho}(\omega)$ .

Theorem 3.3.

Assume that conditions (C0) – (C3) and (N1) – (N3) are satisfied and that $\omega\in(-\pi,\pi]$ . If $\boldsymbol{\mathfrak{f}}_{\rho}(\omega)\neq 0$ , then

[TABLE]

where

[TABLE]

and

[TABLE]

If $\boldsymbol{\mathfrak{f}}_{\rho}(\omega)=0$ , we have

[TABLE]

Interestingly, the limiting distribution has exactly the same form as the limiting distribution for the usual spectral density where $\xi$ corresponds to covariance. This is remarkable, since Spearman’s $\rho$ is based on covariances of ranks. Asymptotic normality of $\hat{f}_{n,\rho}(\omega)$ was also obtained in Kley et al., (2016) under a different set of assumptions on the serial dependence and using a completely different set of proof techniques. Specifically, their results require dependence to decay exponentially. The next result establishes asymptotic normality of $\hat{f}_{n,\xi}(\omega)$ with $\xi$ corresponding to Kendall’s $\tau$ . The asymptotic distribution of $\hat{f}_{n,\tau}(\omega)$ cannot be obtained from the findings in Kley et al., (2016) (under any assumptions) since Kendall’s $\tau$ is a non-linear functional of the copula.

Theorem 3.4.

Assume that conditions (C0) – (C3) and (N1) – (N3) are satisfied and that $\omega\in(-\pi,\pi]$ . If $\boldsymbol{\mathfrak{f}}_{\rho}(\omega)\neq 0$ , then

[TABLE]

where

[TABLE]

and

[TABLE]

If $\boldsymbol{\mathfrak{f}}_{\rho}(\omega)=0$ , we have

[TABLE]

It is remarkable that the asymptotic variance of estimator $\hat{f}_{n,\tau}(\omega)$ depends on the spectral measure $\boldsymbol{\mathfrak{f}}_{\rho}(\omega)$ obtained from Spearman’s $\rho$ (provided that $\boldsymbol{\mathfrak{f}}_{\rho}(\omega)\neq 0$ ). This is in sharp contrast to the finding in Theorem 3.3 and spectral density estimation based on covariances. The results in Theorem 3.4 provide an asymptotic analysis of the estimator introduced in Ahdesmäki et al., (2005). We conclude this section by commenting on the optimal choice of window length $r_{n}$ .

Remark 3.5.

Both theorems allow to determine the scale parameter $r_{n}$ such that the asymptotic mean squared error is minimized. To be precise, define $\sigma_{\xi,\omega}^{2}:=\sigma_{\xi}^{2}(\omega)$ . Then the asymptotic mean squared error takes the form

[TABLE]

Assuming that $b_{\xi,\omega}\neq 0$ , we obtain that this expression is minimized for

[TABLE]

Note that for $d=2$ the asymptotic MSE is of the order $n^{-4/5}$ . In that case the above scale parameter $r_{n}$ is of order $n^{1/5}$ and satisfies Assumptions (C1) and (N3) if the mixing coefficients $\beta(n)$ decay sufficiently quickly. More precisely, as for Kendall’s $\tau$ and Spearman’s $\rho$ the kernels of the $U$ -statistic are bounded, we can choose $\delta$ in Assumption (C2) arbitrarily large. Assumption (N3) is satisfied if $1/5<\theta=\min\Big{\{}\frac{2(\delta-\delta^{\prime})}{\delta^{\prime}(2+\delta)},1\Big{\}}$ , which is equivalent to $\delta^{\prime}<10\delta/(12+\delta)$ . Since $\delta$ is arbitrarily large, we can choose any $\delta^{\prime}<10$ and (N3), (C3) will hold if $\beta(n)=O(n^{-\gamma})$ for some $\gamma>6/5$ . * *

4 Proofs

4.1 Proof of Theorem 3.1

We first illustrate the main steps in the proofs. These rely on several delicate bounds, which will be shown below. Rearranging sums in (1.4) and using assumption (C1), the U-lag-window estimate can be decomposed as follows

[TABLE]

where

[TABLE]

We will show in Section 4.1.1 that

[TABLE]

For a proof of $s_{n,3}\overset{\mathbb{P}}{\to}0$ we use the Hoeffding decomposition (2.9), which gives

[TABLE]

where

[TABLE]

The assertion of the theorem now follows from the estimates

[TABLE]

which are shown in Section 4.1.2 and 4.1.3, respectively.

4.1.1 Proof of (4.1)

Using the fact that $w(0)=1$ and $\sup_{|k|<n}\big{(}\big{|}w\big{(}\frac{k}{r_{n}}\big{)}\big{|}+1\big{)}\leq 2\quad\forall\,n\in\mathbb{N}$ , we obtain for any fixed $0\leq K<n$

[TABLE]

As the lag window generator $w(\cdot)$ is continuous at [math] we obtain for any fixed $K$

[TABLE]

As inequality (4.5) holds for all $K$ , we can conclude that $s_{n,1}\overset{a.s.}{\to}0$ , as the $\xi_{k}$ ’s are absolutely summable. By the same argument, we also have $|s_{n,2}|\leq\frac{1}{2\pi}\sum_{|k|\geq n}|\xi_{k}|\to 0$ for $n\rightarrow\infty$ .

4.1.2 Proof of (4.3)

The proof is based on an extension to lagged data of a covariance inequality by Yoshihara, (1976). More precisely, we prove in the technical Appendix, Section 5.3.3, that for fixed $2\leq c\leq m$

[TABLE]

where $\theta=\min\Big{\{}\frac{2(\delta-\delta^{\prime})}{\delta^{\prime}(2+\delta)},1\Big{\}}$ . Note that the above bound holds uniformly over a growing number of lags $k$ while the result in Yoshihara, (1976) only holds for a fixed $k$ . Observe that,

[TABLE]

where the constant $C_{m}$ does not depend on $m$ . Consequently, equation (4.6) yields $\textnormal{\mbox{I\negthinspace E}}|d_{n,2}|=O(r_{n}n^{-1/2-\theta/2}),$ which establishes (4.3).

4.1.3 Proof of (4.4)

Introduce the notation $\boldsymbol{X}_{t,k}:=(X_{t},X_{t+k})^{T}$ . We only consider positive lags $k$ , negative $k$ can be treated analogously. Similar arguments as in the proof of (4.3) yield

[TABLE]

Next,

[TABLE]

As $(\textnormal{\mbox{I\negthinspace E}}|Z|^{p})^{\frac{1}{p}}\leq(\textnormal{\mbox{I\negthinspace E}}|Z|^{q})^{\frac{1}{q}}$ for $p<q$ , we have

[TABLE]

which gives

[TABLE]

The following bound will be established in Lemma 5.5 in the Appendix (see Section 5.2.3)

[TABLE]

Thus

[TABLE]

By assumption (C3) $\sum_{j=1}^{\infty}\beta^{\frac{\delta}{2+\delta}}(j)<\infty$ . Therefore, we have

[TABLE]

Equations (4.7) and (4.8) yield $\textnormal{\mbox{I\negthinspace E}}|d_{n,1}|=O(r_{n}n^{-\frac{1}{2}})$ and the assertion follows observing that $r_{n}=n^{\frac{1}{2}-\nu}$ . $\Box$

4.2 Proof of Theorem 3.4 and 3.3 - main arguments

In the following proof we write $\xi$ if the results hold for general dependence measures that fulfill the assumptions (C0) – (C3) and (N1) – (N3). Otherwise we explicitly write $\tau$ or $\rho$ .

Under Assumption (N3) and with (4.3),

[TABLE]

Furthermore, in Section 5.3.4 we will prove that

[TABLE]

where $\tilde{f}_{n,\xi}(\omega)=\frac{1}{2\pi}\sum_{|k|\leq r_{n}}w\Big{(}\frac{k}{r_{n}}\Big{)}\Big{\{}\xi_{k}+\frac{m}{n}\sum_{t=1}^{n}h_{1,k}^{\xi}(\boldsymbol{X}_{t,k})\Big{\}}e^{-ik\omega}$ . Then,

[TABLE]

where, by construction, the random variables $(W_{n,t}^{\xi})_{t=1,\dots,n}$ form a triangular array of $\beta$ -mixing random variables with mixing coefficients $\beta^{W}(u)\leq\beta^{X}(0\vee(u-2r_{n}))$ . To prove the asymptotic normality, we will apply the blocking technique described in Section 5.1. That is, we choose $\mu_{n}$ blocks of length $p_{n}$ and $\mu_{n}$ blocks of length $q_{n}$ such that

[TABLE]

According to Assumptions (C1), (C3) and (N3) one possible choice is $r_{n}=O(n^{1/2-\nu}),\,0<\nu<\min\{\theta,\frac{1}{2}\}$ , $q_{n}=O(n^{1/2})$ , $p_{n}=O(n^{1/2+\nu})$ . Then we decompose

[TABLE]

Next, we show that the remaining part and the part corresponding to the “small” blocks are negligible whereas the “big” blocks satisfy the Lyapunov condition and yield the asymptotic variance. Observe that

[TABLE]

for all $k,t\in{\mathbb{Z}}$ , and hence, $\sum_{t=1}^{n}W_{n,t}^{\xi}(\omega)$ is real and symmetric in $\omega$ . To prove (4.11) observe that by stationarity of $\{X_{t}\}_{t\in{\mathbb{Z}}}$ we have that $(\boldsymbol{X}_{0,k})\overset{\mathcal{D}}{=}(\boldsymbol{X}_{0,-k})$ and $(\boldsymbol{X}_{0,k}^{(1)})\overset{\mathcal{D}}{=}(\boldsymbol{X}_{0,-k}^{(1)})$ and hence, $\tau_{k}=\tau_{-k}$ , $\rho_{k}=\rho_{-k}$ . Consequently, we obtain in the case of Kendall’s $\tau$ and Spearmans’s $\rho$ $h_{1,k}(({x},{y})^{T})=h_{1,-k}(({y},{x})^{T})$ , which yields

[TABLE]

Moreover, we will show in section 4.3 that for any $a_{n}\rightarrow\infty$ with $a_{n}/n=o(1)$ , $r_{n}/a_{n}=o(1)$ and $\omega\in(-\pi,\pi]$ ,

[TABLE]

Next, the last summand in (4.10) contains at most $O(p_{n}+q_{n})$ summands. Hence, by (4.12) and (4.13) we have,

[TABLE]

that is $\sum_{t\in\mathcal{R}}W_{n,t}^{\xi}(\omega)=o_{\mathbb{P}}\Big{(}\sqrt{\frac{r_{n}}{n}}\Big{)}$ . Next, we show that the sum over the small blocks is negligible. By Lemma 5.1 with the function $\tilde{g}(\cdot)=I(\cdot\geq\varepsilon)$ we obtain

[TABLE]

where $\zeta^{\xi}_{n,t}(\omega)$ denote the random variables of the independent block sequence corresponding to the $\Delta$ -blocks. By the assumptions on $p_{n}$ and $\beta^{X}$ the term on the right hand side in the above expression converges to 0. Observing that the variables $\zeta^{\xi}_{n,t}(\omega)$ are centered and (4.12) or respectively (4.13) applied to the independent blocks $\sum_{t\in\Delta_{j}}\zeta^{\xi}_{n,t}(\omega)$ yields

[TABLE]

where we have used the definition of $\zeta^{\xi}_{n,t}$ and the assumption that $q_{n}/p_{n}=o(1)$ . Hence, it remains to prove that $\sqrt{\frac{n}{r_{n}}}\sum_{j=1}^{\mu_{n}}\sum_{t\in\Gamma_{j}}W_{n,t}^{\xi}(\omega)$ converges weakly. Note that for any measurable set $A$ , by Lemma 5.1 with function $g(\cdot)=I(\cdot\in A)$ and the assumptions on $q_{n}$ and $\beta^{X}$ , we have

[TABLE]

In order to prove the convergence in distribution of $\sqrt{\frac{n}{r_{n}}}\sum_{j=1}^{\mu_{n}}\sum_{t\in\Gamma_{j}}W_{n,t}^{\xi}(\omega)$ , it suffices to show that the triangular array of independent random variables

[TABLE]

satisfies the Lyapunov condition. To achieve this, we show that together with (4.12) or (4.13), respectively,

[TABLE]

and if $\boldsymbol{\mathfrak{f}}_{\rho}(\omega)\neq 0$ ,

[TABLE]

for some constant $c>0$ and $n$ sufficiently large. Hence, the Lyapunov condition is satisfied as $\mu_{n}\rightarrow\infty$ and we can conclude that the distribution of $\sqrt{\frac{n}{r_{n}}}\sum_{j=1}^{\mu_{n}}\sum_{t\in\Gamma_{j}}\zeta^{\xi}_{n,t}(\omega)$ , where $\xi$ is either Kendall’s $\tau$ or Spearman’s $\rho$ , converges weakly to a normal distribution, i.e.

[TABLE]

If $\boldsymbol{\mathfrak{f}}_{\rho}(\omega)=0$ , it follows from equations (4.12) and (4.13) that

[TABLE]

and hence, by (4.9),

[TABLE]

Finally, we have, for $\xi$ representing either $\tau$ or $\rho$ ,

[TABLE]

Hence, the bias is given by

[TABLE]

Next, choose some $L_{n}\rightarrow\infty$ such that $L_{n}/r_{n}\rightarrow 0$ . Then,

[TABLE]

By Assumption (N2), $\sum_{k\in{\mathbb{Z}}}|k|^{q}|\xi_{k}|<\infty$ for $q\leq d$ and therefore,

[TABLE]

For the second term in (4.2) as by Assumption (N2), $\sum_{k\in{\mathbb{Z}}}|k|^{q}|\xi_{k}|$ is finite for $q\leq d$ we obtain

[TABLE]

where $\sup_{v\in[0,1]}\frac{w(v)-1}{|v|^{d}}$ is bounded since $w$ is bounded and the limit for $|v|\rightarrow 0$ exists by Assumption (N2). Finally,

[TABLE]

where the first summand is of order $o(r_{n}^{-d})$ since $\frac{L_{n}}{r_{n}}\rightarrow 0$ and $|k|^{d}\xi_{k}$ is absolutely summable. The second summand is of order $o(r_{n}^{-d})$ as $\sum_{k\in{\mathbb{Z}}}|k|^{q}|\xi_{k}|$ is finite. Hence,

[TABLE]

Conclude applying Slutsky’s theorem. It remains to prove (4.12), (4.13), (4.14) and (4.15). Detailed proofs of these results are given in the remaining part of this section. $\Box$

4.3 Proof of (4.12) – (4.15)

The proofs of Theorem (4.12) – (4.15) rely on two auxiliary results. These will be stated in this section whereas their detailed proof is deferred to the Appendix. The first Lemma bounds cumulants through $\alpha$ -mixing coefficients, see Section 5.3.1 for a proof.

Lemma 4.1.

For $q\in\mathbb{N}$ , let $(X_{t}^{(1)})_{t\in{\mathbb{Z}}},\dots,(X_{t}^{(q)})_{t\in{\mathbb{Z}}}$ be independent copies of a strictly stationary polynomially $\alpha$ -mixing process $(X_{t})_{t\in{\mathbb{Z}}}$ that are independent. For any $t\in{\mathbb{Z}}$ , let $V_{t}:=(X_{t},X_{t}^{(1)},\dots,X_{t}^{(q)})$ . Then, for $p\in\mathbb{N}$ , $t_{1},\dots,t_{p}$ and measurable sets $A_{1},\dots,A_{p}\subset\mathbb{R}^{q+1}$ , there exists a constant $C_{p,q}$ such that

[TABLE]

The Lemma that follows is a key observation which makes it possible to use theory from classical spectral density estimation in the case where the kernel $h$ can be written as a sum of a product of centered functions of random variables. This is a crucial insight for proving asymptotic normality of the estimators $\hat{f}_{n,\xi}$ .

Lemma 4.2.

Let $h$ denote a $U$ -statistic of order $m$ and assume that $(X_{t}^{(1,j)})_{t\in{\mathbb{Z}}},\dots,(X_{t}^{(m-1,j)})_{t\in{\mathbb{Z}}}$ , $j=1,\dots,q$ are independent copies of a strictly stationary process $(X_{t})_{t\in{\mathbb{Z}}}$ that are mutually independent. Then, for $t_{j},k_{j}\in{\mathbb{Z}}$ , $j=1,\dots,q$ ,

[TABLE]

where

[TABLE]

In particular, if $(X_{t}^{(j)})_{t\in{\mathbb{Z}}}$ , $j=1,\dots,5$ are independent copies of the strictly stationary process $\{X_{t}\}_{t\in{\mathbb{Z}}}$ that are independent of each other,

(i)

for Kendall’s $\tau$

[TABLE]

where $(Y_{t}^{(j)})_{t\in{\mathbb{Z}}}:=\Big{(}I(X_{t}<X_{t}^{(j)})-\frac{1}{2}\Big{)}_{t\in{\mathbb{Z}}}$ , $j=1,2$ .

(ii)

for Spearman’s $\rho$

[TABLE]

where $\Gamma\{i,j,k\}$ denotes the set of all permutations of $\{i,j,k\}$ .

Lemma 4.2 is proved in Section 5.3.2.

4.3.1 Proof of (4.12)

Let $(X_{t}^{(1)})_{t\in{\mathbb{Z}}}$ and $(X_{t}^{(2)})_{t\in{\mathbb{Z}}}$ be independent copies of the strictly stationary $\beta$ -mixing process $\{X_{t}\}_{t\in{\mathbb{Z}}}$ that are independent of each other. Define processes $(Y_{t}^{(1)})_{t\in{\mathbb{Z}}}$ and $(Y_{t}^{(2)})_{t\in{\mathbb{Z}}}$ by

[TABLE]

Note that the processes $(Y_{t}^{(1)})_{t\in{\mathbb{Z}}}$ and $(Y_{t}^{(2)})_{t\in{\mathbb{Z}}}$ are strictly stationary.

By Lemma 4.2 (i) we have for Kendall’s $\tau$ ,

[TABLE]

Next, let $\mathcal{T}_{k,a_{n}}:=\{t|t,t+k\in\{1,\dots,a_{n}\}\}$ and define for $j=1,2$ ,

[TABLE]

Then, similar arguments as in the proof of Lemma 4.2 yield

[TABLE]

and consequently,

[TABLE]

where

[TABLE]

We will now derive upper bounds for $|D_{1,n}|$ and $|D_{2,n}|$ separately. First, as $|w(\cdot)|\leq 1$ , $|e^{i\cdot}|=1$ and $\mathcal{T}_{k,a_{n}}\subset\{1,\dots,a_{n}\}$ , we have

[TABLE]

where the latter inequality follows by the strict joint stationarity of the involved processes. Next, observe that, from Theorem 2.3.1 in Brillinger, (1975), it follows that

[TABLE]

where $V_{t_{j}}:=(X_{t_{j}},X_{t_{j}}^{(1)},X_{t_{j}}^{(2)})$ , $A_{1}=\{x\in\mathbb{R}^{3}:x_{1}<x_{2}\}=A_{2}$ and $A_{3}=\{x\in\mathbb{R}^{3}:x_{1}<x_{3}\}=A_{4}$ . Furthermore, let $u_{0}:=0$ and consider the set

[TABLE]

whose cardinality is $\leq c_{p}(m+1)^{p-1}$ . Hence, applying Lemma 4.1 with $q=2$ yields

[TABLE]

where we used Assumption (N1) for the last estimate and $\alpha:=\alpha^{X}$ are the mixing coefficients of the process $\{X_{t}\}_{t\in{\mathbb{Z}}}$ . Therefore,

[TABLE]

Next,

[TABLE]

and observing that $(\sum_{t_{1}=1}^{a_{n}}-\sum_{t_{1}\in\mathcal{T}_{k_{1},a_{n}}})$ contains $O(r_{n})$ summands, we obtain by assumption (N2),

[TABLE]

Analogously, $D_{2,n}^{(2)}=O\Big{(}\frac{r_{n}^{2}}{n^{2}}\Big{)}$ and hence,

[TABLE]

Then, equations (4.20) and (4.21) together yield

[TABLE]

Next, observe that $h_{j}^{\tau}(\omega)$ , $j=1,2$ , is eight times the classical centered lag-window estimator of the spectral density of the stationary process $(Y_{t}^{(j)})_{t\in{\mathbb{Z}}}$ based on the observations $Y_{1}^{(j)},\dots,Y_{a_{n}}^{(j)}$ . Consequently, if we show that

[TABLE]

the same arguments as given in the proof of Theorem 9.3.4 in Anderson, (1971) yield

[TABLE]

Finally, (4.23) can be proved using similar arguments and equation (4.24) together with equation (4.22) conclude the proof of (4.12).

4.3.2 Proof of (4.13)

Let $(X_{t}^{(j)})_{t\in{\mathbb{Z}}}$ , $j=1,\dots,5$ be independent copies of the strictly stationary process $\{X_{t}\}_{t\in{\mathbb{Z}}}$ that are independent of each other. Then, by Lemma 4.2 (ii) for Spearman’s $\rho$ ,

[TABLE]

where $\Gamma\{i,j,k\}$ denotes the set of all permutations of $\{i,j,k\}$ .

Next, let $\mathcal{T}_{k,a_{n}}:=\{t|t,t+k\in\{1,\dots,a_{n}\}\}$ and define

[TABLE]

and

[TABLE]

Then, similarly as in the proof of Lemma 4.2,

[TABLE]

and analogous arguments as for Kendall’s $\tau$ give

[TABLE]

Next, similar arguments as were used in order to derive (4.20) yield

[TABLE]

and the same arguments as in the proof of Theorem 9.3.4 in Anderson, (1971) show

[TABLE]

Hence, equation (4.28) together with equation (4.27) conclude the proof of (4.13).

4.3.3 Proof of (4.14) and (4.15)

By Lemma 4.2, we know that for Kendall’s tau

[TABLE]

where $(Y_{t}^{(j)})_{t\in{\mathbb{Z}}}=\Big{(}I(X_{t}<X_{t}^{(j)})-\frac{1}{2}\Big{)}_{t\in{\mathbb{Z}}}$ . Therefore, we can write

[TABLE]

where $\vartheta^{\tau}_{t_{l}}=\frac{1}{2\pi}\sum_{|k_{l}|\leq r_{n}}w\Big{(}\frac{k_{l}}{r_{n}}\Big{)}e^{-ik_{l}\omega}\frac{2}{n}[4Y_{t_{l}}^{(l)}Y_{t_{l}+k_{l}}^{(l)}-\tau_{k_{l}}]$ , $l=1,\dots,4$ . Observing that by construction $\textnormal{\mbox{I\negthinspace E}}[\sum_{t_{l}\in\Gamma_{j}}\vartheta^{\tau}_{t_{l}}(\omega)]=0$ , we express the fourth moment in terms of a fourth order cumulant and 3 products of second order cumulants, that is

[TABLE]

Note that by construction for all $k,l\in\{1,\dots,4\}$ , $\sum_{t_{k}\in\Gamma_{j}}\vartheta^{\tau}_{n,t_{k}}\overset{\mathcal{D}}{=}\sum_{t_{l}\in\Gamma_{j}}\vartheta^{\tau}_{n,t_{l}}$ . Therefore, each of the second order cumulants is equal to

[TABLE]

where we have used a similar argument as in equation (4.3.3). Hence, we obtain by Theorem 2.3.1 in Brillinger, (1975)

[TABLE]

Following the arguments of Rosenblatt, (1984) on pages 1177-1178, we can express the fourth order cumulant of products in terms of cumulants of the factors, i.e. we obtain

[TABLE]

where the latter sum extends over all indecomposable partitions $\nu=\nu_{1}\cup\dots\cup\nu_{r}$ of the table

[TABLE]

In order to bound this sum, we need that for $2\leq p\leq 8$ and $t\in{\mathbb{Z}}$

[TABLE]

with $V_{t}:=(X_{t},X_{t}^{(1)},\dots,X_{t}^{(4)})$ and measurable sets $A_{1},\dots,A_{p}\subset\mathbb{R}^{5}$ . This follows by Lemma 4.1 and Assumption (N1):

[TABLE]

where $u_{0}:=0$ and $\mathcal{S}_{m}:=\{(u_{1},\dots,u_{p})\in{\mathbb{Z}}^{p}|\max_{i,j=0,\dots,p}|u_{i}-u_{j}|=m\}$ have been introduced in the proof of (4.12).

Next, arguments as in Rosenblatt, (1984) on page 1177–1178 yield

[TABLE]

and together with (4.12) we obtain

[TABLE]

Furthermore, as $\boldsymbol{\mathfrak{f}}_{\xi}(\omega)\neq 0$ , by (4.12)

[TABLE]

for some constant $c>0$ and $n$ sufficiently large. This yields (4.14) and (4.15) in the case where $\xi$ is Kendall’s $\tau$ .

In the case where $\xi$ is Spearman’s $\rho$ we have by Lemma 4.2

[TABLE]

where

[TABLE]

Observing that by construction $\textnormal{\mbox{I\negthinspace E}}[\sum_{t_{l}\in\Gamma_{j}}\vartheta^{\rho}_{t_{l}}(\omega)]=0$ , we have similarly as for Kendall’s $\tau$

[TABLE]

where,

[TABLE]

Following the arguments of Rosenblatt, (1984) on pages 1177-1178, we express the fourth order cumulants of products of random variables in terms of cumulants of the factors which, similarly as in (4.30), can be bounded by Lemma 4.1 for $V_{t}:=(X_{t}^{(1)},\dots,X_{t}^{(9)})$ and $A_{1},\dots,A_{p}\in\mathbb{R}^{9}$ . After that, arguments as in Rosenblatt, (1984) yield

[TABLE]

and together with (4.13) we obtain $\sum_{j=1}^{\mu_{n}}\textnormal{\mbox{I\negthinspace E}}\big{[}\big{(}\sum_{t\in\Gamma_{j}}\zeta^{\rho}_{n,t}(\omega)\big{)}^{4}\big{]}=O\big{(}\frac{\mu_{n}p_{n}^{2}r_{n}^{2}}{n^{4}}\big{)}.$ Furthermore, as $\boldsymbol{\mathfrak{f}}_{\xi}(\omega)\neq 0$ , by (4.13),

[TABLE]

for some constant $c>0$ and $n$ sufficiently large. This concludes the proof. $\Box$

Acknowledgements. The authors would like to thank Martina Stein, who typed parts of this manuscript with considerable technical expertise. This work has been supported in part by the Collaborative Research Center “Statistical modeling of nonlinear dynamic processes” (SFB 823, Teilprojekt A1, C1) of the German Research Foundation (DFG).

5 Appendix: technical details

The proofs of Theorems 3.3 and 3.4 rely on a blocking technique which will be summarized in Section 5.1. In Section 5.2 we state and prove the covariance inequalities that are crucial in order to derive the convergence of the linear and degenerate parts of the U-lag-window estimate. Finally, in Section 5.3 we provide the details for the proofs of results and equations given in Section 4.

For simplicity of notation, let $\boldsymbol{X}_{t,k}:=(X_{t},X_{t+k})^{T}$ .

5.1 Blocking results for stationary $\beta$ -mixing processes

In order to transfer classical results from the iid case to sums of $\beta$ -mixing stationary time series, we apply a blocking technique with alternate ”large” blocks of size $p_{n}$ and ”small” blocks of size $q_{n}$ from Arcones and Yu, (1994) based on a blocking technique introduced by Yu, (1994) with blocks of equal size $p_{n}$ . For each fixed $n$ , we divide the original sequence $X:=(X_{1},\dots,X_{n})$ into $\mu_{n}$ blocks of size $p_{n}$ alternating with $\mu_{n}$ blocks of size $q_{n}$ and a remainder block $\mathcal{R}$ of length $n-2\mu_{n}$ . The block size $q_{n}$ of the ”small” $\Delta$ blocks is chosen depending on the mixing conditions on $X$ and the size of $r_{n}$ . That is, $q_{n}$ is chosen large enough such that the $\Gamma$ blocks are ”almost” independent of each other, but small enough such that the sequence composed of these $\Gamma$ blocks behaves similarly to the original mixing sequence. The block size $p_{n}$ is chosen analogously. More precisely, we assume that

[TABLE]

and define $\text{for }j=1,\dots,\mu_{n}$

[TABLE]

We denote the random variables of $X$ belonging to block $\Gamma_{j}$ , $\Delta_{j},\,j=1,\dots,\mu_{n}$ or $\mathcal{R}$ by

[TABLE]

respectively. This yields a sequence of alternating $\Gamma$ and $\Delta$ blocks

[TABLE]

We then construct a one-dependent sequence $Y$ of independent blocks defined as

[TABLE]

and independent of the original sequence $X$ . Furthermore, the blocks $Y(\Gamma_{j}):=\{Y_{i}:i\in\Gamma_{j}\}$ , $Y(\Delta_{j}):=\{Y_{i}:i\in\Delta_{j}\},\,j=1,\dots,\mu_{n}$ and $Y(\mathcal{R}):=\{Y_{i}:i\in\mathcal{R}\}$ are identically distributed as the corresponding blocks in the sequence $X$ , i.e.

[TABLE]

The existence of a proper measurable space that hosts both sequences, $X$ and the independent block sequence $Y$ , as well as measurability issues on this space are adressed in Yu, (1994). Denote by $X_{\Gamma}$ and $Y_{\Gamma}$ block sequences corresponding to the $\Gamma$ blocks and by $X_{\Delta}$ and $Y_{\Delta}$ the block sequences corresponding to the $\Delta$ blocks, e.g.

[TABLE]

Note that we choose the block size $q_{n}$ such that the dependence between the blocks $X_{\Gamma}$ of the original $\beta$ -mixing sequence $X$ becomes weaker as $q_{n}$ increases. The next lemma is a slightly adapted version of Lemma 4.1 in Yu, (1994) and is proven analogously. It shows that the $\Gamma$ or, respectively, $\Delta$ blocks of the original sequence $X$ can be related to the $\Gamma$ or, respectively, $\Delta$ blocks of the independent block sequence $Y$ in the following way.

Lemma 5.1.

Denote by $Q$ and $\tilde{Q}$ be the distributions of $X_{\Gamma}$ and $Y_{\Gamma}$ , respectively. Then, for any measurable function $g$ on $\mathbb{R}^{\mu_{n}p_{n}}$ with $\|g\|_{\infty}\leq M<\infty$ ,

[TABLE]

Similarly, if $P$ and $\tilde{P}$ denote the distributions of $X_{\Delta}$ and $Y_{\Delta}$ , respectively, and if $\tilde{g}$ is a measurable function on $\mathbb{R}^{\mu_{n}q_{n}}$ with $\|\tilde{g}\|_{\infty}\leq N<\infty$ , then

[TABLE]

In order to establish the convergence in probability of the parts of the U-lag-window estimate corresponding to the linear and degenerate part in the Hoeffding decomposition we prove several covariance inequalities for $\beta$ -mixing data. To this end we apply a coupling technique by Berbee, (1979). The idea is to replace successively dependent variables by variables that have the same distribution but are independent of the original variables and all other involved variables with the smallest error possible. Berbee, (1979) found the following in the case of $\beta$ -mixing data.

Lemma 5.2 (Berbee, (1979)).

Suppose on a probability space there is defined a pair $(X,Y)$ of random variables with values in Borel spaces. If the probability space is rich enough, it can be extended with a random variable $Y^{\prime}$ , independent of $X$ and distributed as $Y$ such that

[TABLE]

5.2 Auxiliary technical results

Lemma 5.3.

Let Assumption (C2) hold. Then, the kernels $h_{c,k}$ defined in (2.8), $c=1,\dots,m$ have uniform $(2+\delta)$ moments, i.e. there exist $\delta,M_{c}>0$ such that for all $t_{1},\dots,t_{c},k\in{\mathbb{Z}}$ , $1\leq j\leq 2c$ ,

[TABLE]

*where $G_{c}$ , $G_{j,c}^{(1)}$ and $G_{j,c}^{(2)}$ denote the joint distributions of $(X_{t_{(1)}},\dots,X_{t_{(2c)}})$ , $(X_{t_{(1)}},\dots,X_{t_{(j)}})$ and $(X_{t_{(j+1)}},\dots,X_{t_{(2c)}})$ , respectively, with $(t_{(1)},t_{(2)},\dots,t_{(2m)})$ , $t_{(1)}\leq\dots\leq t_{(2c)}$ the sorted version of the vector $(t_{1},t_{1}+k,t_{2},t_{2}+k\,\dots,t_{c},t_{c}+k)$ . *

Lemma 5.4.

Let $\boldsymbol{X}_{t_{j},k}^{*}$ denote an independent and identically distributed copy of $\boldsymbol{X}_{t_{j},k}$ on a possibly richer probability space that is independent of $\boldsymbol{X}_{t_{1},k},\dots,\boldsymbol{X}_{t_{2c},k}$ . Then, for $2\leq c\leq m$ and arbitrary $t_{1},\dots,t_{2c}\in{\mathbb{Z}}$ ,

[TABLE]

*where the latter equation also holds if any other pair $\boldsymbol{X}_{t_{j},k}$ is replaced by an iid copy $\boldsymbol{X}_{t_{j},k}^{*}$ . *

Lemma 5.5.

If Assumptions (C1) – (C3) are satisfied, we have for any fixed $0\leq k\leq\lfloor r_{n}\rfloor$

(1)

For any $t\in{\mathbb{Z}}$ ,

[TABLE]

(2)

If $1\leq t_{1}<t_{2}<\dots<t_{2c}\leq n-k$ and

[TABLE]

we have for any permutation $\gamma$ of $\{1,\dots,2c\}$

(i)

$|\textnormal{\mbox{I\negthinspace E}}[h_{c,k}(\boldsymbol{X}_{t_{\gamma(1)},k},\dots,\boldsymbol{X}_{t_{\gamma(c)},k})h_{c,k}(\boldsymbol{X}_{t_{\gamma(c+1)},k},\dots,\boldsymbol{X}_{t_{\gamma(2c)},k})]|\leq M_{c}^{\frac{2}{2+\delta}}.$ **

(ii)

if $(t_{2}-t_{1})>k$ or $(t_{2c}-t_{2c-1})>k$ ,

[TABLE]

(iii)

if $t_{2}-t_{1}\leq k$ , $t_{2c}-t_{2c-1}\leq k$ , $t_{3}-t_{2}>2k$ and $t_{2c-1}-t_{2(c-1)}>2k$ , then

[TABLE]

5.2.1 Proof of Lemma 5.3

From assumption (C2), we have

[TABLE]

and therefore, by the definition of the Hoeffding decomposition,

[TABLE]

As $h_{c,k}$ is recursively defined by

[TABLE]

$h_{c,k}$ also has uniform $(2+\delta)$ -moments. $\Box$

5.2.2 Proof of Lemma 5.4

Recall the following property of the conditional expectation [see Theorem 6.4 in Kallenberg, (2010)] which can easily be adapted to more than one $\mathcal{F}$ -measurable random variable:

Let $X$ and $Y$ be random variables and $\mathcal{F}$ a $\sigma$ -algebra such that $X$ is $\mathcal{F}$ -measurable and $Y$ is independent of $\mathcal{F}$ . Then, for any measurable function $f(x,y)$ with $\textnormal{\mbox{I\negthinspace E}}|f(X,Y)|<\infty$ ,

[TABLE]

*where $F(x)=\textnormal{\mbox{I\negthinspace E}}[f(x,Y)]$ .

Thus, by the law of total expectation, we have

[TABLE]

Obviously, $\boldsymbol{X}_{t_{2},k},\dots,\boldsymbol{X}_{t_{c},k}$ are $\sigma(\boldsymbol{X}_{t_{2},k},\dots,\boldsymbol{X}_{t_{2c},k})$ -measurable and $\boldsymbol{X}^{*}_{t_{1},k}$ is independent of

$\sigma(\boldsymbol{X}_{t_{2},k},\dots,\boldsymbol{X}_{t_{2c},k})$ . As, additionally, $\textnormal{\mbox{I\negthinspace E}}|h_{c,k}(\boldsymbol{X}^{*}_{t_{1},k},\boldsymbol{X}_{t_{2},k},\dots,\boldsymbol{X}_{t_{c},k})|<\infty$ by Lemma 5.3 we have that

[TABLE]

where

[TABLE]

We will now show that $H_{c-1,k}(\boldsymbol{y_{2}},\dots,\boldsymbol{y_{c}})=0$ . To this end, we consider the integral representation of $h_{c,k}$ , i.e. similarly as in the proof of Theorem 2 in [Lee, (1990), pg. 28], we obtain by the symmetry of $h$ ,

[TABLE]

Integrating both sides with respect to $\boldsymbol{u_{1}}\sim F_{k}$ yields

[TABLE]

Observing that

[TABLE]

we have

[TABLE]

and altogether,

[TABLE]

which concludes the proof. $\Box$

5.2.3 Proof of Lemma 5.5

(1)

If $l>k\geq 0$ replace the pair $\boldsymbol{X}_{t,k}:=\begin{pmatrix}X_{t}\\ X_{t+k}\end{pmatrix}$ using Berbee’s coupling technique by an identically distributed copy $\boldsymbol{X}_{t,k}^{*}$ that is independent of $\boldsymbol{X}_{t,k}$ and $\boldsymbol{X}_{t+l,k}$ and such that

[TABLE]

Then,

[TABLE]

By Hölder’s inequality $(\frac{1}{2+\delta}+\frac{1}{2+\delta}+\frac{1}{\frac{\delta+2}{\delta}}=1)$ we obtain

[TABLE]

which gives

[TABLE]

Now, $h_{1,k}(\boldsymbol{X_{t}^{*}})$ is independent of $h_{1,k}(\boldsymbol{X_{t+l}})$ and the result follows by Lemma 5.4.

If $0\leq l\leq k$ , using Berbee’s coupling technique, first replace $X_{t}$ by an identically distributed copy $X_{t}^{*}$ that is independent of $X_{t+k}$ , $X_{t+l}$ and $X_{t+l+k}$ and such that

[TABLE]

By Hölder’s inequality, we obtain with similar arguments as in the case $l>k\geq 0$ that

[TABLE]

Then, replace $X_{t+l}$ by an independent copy $X_{t+l}^{*}$ that is independent of $X_{t}^{*}$ , $X_{t+k}$ , $X_{t+l}$ and $X_{t+l+k}$ such that

[TABLE]

where we have used Lemma 1 in Eberlein, (1984). Then,

[TABLE]

Finally, replace $X_{t+k}$ by an independent copy $X_{t+k}^{*}$ that is independent of $X_{t}^{*}$ , $X_{t+l}^{*}$ , $X_{t+k}$ and $X_{t+l+k}$ and such that

[TABLE]

which gives

[TABLE]

Altogether,

[TABLE]

Next, observe that the last summand does not vanish as $\begin{pmatrix}X_{t}^{*}\\ X_{t+k}^{*}\end{pmatrix}$ does not have distribution $F_{k}$ . Therefore, using Berbee’s coupling technique, we rereplace $X_{t}^{*}$ by an independent copy $X_{t}^{\circ}$ such that the couple $\begin{pmatrix}X_{t}^{\circ}\\ X_{t+k}^{*}\end{pmatrix}$ has distribution $F_{k}$ , is independent of $\begin{pmatrix}X_{t+l}^{*}\\ X_{t+k+l}\end{pmatrix}$ and

[TABLE]

Hence,

[TABLE]

Observing that $\min\Big{\{}l,k-l\Big{\}}\leq k$ and that the $\beta$ -mixing coefficients are monotone decreasing, we have

[TABLE]

and as $\textnormal{\mbox{I\negthinspace E}}\Big{[}h_{1,k}\begin{pmatrix}X_{t}^{\circ}\\ X_{t+k}^{*}\end{pmatrix}\Big{]}=0$ , we can conclude that

[TABLE]

(2)
(i)

By Hölder’s inequality $(\frac{1}{2+\delta}+\frac{1}{\frac{2+\delta}{1+\delta}}=1)$ and as $\frac{2+\delta}{1+\delta}<2+\delta$ we obtain

[TABLE]

where we have used that $(\textnormal{\mbox{I\negthinspace E}}[|Z|^{p}])^{1/p}\leq(\textnormal{\mbox{I\negthinspace E}}[|Z|^{q}])^{1/q}$ for $0<p\leq q$ . Hence, by Lemma 5.3,

[TABLE]

(ii)

For brevity, we only consider the case $\gamma=\textrm{id}$ . The other cases are treated similarly but require a more complex notation.

In order to prove inequality (5.3), according to the coupling Lemma 5.2 by Berbee, (1979), depending on whether $(t_{2}-t_{1})>(t_{2c}-t_{2c-1})>k$ or $k<(t_{2}-t_{1})\leq(t_{2c}-t_{2c-1})$ , we can choose a random variable $\boldsymbol{X}^{*}_{t_{1},k}$ or respectively $\boldsymbol{X}^{*}_{t_{2c},k}$ that has the same distribution as $\boldsymbol{X}_{t_{1},k}$ or respectively $\boldsymbol{X}_{t_{2c},k}$ , independent of $\boldsymbol{X}_{t_{1},k},\dots,\boldsymbol{X}_{t_{2c},k}$ and such that

[TABLE]

First, consider the case where $(t_{2}-t_{1})>(t_{2c}-t_{2c-1})>k$ , that is we replace $\boldsymbol{X}_{t_{1},k}$ by an independent identically distributed copy $\boldsymbol{X}^{*}_{t_{1},k}$ . Then, by Lemma 5.4,

[TABLE]

Splitting the probability space, we obtain

[TABLE]

The second summand vanishes and for the first summand Hölder’s inequality $(\frac{1}{2+\delta}+\frac{1}{2+\delta}+\frac{1}{\frac{2+\delta}{\delta}}=1)$ yields

[TABLE]

where the latter inequality is due to Lemma 5.2. In the case where $k<(t_{2}-t_{1})\leq(t_{2c}-t_{2c-1})$ , we obtain

[TABLE]

Inequalities (5.6) and ((ii)) together yield result (ii).

(iii)

As in (ii), we only consider the case $\gamma=\textrm{id}$ . Then, if

[TABLE]

replace one after another $X_{t_{1}}$ , $X_{t_{2}}$ , $X_{t_{1}+k}$ and $X_{t_{2}+k}$ according to Lemma 5.2 by independent identically distributed copies $X_{t_{1}}^{\prime}$ , $X_{t_{2}}^{\prime}$ , $X_{t_{1}+k}^{\prime}$ and $X_{t_{2}+k}^{\prime}$ that are independent of the other involved random variables. Denote by $\boldsymbol{X}^{\prime}_{t_{j},k}$ the pair $(X_{t_{j}}^{\prime},X_{t_{j}+k}^{\prime})^{T}$ , where $j=1,2$ . Then, by Lemma 5.3 and similarly as in the proof of part (i)

[TABLE]

where the latter equality is due to the assumption that $(t_{3}-t_{2})>2k$ . Hence,

[TABLE]

Note that the second summand does not necessarily vanish because, having replaced $X_{t_{1}}$ , $X_{t_{1}+k}$ , $X_{t_{2}}$ and $X_{t_{2}+k}$ by independent identically distributed copies one after another, the couple $(X_{t_{j}}^{\prime},X_{t_{j}+k}^{\prime})^{T}$ does not have distribution $F_{k}$ . However, it is possible to bound

[TABLE]

by applying Lemma 5.2 again in order to ”rereplace” successively $\boldsymbol{X}^{\prime}_{t_{1},k}$ and $\boldsymbol{X}^{\prime}_{t_{2},k}$ by independent pairs $\boldsymbol{X}^{\circ}_{t_{1},k}$ and $\boldsymbol{X}^{\circ}_{t_{2},k}$ with distribution $F_{k}$ . More precisely, according to Lemma 5.2 we can replace $X_{t_{1}}^{\prime}$ by a random variable $X_{t_{1}}^{\circ}$ with the same distribution as $X_{t_{1}}^{\prime}$ that is independent of the other involved variables such that the couple $\boldsymbol{X}^{\circ}_{t_{1},k}:=(X_{t_{1}}^{\circ},X_{t_{1}+k}^{\prime})^{T}$ has distribution $F_{k}$ . Then, similarly as in the proof of (ii),

[TABLE]

where we have used that due to the independence of $X_{t_{1}}^{\prime}$ , $X_{t_{1}+k}^{\prime}$ and all other involved variables, $\mathbb{P}(X_{t_{1}}^{\prime}\neq X_{t_{1}}^{\circ})=\beta(k)$ . Analogously, we can replace $X_{t_{2}}^{\prime}$ by a random variable $X_{t_{2}}^{\circ}$ such that the couple $\boldsymbol{X}^{\circ}_{t_{2},k}:=(X_{t_{2}}^{\circ},X_{t_{2}+k}^{\prime})^{T}$ has distribution $F_{k}$ and is independent of $\boldsymbol{X}^{\circ}_{t_{1},k}$ . Then,

[TABLE]

Then, $\boldsymbol{X}^{\circ}_{t_{1},k}$ and $\boldsymbol{X}^{\circ}_{t_{2},k}$ both have distribution $F_{k}$ and are independent. Similar arguments as in the proof of Lemma 5.4 (ii) show that also

[TABLE]

and hence,

[TABLE]

Observing that by the definition of the $\beta$ -mixing coefficients and since $(t_{2}-t_{1})\leq k$

[TABLE]

equations ((iii)) and (5.9) yield

[TABLE]

Analogously, if $\min\{t_{2}-t_{1},(t_{1}+k)-t_{2}\}<\min\{t_{2c}-t_{2c-1},(t_{2c-1}+k)-t_{2c}\}$ , we obtain

[TABLE]

Combining these inequalities yields (iii), which concludes the proof.

$\Box$

5.3 Proofs of results from Section 4

5.3.1 Proof of Lemma 4.1

By Theorem 5.2 of Bradley, (2005), $(V_{t})_{t\in{\mathbb{Z}}}$ is $\alpha$ -mixing with mixing coefficients

[TABLE]

where the latter identity is due to the definition of the processes $(X_{t}^{(j)})_{t\in{\mathbb{Z}}}$ , $j=1,\dots,q$ . Following the arguments in the proof of Lemma 4.1 in Kley, (2014), we obtain the result. $\Box$

5.3.2 Proof of Lemma 4.2

For notational convenience, let $\boldsymbol{X}_{t_{j},k_{j}}:=(X_{t_{j}},X_{t_{j}+k_{j}})^{T}$ . Note that,

[TABLE]

Define

[TABLE]

By the law of the total expectation we obtain

[TABLE]

where the latter inequality follows as

[TABLE]

is $\mathcal{G}_{1}$ -measurable. Moreover, $\boldsymbol{X}_{t_{1},k_{1}}$ is $\mathcal{G}_{1}$ -measurable and $\sigma(\boldsymbol{X}_{t_{1},k_{1}}^{(1,1)},\dots,\boldsymbol{X}_{t_{1},k_{1}}^{(m-1,1)})$ is independent of $\mathcal{G}_{1}$ . From the property of the conditional expectation stated in the proof of Lemma 5.4, it follows that

[TABLE]

Hence, the same arguments as above yield

[TABLE]

Repeating these steps $q-1$ times yields the result.

(i)

Under (C0) we have $h_{1,k}^{\tau}\begin{pmatrix}x_{1}\\ y_{1}\end{pmatrix}=\textnormal{\mbox{I\negthinspace E}}\Big{[}h^{\tau}\Big{(}\begin{pmatrix}x_{1}\\ y_{1}\end{pmatrix},\begin{pmatrix}X_{0}\\ X_{k}\end{pmatrix}\Big{)}\Big{]}$ with

[TABLE]

Note that $\textnormal{\mbox{I\negthinspace E}}[Y_{t}^{(j)}]=0$ under (C0), we obtain from (4.2) that

[TABLE]

The equivalent representation of moments in terms of cumulants yields

[TABLE]

For all $t,k\in{\mathbb{Z}}$ and $l=1,2$ , $\textnormal{\mbox{I\negthinspace E}}[Y_{t}^{(l)}]=0$ and we have

[TABLE]

and

[TABLE]

where $C_{k}$ is the copula associated with $(X_{t},X_{t+k})$ [see e.g. Schmid et al., (2010)] and $\rho(k)$ is the population version of Spearman’s $\rho$ at lag $k$ . Hence,

[TABLE]

and inserting equations ((i)) and ((i)) in ((i)) yield the result.

(ii)

Under (C0) we have $h_{1,k}^{\rho}\begin{pmatrix}x\\ y\end{pmatrix}=\textnormal{\mbox{I\negthinspace E}}\Big{[}h^{\rho}\Big{(}\begin{pmatrix}x\\ y\end{pmatrix},\begin{pmatrix}X_{0}^{(1)}\\ X_{k}^{(1)}\end{pmatrix},\begin{pmatrix}X_{0}^{(2)}\\ X_{k}^{(2)}\end{pmatrix}\Big{)}\Big{]}$ with

[TABLE]

The first order kernel being centered by definition of the Hoeffding decomposition, from (4.2) we know that

[TABLE]

Thus, as $\textnormal{\mbox{I\negthinspace E}}[I(X_{t}^{(i)}<X_{t}^{(j)})-\frac{1}{2}]=0$ under (C0) for any $i,j=1,\dots,5;\,i\neq j$ , we obtain from (4.2) that

[TABLE]

where we have used the representation of centered fourth moments in terms of cumulants, property (v) of Theorem 2.3.1 in Brillinger, (1975) and ((i))

[TABLE]

Furthermore, the only permutations $\gamma$ and $\tilde{\gamma}$ for which not all products of second order cumulants in ((ii)) contain one cumulant with one independent factor and thus equal [math] are those with $\gamma(1)=\tilde{\gamma}(1)=1$ . For each of these $4$ combinations we obtain

[TABLE]

and

[TABLE]

Plugging ((ii)) and ((ii)) into ((ii)) concludes the proof.

$\Box$

5.3.3 Proof of (4.6)

We will prove this result only for positive lags $k$ as the proof for negative lags is analogous. More precisely we consider

[TABLE]

and prove that

[TABLE]

Finally, using that $\inf_{0\leq k\leq\lfloor r_{n}\rfloor}\binom{n-k}{c}\geq Kn^{c}$ for some constant $K$ , establishes (4.6).

For any fixed $0\leq k\leq\lfloor r_{n}\rfloor$ , decompose (5.3.3) into sums according to the following 3 cases:

(1)

all $2c$ indices are different,

(2)

$2c-1$ indices are different or

(3)

$2(c-1)$ or less indices are different,

that is

[TABLE]

In the sequel, denote by $t_{(j)}$ the $j$ -th smallest of all distinct indices among $t_{1},\dots,t_{2c}$ .

In case (1) we distinguish the following cases:

(1.1)

$(t_{(2)}-t_{(1)})>k$ or $(t_{(2c)}-t_{(2c-1)})>k$ .

(1.2)

$(t_{(2)}-t_{(1)})\leq k$ and $(t_{(2c)}-t_{(2c-1)})\leq k$ .

In case (1.1), consider the set

[TABLE]

and observe that $\#\mathcal{S}_{k}^{(1.1)}(v)\leq(v+k)n^{2(c-1)}$ , where $\#S$ denotes the cardinality of the set $S$ .

Then, we obtain from Lemma 5.5 (2) (ii), for some constant $K$ ,

[TABLE]

where we have bounded $\sum_{v=1}^{n}v\beta^{\frac{\delta}{2+\delta}}(v)$ from above by an integral and then concluded with Assumption (C3).

Next, in case (1.2),

[TABLE]

For (I), we apply similar arguments as in the proof of Lemma 5.5 (2) (iii). That is, if we replace one after another all random variables by independent copies and then rereplace them by independent pairs with cdf $F_{k}$ . We have for any permutation $\gamma$ of $\{1,\dots,2c\}$ ,

[TABLE]

for a constant $K_{c}$ depening only on $c$ . Next, let $u(t_{(1)},\dots,t_{(2c)}):=\min_{\begin{subarray}{c}i,j=1,\dots,2c\\ i\neq j\end{subarray}}\{|t_{(j)}-t_{(i)}|,|(t_{(j)}+k)-t_{(i)}|\}$ which is always smaller than $k$ and consider the set

[TABLE]

Then, for some constant $K$ ,

[TABLE]

since $\sup\limits_{v=0,\dots,k}\#\mathcal{S}_{k}^{(I)}(v)\leq 2r_{n}^{2}n^{2c-3}$ . Hence, $(I)=O(r_{n}^{2}n^{2c-3})=o(n^{2c-1-\theta}).$ From Lemma 5.5 (2) (iii) we know that

[TABLE]

Consider the set

[TABLE]

with $\#\mathcal{S}_{k}^{(II)}(v)\leq 2(v+1)n^{2c-2}$ . Then, for some constant $K$ ,

[TABLE]

and hence, $(II)=O(r_{n}^{1-\theta}n^{2c-2})=O(n^{2c-1-\theta}).$

Therefore, in case (1.2) we have

[TABLE]

Combining equations (5.3.3) and (5.3.3) yields

[TABLE]

which concludes the consideration of case (1).

In case (2), we encounter the following situations:

(2.1)

the index appearing twice is not $t_{(1)}$ or $t_{(2)}$ .

(2.2)

the index appearing twice is $t_{(1)}$ or $t_{(2)}$ .

Then,

[TABLE]

In case (2.1), consider the following situations:

(a)

$t_{(2)}-t_{(1)}>k$

(b)

$t_{(2)}-t_{(1)}\leq k$ and $t_{(3)}-t_{(2)}>2k$

(c)

$t_{(2)}-t_{(1)}\leq k$ and $t_{(3)}-t_{(2)}\leq 2k$

In situation (a), similarly as in the proof of Lemma 5.5 (ii), we replace the pair with the smallest index $\boldsymbol{X}_{t_{(1)},k}$ by an independent copy in order to bound the summand from above by $2M^{\frac{2}{2+\delta}}\beta^{\frac{\delta}{2+\delta}}(t_{(2)}-(t_{(1)}+k))$ . Hence, by assumption (C3),

[TABLE]

Next, in situation (b), with similar arguments as in the proof of Lemma 5.5 (iii), we replace one after another $X_{t_{(1)}}$ , $X_{t_{(2)}}$ , $X_{t_{(1)}+k}$ and $X_{t_{(2)}+k}$ by independent copies and obtain with assumption (C3),

[TABLE]

In situation (c), we use Lemma 5.5 (i) and the fact that in this case the number of summands is of order $O(r_{n}^{2}n^{2c-3})$ , that is

[TABLE]

Therefore,

[TABLE]

Since in case (2.2), the index appearing twice is $t_{(1)}$ or $t_{(2)}$ , the indices $t_{(2c-2)}$ and $t_{(2c-1)}$ appear only once. Thus the case can be handled by the similar arguments as case (2.1), i.e. we obtain

[TABLE]

Cases (2.1) and (2.2) yield

[TABLE]

which concludes the consideration of case (2).

In case (3) observe that the number of summands is of order $O(n^{2(c-1)})$ , such that together with Lemma 5.5 (2) (i) we can conclude that

[TABLE]

Finally, combining equations (5.20), (5.3.3) and (5.3.3) yields the result. $\Box$

5.3.4 Proof of (4.9)

We have by (4.3) that

[TABLE]

Next,

[TABLE]

Similar arguments as in the proof of (4.4) yield

[TABLE]

where we have used that $|\frac{1}{n-r_{n}}-\frac{1}{n}|=O\Big{(}\frac{r_{n}}{n^{2}}\Big{)}$ and $\sup_{|k|\leq r_{n}}\textnormal{\mbox{I\negthinspace E}}|\sum_{t\in\mathcal{T}_{k}}h_{1,k}^{\xi}(\boldsymbol{X}_{t,k})|=O(n^{1/2})$ . Next,

[TABLE]

Note that by the stationarity of the process $\{X_{t}\}_{t\in{\mathbb{Z}}}$ ,

[TABLE]

Similarly as for $A_{n}$ we obtain

[TABLE]

and analogously,

[TABLE]

Altogether,

[TABLE]

This concludes the proof of (4.9). $\Box$

Bibliography29

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Ahdesmäki et al., (2005) Ahdesmäki, M., Lähdesmäki, H., Pearson, R., Huttunen, H., and Yli-Harja, O. (2005). Robust detection of periodic time series measured from biological systems. BMC bioinformatics , 6(1):1.
2Anderson, (1971) Anderson, T. W. (1971). The Statistical Analysis of Time Series . John Wiley and Sons, Inc.
3Arcones and Yu, (1994) Arcones, M. A. and Yu, B. (1994). Central limit theorems for empirical and u-processes of stationary mixing sequences. Journal of Theoretical Probability , 7(1):47–71.
4Berbee, (1979) Berbee, H. C. (1979). Random walks with stationary increments and renewal theory. MC Tracts , 112:1–223.
5Birr et al., (2014) Birr, S., Volgushev, S., Kley, T., Dette, H., and Hallin, M. (2014). Quantile spectral analysis for locally stationary time series. ar Xiv preprint ar Xiv:1404.4605 .
6Blomqvist, (1950) Blomqvist, N. (1950). On a measure of dependence between two random variables. The Annals of Mathematical Statistics , 21(4):593–600.
7Bradley, (2005) Bradley, R. C. (2005). Basic properties of strong mixing conditions. a survey and some open questions. Probability Surveys , 2:107–144.
8Brillinger, (1975) Brillinger, D. R. (1975). Time series: Data Analysis and Theory . Holt, Rinehart and Winston, Inc.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Fourier analysis of serial dependence measures

Abstract

1 Introduction

2 Examples of U-lag-window spectral densities and their estimators

Example 2.1**.**

3 Asymptotic theory for U-lag-window estimates

3.1 Consistency of U-lag-window estimates

Theorem 3.1**.**

3.2 Asymptotic distribution of U-lag-window estimates

Remark 3.2**.**

Theorem 3.3**.**

Theorem 3.4**.**

Remark 3.5**.**

4 Proofs

4.1 Proof of Theorem 3.1

4.1.1 Proof of (4.1)

4.1.2 Proof of (4.3)

4.1.3 Proof of (4.4)

4.2 Proof of Theorem 3.4 and 3.3 - main arguments

4.3 Proof of (4.12) – (4.15)

Lemma 4.1**.**

Lemma 4.2**.**

4.3.1 Proof of (4.12)

4.3.2 Proof of (4.13)

4.3.3 Proof of (4.14) and (4.15)

5 Appendix: technical details

5.1 Blocking results for stationary β\betaβ-mixing processes

Lemma 5.1**.**

Lemma 5.2** (Berbee, (1979)).**

5.2 Auxiliary technical results

Lemma 5.3**.**

Lemma 5.4**.**

Lemma 5.5**.**

5.2.1 Proof of Lemma 5.3

5.2.2 Proof of Lemma 5.4

5.2.3 Proof of Lemma 5.5

5.3 Proofs of results from Section 4

5.3.1 Proof of Lemma 4.1

5.3.2 Proof of Lemma 4.2

5.3.3 Proof of (4.6)

5.3.4 Proof of (4.9)

Example 2.1.

Theorem 3.1.

Remark 3.2.

Theorem 3.3.

Theorem 3.4.

Remark 3.5.

Lemma 4.1.

Lemma 4.2.

5.1 Blocking results for stationary $\beta$ -mixing processes

Lemma 5.1.

Lemma 5.2 (Berbee, (1979)).

Lemma 5.3.

Lemma 5.4.

Lemma 5.5.