Fourier analysis of serial dependence measures
Ria van Hecke, Stanislav Volgushev, Holger Dette

TL;DR
This paper explores new frequency domain methods for analyzing serial dependence using U-statistics-based measures like Kendall's tau, revealing unique asymptotic properties and behaviors.
Contribution
It introduces a novel spectral analysis approach replacing auto-covariances with U-statistics dependence measures, expanding spectral analysis tools.
Findings
Kendall's tau-based spectral density exhibits surprising limiting variance behavior
Asymptotic properties of new frequency domain methods are characterized
Alternative dependence measures can be effectively used in spectral analysis
Abstract
Classical spectral analysis is based on the discrete Fourier transform of the auto-covariances. In this paper we investigate the asymptotic properties of new frequency domain methods where the auto-covariances in the spectral density are replaced by alternative dependence measures which can be estimated by U-statistics. An interesting example is given by Kendall{'}s , for which the limiting variance exhibits a surprising behavior.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Fourier analysis of serial dependence measures
Ria Van Hecke
Ruhr-Universität Bochum
Fakultät für Mathematik
44780 Bochum
Germany
Stanislav Volgushev
University of Toronto
Department of Statistical Sciences
Toronto, Ontario M5S 3G3
Canada
Holger Dette
Ruhr-Universität Bochum
Fakultät für Mathematik
44780 Bochum
Germany
Abstract
Classical spectral analysis is based on the discrete Fourier transform of the auto-covariances. In this paper we investigate the asymptotic properties of new frequency domain methods where the auto-covariances in the spectral density are replaced by alternative dependence measures which can be estimated by U-statistics. An interesting example is given by Kendall’s , for which the limiting variance exhibits a surprising behavior.
Keywords and Phrases: Spectral theory, strictly stationary time series, -statistics
AMS subject classification: 62M15, 62G20
1 Introduction
Over the years spectral analysis has developed into a fundamental important tool kit in the analysis of data from a stationary time series . The spectral density, defined as the discrete Fourier transform of the auto-covariances, provides a convenient way to characterize the second order properties of a stationary sequence. Estimation of the spectral density is usually performed by smoothing the periodogram, that is the discrete Fourier transform of empirical auto-covariances [see for example Chapters 4 and 10 of Brockwell and Davis, (1987)].
It is well known that this approach is not able to capture non-linear features of time series dynamics such as changes in skewness, kurtosis or dependence in the extremes. This motivated numerous authors to describe serial dependence by considering spectral densities corresponding to a family of transformations of the original time series [see Hong, (1999, 2000), Li, (2008, 2012), Hagemann, (2013), Dette et al., (2015), Birr et al., (2014), Davis et al., (2013), Kley et al., (2016)]. Roughly speaking, these authors suggest to define a family of spectral densities, say , where the auto-covariances (at lag ) are replaced by functionals of the lag -distributions . This approach is attractive as it allows a more complete description of the serial dependence. The price for this flexibility is the calculation of a family of spectral densities, in contrast to the classical approach, which uses only one spectral density calculated as the discrete Fourier transform of the auto-covariances.
In the present paper we investigate a class of alternative spectral densities, which keeps the simplicity of the classical spectral theory but eliminates some drawbacks arising from the use of auto-covariances in its definition. More precisely, we consider general spectral densities of the form
[TABLE]
where for each the quantity denotes a dependence measure between the random variables and (in the classical case Cov) and we implicitly assumed that . Spectral densities of the form (1.1) have been considered by Ahdesmäki et al., (2005), Zhou, (2012) and Carcea and Serfling, (2015), who replaced the lag auto-covariance by other measures of dependence such as Kendall’s , distance correlation, or L-moments. A thorough theoretical analysis of this idea for dependence measures that can be represented as linear functionals of the empirical copula at lag was conducted in Kley et al., (2016). Their analysis includes dependence measures such as Spearman’s rank autocorrelation [see Wald and Wolfowitz, (1943)], Blomqvist’s beta [see Blomqvist, (1950)] and Gini’s rank association coefficient [see Schechtman and Yitzhaki, (1987)]. However, the theory depends crucially on the linearity of the corresponding functional and cannot be generalized to other dependence measures. A particularly interesting dependence measure that is not covered by the analysis of Kley et al., (2016) is Kendall’s tau which can be represented as by
[TABLE]
where denotes the copula corresponding to lag . Note that Kendall’s tau is a non-linear functional of the lag copula. A classical approach to the estimation of Kendall’s tau is based on the representation
[TABLE]
where are independent copies with the same distribution as . Motivated by this example we are interested in the statistical properties of estimators of spectral densities of the form (1.1) with a measure of lag dependence that can be represented as
[TABLE]
where are independent copies of and is a symmetric kernel of order . The representation (1.3) motivates to estimate by a -statistic, say , and to form the corresponding -lag-window estimate
[TABLE]
where are given weights. In Section 2 we will introduce the necessary notation and illustrate the general approach by several examples. The main results of the paper can be found in Section 3, where we investigate the asymptotic properties of the new estimates. In particular we prove consistency of the estimate (1.4) for a broad class of kernels and establish its asymptotic normality for several important cases including Kendalls . Interestingly the asymptotic variance of the -lag-window estimate based on Kendall’s tau depends on the spectral density (1.1) where the quantities are the lag Spearman’s rho correlations. The proofs are very involved and will be deferred to Section 4, while more technical arguments can be found in Section 5.
2 Examples of U-lag-window spectral densities and their estimators
Throughout this paper let be a real-valued process and denote by and the marginal distribution function of and the distribution function of the pair , respectively (). Recall the definition of the spectral density in (1.1), where the measure of dependence (at lag ) has the representation (1.3) for a given kernel of order . Throughout this paper, we will maintain the following assumption
- (C0)
The process is strictly stationary. The functions and are continuous (for all ) and .
Let be the finite stretch of this process representing the observed data and define for any the set \mathcal{T}_{k}:=\big{\{}t|t,t+k\in\{1,\dots,n\}\big{\}}. In the following example we illustrate how different kernels yield different measures of dependence and as consequence different spectral densities.
Example 2.1**.**
- (i)
If and h\big{(}({x_{1}},{y_{2}})^{T},({x_{2}},{y_{2}})^{T}\big{)}=\frac{1}{2}(x_{1}-x_{2})(y_{1}-y_{2}), then the representation (1.3) gives the auto-covariance at lag , that is
[TABLE]
and we obtain the classical spectral density.
- (ii)
If , denotes the indicator function and the kernel is defined by
[TABLE]
the representation (1.3) yields Kendall’s at lag , that is
[TABLE]
The corresponding spectral density will be denoted by
[TABLE]
As the distribution function and are assumed to be continuous, can also be represented in the form (1.3) using the kernel
[TABLE]
- (iii)
If , denotes the set of all permutations of and the kernel is defined by
[TABLE]
we obtain the (lag ) population version of Spearman’s , that is
[TABLE]
The corresponding spectral density will be denoted by
[TABLE]
Given continuity of and , can also be represented in the form (1.3) using the following kernel
[TABLE]
In the remaining part of the manuscript we estimate the dependence measures (at lag ) by a -statistic of order , that is
[TABLE]
Estimates of corresponding spectral densities are defined as in (1.4). The asymptotic properties of such spectral density estimates are investigated in the following section.
Before proceeding, we recall the Hoeffding decomposition for U-statistics. Recall that is a symmetric kernel of order and let denote independent identically distributed copies of . We now recursively define kernels by
[TABLE]
where denotes the distribution of the Dirac measure at . If
[TABLE]
is the U-statistic based on the kernel we obtain for the statistic in (2.7) the decomposition [see, for example Lee, (1990)]
[TABLE]
which will be an important tool in the asymptotic analysis of the following sections.
3 Asymptotic theory for U-lag-window estimates
3.1 Consistency of U-lag-window estimates
Our first main result shows that for a general class of symmetric kernels the statistic consistently estimates the spectral density defined in (1.1) if the following assumptions are satisfied.
- (C1)
The lag window can be written in the form w_{n}(k)=w\big{(}\frac{k}{r_{n}}\big{)}, where is a uniformly continuous function, supported on the interval , satisfying , , for all , and for some .
- (C2)
There exist constants such that for all , ,
[TABLE]
where , and denote the joint distributions of , and , respectively, and is the order statistic of .
- (C3)
The process is -mixing and for some with -mixing coefficients satisfying where .
Theorem 3.1**.**
If Assumptions (C0) – (C3) are satisfied, we have for any fixed
[TABLE]
3.2 Asymptotic distribution of U-lag-window estimates
In this section we establish asymptotic normality of the spectral density estimators. Throughout this section we focus our attention on settings where is Kendall’s or Spearman’s . Recalling the discussion in Example 2.1 it follows that und can be estimated by the -statistics
[TABLE]
and
[TABLE]
respectively. Note that these -statistics have bounded kernels, satisfy Assumption (C2) and can be written as a product or sum of products of two centered functions of random variables. This special structure is crucial for obtaining the asymptotic distribution results given below. It is not clear if similar results hold without imposing this kind of structure on the kernel .
Throughout this section we write if assumptions or results are the same for both Kendall’s and Spearman’s . On the other hand we explicitly write or if the results or arguments are different. For example, from (2.9) we obtain for Kendall’s and Spearman’s the decomposition
[TABLE]
and therefore only appears in the formula. We will demonstrate that under suitable assumptions the term corresponding to the linear part converges to a normal distribution and that the term corresponding to the degenerate part is asymptotically negligible. In what follows we assume that (C0) – (C3) hold and impose the following additional conditions.
- (N1)
The process is -mixing with corresponding -mixing coefficients satisfying where .
- (N2)
For the lag window generator there exists a ’characteristic exponent’ being the largest integer such that
[TABLE]
exists, is finite and non-zero. For this we have .
- (N3)
where \theta=\min\Big{\{}\frac{2(\delta-\delta^{\prime})}{\delta^{\prime}(2+\delta)},1\Big{\}} and are from conditions (C2),(C3).
Remark 3.2**.**
The summability condition in assumption (N2) implies the existence of the ’generalized derivative’ of
[TABLE]
and can thus be interpreted as a smoothness condition; note that for even this coincides with the usual ’th order derivative. The other part of assumption (N2) places mild restrictions on the lag-window generator for which the rate of the scale parameter is limited by assumption (N3). Note that (N3) is satisfied for scale parameters leading to optimal asymptotic mean squared error rates (see Remark 3.5). * *
We begin by examining the asymptotic distribution of .
Theorem 3.3**.**
Assume that conditions (C0) – (C3) and (N1) – (N3) are satisfied and that . If , then
[TABLE]
where
[TABLE]
and
[TABLE]
If , we have
[TABLE]
Interestingly, the limiting distribution has exactly the same form as the limiting distribution for the usual spectral density where corresponds to covariance. This is remarkable, since Spearman’s is based on covariances of ranks. Asymptotic normality of was also obtained in Kley et al., (2016) under a different set of assumptions on the serial dependence and using a completely different set of proof techniques. Specifically, their results require dependence to decay exponentially. The next result establishes asymptotic normality of with corresponding to Kendall’s . The asymptotic distribution of cannot be obtained from the findings in Kley et al., (2016) (under any assumptions) since Kendall’s is a non-linear functional of the copula.
Theorem 3.4**.**
Assume that conditions (C0) – (C3) and (N1) – (N3) are satisfied and that . If , then
[TABLE]
where
[TABLE]
and
[TABLE]
If , we have
[TABLE]
It is remarkable that the asymptotic variance of estimator depends on the spectral measure obtained from Spearman’s (provided that ). This is in sharp contrast to the finding in Theorem 3.3 and spectral density estimation based on covariances. The results in Theorem 3.4 provide an asymptotic analysis of the estimator introduced in Ahdesmäki et al., (2005). We conclude this section by commenting on the optimal choice of window length .
Remark 3.5**.**
Both theorems allow to determine the scale parameter such that the asymptotic mean squared error is minimized. To be precise, define . Then the asymptotic mean squared error takes the form
[TABLE]
Assuming that , we obtain that this expression is minimized for
[TABLE]
Note that for the asymptotic MSE is of the order . In that case the above scale parameter is of order and satisfies Assumptions (C1) and (N3) if the mixing coefficients decay sufficiently quickly. More precisely, as for Kendall’s and Spearman’s the kernels of the -statistic are bounded, we can choose in Assumption (C2) arbitrarily large. Assumption (N3) is satisfied if 1/5<\theta=\min\Big{\{}\frac{2(\delta-\delta^{\prime})}{\delta^{\prime}(2+\delta)},1\Big{\}}, which is equivalent to . Since is arbitrarily large, we can choose any and (N3), (C3) will hold if for some . * *
4 Proofs
4.1 Proof of Theorem 3.1
We first illustrate the main steps in the proofs. These rely on several delicate bounds, which will be shown below. Rearranging sums in (1.4) and using assumption (C1), the U-lag-window estimate can be decomposed as follows
[TABLE]
where
[TABLE]
We will show in Section 4.1.1 that
[TABLE]
For a proof of we use the Hoeffding decomposition (2.9), which gives
[TABLE]
where
[TABLE]
The assertion of the theorem now follows from the estimates
[TABLE]
which are shown in Section 4.1.2 and 4.1.3, respectively.
4.1.1 Proof of (4.1)
Using the fact that and \sup_{|k|<n}\big{(}\big{|}w\big{(}\frac{k}{r_{n}}\big{)}\big{|}+1\big{)}\leq 2\quad\forall\,n\in\mathbb{N}, we obtain for any fixed
[TABLE]
As the lag window generator is continuous at [math] we obtain for any fixed
[TABLE]
As inequality (4.5) holds for all , we can conclude that , as the ’s are absolutely summable. By the same argument, we also have for .
4.1.2 Proof of (4.3)
The proof is based on an extension to lagged data of a covariance inequality by Yoshihara, (1976). More precisely, we prove in the technical Appendix, Section 5.3.3, that for fixed
[TABLE]
where \theta=\min\Big{\{}\frac{2(\delta-\delta^{\prime})}{\delta^{\prime}(2+\delta)},1\Big{\}}. Note that the above bound holds uniformly over a growing number of lags while the result in Yoshihara, (1976) only holds for a fixed . Observe that,
[TABLE]
where the constant does not depend on . Consequently, equation (4.6) yields which establishes (4.3).
4.1.3 Proof of (4.4)
Introduce the notation . We only consider positive lags , negative can be treated analogously. Similar arguments as in the proof of (4.3) yield
[TABLE]
Next,
[TABLE]
As for , we have
[TABLE]
which gives
[TABLE]
The following bound will be established in Lemma 5.5 in the Appendix (see Section 5.2.3)
[TABLE]
Thus
[TABLE]
By assumption (C3) . Therefore, we have
[TABLE]
Equations (4.7) and (4.8) yield and the assertion follows observing that .
4.2 Proof of Theorem 3.4 and 3.3 - main arguments
In the following proof we write if the results hold for general dependence measures that fulfill the assumptions (C0) – (C3) and (N1) – (N3). Otherwise we explicitly write or .
Under Assumption (N3) and with (4.3),
[TABLE]
Furthermore, in Section 5.3.4 we will prove that
[TABLE]
where \tilde{f}_{n,\xi}(\omega)=\frac{1}{2\pi}\sum_{|k|\leq r_{n}}w\Big{(}\frac{k}{r_{n}}\Big{)}\Big{\{}\xi_{k}+\frac{m}{n}\sum_{t=1}^{n}h_{1,k}^{\xi}(\boldsymbol{X}_{t,k})\Big{\}}e^{-ik\omega}. Then,
[TABLE]
where, by construction, the random variables form a triangular array of -mixing random variables with mixing coefficients . To prove the asymptotic normality, we will apply the blocking technique described in Section 5.1. That is, we choose blocks of length and blocks of length such that
[TABLE]
According to Assumptions (C1), (C3) and (N3) one possible choice is , , . Then we decompose
[TABLE]
Next, we show that the remaining part and the part corresponding to the “small” blocks are negligible whereas the “big” blocks satisfy the Lyapunov condition and yield the asymptotic variance. Observe that
[TABLE]
for all , and hence, is real and symmetric in . To prove (4.11) observe that by stationarity of we have that and and hence, , . Consequently, we obtain in the case of Kendall’s and Spearmans’s , which yields
[TABLE]
Moreover, we will show in section 4.3 that for any with , and ,
[TABLE]
[TABLE]
Next, the last summand in (4.10) contains at most summands. Hence, by (4.12) and (4.13) we have,
[TABLE]
that is \sum_{t\in\mathcal{R}}W_{n,t}^{\xi}(\omega)=o_{\mathbb{P}}\Big{(}\sqrt{\frac{r_{n}}{n}}\Big{)}. Next, we show that the sum over the small blocks is negligible. By Lemma 5.1 with the function we obtain
[TABLE]
where denote the random variables of the independent block sequence corresponding to the -blocks. By the assumptions on and the term on the right hand side in the above expression converges to 0. Observing that the variables are centered and (4.12) or respectively (4.13) applied to the independent blocks yields
[TABLE]
where we have used the definition of and the assumption that . Hence, it remains to prove that converges weakly. Note that for any measurable set , by Lemma 5.1 with function and the assumptions on and , we have
[TABLE]
In order to prove the convergence in distribution of , it suffices to show that the triangular array of independent random variables
[TABLE]
satisfies the Lyapunov condition. To achieve this, we show that together with (4.12) or (4.13), respectively,
[TABLE]
and if ,
[TABLE]
for some constant and sufficiently large. Hence, the Lyapunov condition is satisfied as and we can conclude that the distribution of , where is either Kendall’s or Spearman’s , converges weakly to a normal distribution, i.e.
[TABLE]
If , it follows from equations (4.12) and (4.13) that
[TABLE]
and hence, by (4.9),
[TABLE]
Finally, we have, for representing either or ,
[TABLE]
Hence, the bias is given by
[TABLE]
Next, choose some such that . Then,
[TABLE]
By Assumption (N2), for and therefore,
[TABLE]
For the second term in (4.2) as by Assumption (N2), is finite for we obtain
[TABLE]
where is bounded since is bounded and the limit for exists by Assumption (N2). Finally,
[TABLE]
where the first summand is of order since and is absolutely summable. The second summand is of order as is finite. Hence,
[TABLE]
Conclude applying Slutsky’s theorem. It remains to prove (4.12), (4.13), (4.14) and (4.15). Detailed proofs of these results are given in the remaining part of this section.
4.3 Proof of (4.12) – (4.15)
The proofs of Theorem (4.12) – (4.15) rely on two auxiliary results. These will be stated in this section whereas their detailed proof is deferred to the Appendix. The first Lemma bounds cumulants through -mixing coefficients, see Section 5.3.1 for a proof.
Lemma 4.1**.**
For , let be independent copies of a strictly stationary polynomially -mixing process that are independent. For any , let . Then, for , and measurable sets , there exists a constant such that
[TABLE]
The Lemma that follows is a key observation which makes it possible to use theory from classical spectral density estimation in the case where the kernel can be written as a sum of a product of centered functions of random variables. This is a crucial insight for proving asymptotic normality of the estimators .
Lemma 4.2**.**
Let denote a -statistic of order and assume that , are independent copies of a strictly stationary process that are mutually independent. Then, for , ,
[TABLE]
where
[TABLE]
[TABLE]
In particular, if , are independent copies of the strictly stationary process that are independent of each other,
- (i)
for Kendall’s
[TABLE]
where (Y_{t}^{(j)})_{t\in{\mathbb{Z}}}:=\Big{(}I(X_{t}<X_{t}^{(j)})-\frac{1}{2}\Big{)}_{t\in{\mathbb{Z}}}, .
- (ii)
for Spearman’s
[TABLE]
where denotes the set of all permutations of .
Lemma 4.2 is proved in Section 5.3.2.
4.3.1 Proof of (4.12)
Let and be independent copies of the strictly stationary -mixing process that are independent of each other. Define processes and by
[TABLE]
Note that the processes and are strictly stationary.
By Lemma 4.2 (i) we have for Kendall’s ,
[TABLE]
Next, let and define for ,
[TABLE]
Then, similar arguments as in the proof of Lemma 4.2 yield
[TABLE]
and consequently,
[TABLE]
where
[TABLE]
[TABLE]
We will now derive upper bounds for and separately. First, as , and , we have
[TABLE]
where the latter inequality follows by the strict joint stationarity of the involved processes. Next, observe that, from Theorem 2.3.1 in Brillinger, (1975), it follows that
[TABLE]
where , and . Furthermore, let and consider the set
[TABLE]
whose cardinality is . Hence, applying Lemma 4.1 with yields
[TABLE]
where we used Assumption (N1) for the last estimate and are the mixing coefficients of the process . Therefore,
[TABLE]
Next,
[TABLE]
and observing that contains summands, we obtain by assumption (N2),
[TABLE]
Analogously, D_{2,n}^{(2)}=O\Big{(}\frac{r_{n}^{2}}{n^{2}}\Big{)} and hence,
[TABLE]
Then, equations (4.20) and (4.21) together yield
[TABLE]
Next, observe that , , is eight times the classical centered lag-window estimator of the spectral density of the stationary process based on the observations . Consequently, if we show that
[TABLE]
the same arguments as given in the proof of Theorem 9.3.4 in Anderson, (1971) yield
[TABLE]
Finally, (4.23) can be proved using similar arguments and equation (4.24) together with equation (4.22) conclude the proof of (4.12).
4.3.2 Proof of (4.13)
Let , be independent copies of the strictly stationary process that are independent of each other. Then, by Lemma 4.2 (ii) for Spearman’s ,
[TABLE]
where denotes the set of all permutations of .
Next, let and define
[TABLE]
and
[TABLE]
Then, similarly as in the proof of Lemma 4.2,
[TABLE]
and analogous arguments as for Kendall’s give
[TABLE]
Next, similar arguments as were used in order to derive (4.20) yield
[TABLE]
and the same arguments as in the proof of Theorem 9.3.4 in Anderson, (1971) show
[TABLE]
Hence, equation (4.28) together with equation (4.27) conclude the proof of (4.13).
4.3.3 Proof of (4.14) and (4.15)
By Lemma 4.2, we know that for Kendall’s tau
[TABLE]
where (Y_{t}^{(j)})_{t\in{\mathbb{Z}}}=\Big{(}I(X_{t}<X_{t}^{(j)})-\frac{1}{2}\Big{)}_{t\in{\mathbb{Z}}}. Therefore, we can write
[TABLE]
where \vartheta^{\tau}_{t_{l}}=\frac{1}{2\pi}\sum_{|k_{l}|\leq r_{n}}w\Big{(}\frac{k_{l}}{r_{n}}\Big{)}e^{-ik_{l}\omega}\frac{2}{n}[4Y_{t_{l}}^{(l)}Y_{t_{l}+k_{l}}^{(l)}-\tau_{k_{l}}], . Observing that by construction , we express the fourth moment in terms of a fourth order cumulant and 3 products of second order cumulants, that is
[TABLE]
Note that by construction for all , . Therefore, each of the second order cumulants is equal to
[TABLE]
where we have used a similar argument as in equation (4.3.3). Hence, we obtain by Theorem 2.3.1 in Brillinger, (1975)
[TABLE]
Following the arguments of Rosenblatt, (1984) on pages 1177-1178, we can express the fourth order cumulant of products in terms of cumulants of the factors, i.e. we obtain
[TABLE]
where the latter sum extends over all indecomposable partitions of the table
[TABLE]
In order to bound this sum, we need that for and
[TABLE]
with and measurable sets . This follows by Lemma 4.1 and Assumption (N1):
[TABLE]
where and have been introduced in the proof of (4.12).
Next, arguments as in Rosenblatt, (1984) on page 1177–1178 yield
[TABLE]
and together with (4.12) we obtain
[TABLE]
Furthermore, as , by (4.12)
[TABLE]
for some constant and sufficiently large. This yields (4.14) and (4.15) in the case where is Kendall’s .
In the case where is Spearman’s we have by Lemma 4.2
[TABLE]
where
[TABLE]
Observing that by construction , we have similarly as for Kendall’s
[TABLE]
where,
[TABLE]
Following the arguments of Rosenblatt, (1984) on pages 1177-1178, we express the fourth order cumulants of products of random variables in terms of cumulants of the factors which, similarly as in (4.30), can be bounded by Lemma 4.1 for and . After that, arguments as in Rosenblatt, (1984) yield
[TABLE]
and together with (4.13) we obtain \sum_{j=1}^{\mu_{n}}\textnormal{\mbox{I\negthinspace E}}\big{[}\big{(}\sum_{t\in\Gamma_{j}}\zeta^{\rho}_{n,t}(\omega)\big{)}^{4}\big{]}=O\big{(}\frac{\mu_{n}p_{n}^{2}r_{n}^{2}}{n^{4}}\big{)}. Furthermore, as , by (4.13),
[TABLE]
for some constant and sufficiently large. This concludes the proof.
Acknowledgements. The authors would like to thank Martina Stein, who typed parts of this manuscript with considerable technical expertise. This work has been supported in part by the Collaborative Research Center “Statistical modeling of nonlinear dynamic processes” (SFB 823, Teilprojekt A1, C1) of the German Research Foundation (DFG).
5 Appendix: technical details
The proofs of Theorems 3.3 and 3.4 rely on a blocking technique which will be summarized in Section 5.1. In Section 5.2 we state and prove the covariance inequalities that are crucial in order to derive the convergence of the linear and degenerate parts of the U-lag-window estimate. Finally, in Section 5.3 we provide the details for the proofs of results and equations given in Section 4.
For simplicity of notation, let .
5.1 Blocking results for stationary -mixing processes
In order to transfer classical results from the iid case to sums of -mixing stationary time series, we apply a blocking technique with alternate ”large” blocks of size and ”small” blocks of size from Arcones and Yu, (1994) based on a blocking technique introduced by Yu, (1994) with blocks of equal size . For each fixed , we divide the original sequence into blocks of size alternating with blocks of size and a remainder block of length . The block size of the ”small” blocks is chosen depending on the mixing conditions on and the size of . That is, is chosen large enough such that the blocks are ”almost” independent of each other, but small enough such that the sequence composed of these blocks behaves similarly to the original mixing sequence. The block size is chosen analogously. More precisely, we assume that
[TABLE]
and define
[TABLE]
We denote the random variables of belonging to block , or by
[TABLE]
respectively. This yields a sequence of alternating and blocks
[TABLE]
We then construct a one-dependent sequence of independent blocks defined as
[TABLE]
and independent of the original sequence . Furthermore, the blocks , and are identically distributed as the corresponding blocks in the sequence , i.e.
[TABLE]
The existence of a proper measurable space that hosts both sequences, and the independent block sequence , as well as measurability issues on this space are adressed in Yu, (1994). Denote by and block sequences corresponding to the blocks and by and the block sequences corresponding to the blocks, e.g.
[TABLE]
Note that we choose the block size such that the dependence between the blocks of the original -mixing sequence becomes weaker as increases. The next lemma is a slightly adapted version of Lemma 4.1 in Yu, (1994) and is proven analogously. It shows that the or, respectively, blocks of the original sequence can be related to the or, respectively, blocks of the independent block sequence in the following way.
Lemma 5.1**.**
Denote by and be the distributions of and , respectively. Then, for any measurable function on with ,
[TABLE]
Similarly, if and denote the distributions of and , respectively, and if is a measurable function on with , then
[TABLE]
In order to establish the convergence in probability of the parts of the U-lag-window estimate corresponding to the linear and degenerate part in the Hoeffding decomposition we prove several covariance inequalities for -mixing data. To this end we apply a coupling technique by Berbee, (1979). The idea is to replace successively dependent variables by variables that have the same distribution but are independent of the original variables and all other involved variables with the smallest error possible. Berbee, (1979) found the following in the case of -mixing data.
Lemma 5.2** (Berbee, (1979)).**
Suppose on a probability space there is defined a pair of random variables with values in Borel spaces. If the probability space is rich enough, it can be extended with a random variable , independent of and distributed as such that
[TABLE]
5.2 Auxiliary technical results
Lemma 5.3**.**
Let Assumption (C2) hold. Then, the kernels defined in (2.8), have uniform moments, i.e. there exist such that for all , ,
[TABLE]
*where , and denote the joint distributions of , and , respectively, with , the sorted version of the vector . *
Lemma 5.4**.**
Let denote an independent and identically distributed copy of on a possibly richer probability space that is independent of . Then, for and arbitrary ,
[TABLE]
*where the latter equation also holds if any other pair is replaced by an iid copy . *
Lemma 5.5**.**
If Assumptions (C1) – (C3) are satisfied, we have for any fixed
- (1)
For any ,
[TABLE]
- (2)
If and
[TABLE]
we have for any permutation of
- (i)
**
- (ii)
if or ,
[TABLE]
- (iii)
if , , and , then
[TABLE]
5.2.1 Proof of Lemma 5.3
From assumption (C2), we have
[TABLE]
and therefore, by the definition of the Hoeffding decomposition,
[TABLE]
As is recursively defined by
[TABLE]
also has uniform -moments.
5.2.2 Proof of Lemma 5.4
Recall the following property of the conditional expectation [see Theorem 6.4 in Kallenberg, (2010)] which can easily be adapted to more than one -measurable random variable:
Let and be random variables and a -algebra such that is -measurable and is independent of . Then, for any measurable function with ,
[TABLE]
*where .
Thus, by the law of total expectation, we have
[TABLE]
Obviously, are -measurable and is independent of
. As, additionally, by Lemma 5.3 we have that
[TABLE]
where
[TABLE]
We will now show that . To this end, we consider the integral representation of , i.e. similarly as in the proof of Theorem 2 in [Lee, (1990), pg. 28], we obtain by the symmetry of ,
[TABLE]
Integrating both sides with respect to yields
[TABLE]
Observing that
[TABLE]
we have
[TABLE]
and altogether,
[TABLE]
which concludes the proof.
5.2.3 Proof of Lemma 5.5
- (1)
If replace the pair using Berbee’s coupling technique by an identically distributed copy that is independent of and and such that
[TABLE]
Then,
[TABLE]
By Hölder’s inequality we obtain
[TABLE]
which gives
[TABLE]
Now, is independent of and the result follows by Lemma 5.4.
If , using Berbee’s coupling technique, first replace by an identically distributed copy that is independent of , and and such that
[TABLE]
By Hölder’s inequality, we obtain with similar arguments as in the case that
[TABLE]
Then, replace by an independent copy that is independent of , , and such that
[TABLE]
where we have used Lemma 1 in Eberlein, (1984). Then,
[TABLE]
Finally, replace by an independent copy that is independent of , , and and such that
[TABLE]
which gives
[TABLE]
Altogether,
[TABLE]
Next, observe that the last summand does not vanish as does not have distribution . Therefore, using Berbee’s coupling technique, we rereplace by an independent copy such that the couple has distribution , is independent of and
[TABLE]
Hence,
[TABLE]
Observing that \min\Big{\{}l,k-l\Big{\}}\leq k and that the -mixing coefficients are monotone decreasing, we have
[TABLE]
and as \textnormal{\mbox{I\negthinspace E}}\Big{[}h_{1,k}\begin{pmatrix}X_{t}^{\circ}\\ X_{t+k}^{*}\end{pmatrix}\Big{]}=0, we can conclude that
[TABLE]
-
(2)
-
(i)
By Hölder’s inequality and as we obtain
[TABLE]
where we have used that for . Hence, by Lemma 5.3,
[TABLE]
- (ii)
For brevity, we only consider the case . The other cases are treated similarly but require a more complex notation.
In order to prove inequality (5.3), according to the coupling Lemma 5.2 by Berbee, (1979), depending on whether or , we can choose a random variable or respectively that has the same distribution as or respectively , independent of and such that
[TABLE]
First, consider the case where , that is we replace by an independent identically distributed copy . Then, by Lemma 5.4,
[TABLE]
Splitting the probability space, we obtain
[TABLE]
The second summand vanishes and for the first summand Hölder’s inequality yields
[TABLE]
where the latter inequality is due to Lemma 5.2. In the case where , we obtain
[TABLE]
Inequalities (5.6) and ((ii)) together yield result (ii).
- (iii)
As in (ii), we only consider the case . Then, if
[TABLE]
replace one after another , , and according to Lemma 5.2 by independent identically distributed copies , , and that are independent of the other involved random variables. Denote by the pair , where . Then, by Lemma 5.3 and similarly as in the proof of part (i)
[TABLE]
where the latter equality is due to the assumption that . Hence,
[TABLE]
Note that the second summand does not necessarily vanish because, having replaced , , and by independent identically distributed copies one after another, the couple does not have distribution . However, it is possible to bound
[TABLE]
by applying Lemma 5.2 again in order to ”rereplace” successively and by independent pairs and with distribution . More precisely, according to Lemma 5.2 we can replace by a random variable with the same distribution as that is independent of the other involved variables such that the couple has distribution . Then, similarly as in the proof of (ii),
[TABLE]
where we have used that due to the independence of , and all other involved variables, . Analogously, we can replace by a random variable such that the couple has distribution and is independent of . Then,
[TABLE]
Then, and both have distribution and are independent. Similar arguments as in the proof of Lemma 5.4 (ii) show that also
[TABLE]
and hence,
[TABLE]
Observing that by the definition of the -mixing coefficients and since
[TABLE]
equations ((iii)) and (5.9) yield
[TABLE]
Analogously, if , we obtain
[TABLE]
Combining these inequalities yields (iii), which concludes the proof.
5.3 Proofs of results from Section 4
5.3.1 Proof of Lemma 4.1
By Theorem 5.2 of Bradley, (2005), is -mixing with mixing coefficients
[TABLE]
where the latter identity is due to the definition of the processes , . Following the arguments in the proof of Lemma 4.1 in Kley, (2014), we obtain the result.
5.3.2 Proof of Lemma 4.2
For notational convenience, let . Note that,
[TABLE]
Define
[TABLE]
By the law of the total expectation we obtain
[TABLE]
where the latter inequality follows as
[TABLE]
is -measurable. Moreover, is -measurable and is independent of . From the property of the conditional expectation stated in the proof of Lemma 5.4, it follows that
[TABLE]
Hence, the same arguments as above yield
[TABLE]
Repeating these steps times yields the result.
- (i)
Under (C0) we have h_{1,k}^{\tau}\begin{pmatrix}x_{1}\\ y_{1}\end{pmatrix}=\textnormal{\mbox{I\negthinspace E}}\Big{[}h^{\tau}\Big{(}\begin{pmatrix}x_{1}\\ y_{1}\end{pmatrix},\begin{pmatrix}X_{0}\\ X_{k}\end{pmatrix}\Big{)}\Big{]} with
[TABLE]
Note that under (C0), we obtain from (4.2) that
[TABLE]
The equivalent representation of moments in terms of cumulants yields
[TABLE]
For all and , and we have
[TABLE]
and
[TABLE]
where is the copula associated with [see e.g. Schmid et al., (2010)] and is the population version of Spearman’s at lag . Hence,
[TABLE]
and inserting equations ((i)) and ((i)) in ((i)) yield the result.
- (ii)
Under (C0) we have h_{1,k}^{\rho}\begin{pmatrix}x\\ y\end{pmatrix}=\textnormal{\mbox{I\negthinspace E}}\Big{[}h^{\rho}\Big{(}\begin{pmatrix}x\\ y\end{pmatrix},\begin{pmatrix}X_{0}^{(1)}\\ X_{k}^{(1)}\end{pmatrix},\begin{pmatrix}X_{0}^{(2)}\\ X_{k}^{(2)}\end{pmatrix}\Big{)}\Big{]} with
[TABLE]
The first order kernel being centered by definition of the Hoeffding decomposition, from (4.2) we know that
[TABLE]
Thus, as under (C0) for any , we obtain from (4.2) that
[TABLE]
where we have used the representation of centered fourth moments in terms of cumulants, property (v) of Theorem 2.3.1 in Brillinger, (1975) and ((i))
[TABLE]
Furthermore, the only permutations and for which not all products of second order cumulants in ((ii)) contain one cumulant with one independent factor and thus equal [math] are those with . For each of these combinations we obtain
[TABLE]
and
[TABLE]
Plugging ((ii)) and ((ii)) into ((ii)) concludes the proof.
5.3.3 Proof of (4.6)
We will prove this result only for positive lags as the proof for negative lags is analogous. More precisely we consider
[TABLE]
and prove that
[TABLE]
Finally, using that for some constant , establishes (4.6).
For any fixed , decompose (5.3.3) into sums according to the following 3 cases:
- (1)
all indices are different,
- (2)
indices are different or
- (3)
or less indices are different,
that is
[TABLE]
In the sequel, denote by the -th smallest of all distinct indices among .
In case (1) we distinguish the following cases:
- (1.1)
or .
- (1.2)
and .
In case (1.1), consider the set
[TABLE]
and observe that , where denotes the cardinality of the set .
Then, we obtain from Lemma 5.5 (2) (ii), for some constant ,
[TABLE]
where we have bounded from above by an integral and then concluded with Assumption (C3).
Next, in case (1.2),
[TABLE]
For (I), we apply similar arguments as in the proof of Lemma 5.5 (2) (iii). That is, if we replace one after another all random variables by independent copies and then rereplace them by independent pairs with cdf . We have for any permutation of ,
[TABLE]
for a constant depening only on . Next, let which is always smaller than and consider the set
[TABLE]
Then, for some constant ,
[TABLE]
since . Hence, From Lemma 5.5 (2) (iii) we know that
[TABLE]
Consider the set
[TABLE]
with . Then, for some constant ,
[TABLE]
and hence,
Therefore, in case (1.2) we have
[TABLE]
Combining equations (5.3.3) and (5.3.3) yields
[TABLE]
which concludes the consideration of case (1).
In case (2), we encounter the following situations:
- (2.1)
the index appearing twice is not or .
- (2.2)
the index appearing twice is or .
Then,
[TABLE]
In case (2.1), consider the following situations:
- (a)
- (b)
and
- (c)
and
In situation (a), similarly as in the proof of Lemma 5.5 (ii), we replace the pair with the smallest index by an independent copy in order to bound the summand from above by . Hence, by assumption (C3),
[TABLE]
Next, in situation (b), with similar arguments as in the proof of Lemma 5.5 (iii), we replace one after another , , and by independent copies and obtain with assumption (C3),
[TABLE]
In situation (c), we use Lemma 5.5 (i) and the fact that in this case the number of summands is of order , that is
[TABLE]
Therefore,
[TABLE]
Since in case (2.2), the index appearing twice is or , the indices and appear only once. Thus the case can be handled by the similar arguments as case (2.1), i.e. we obtain
[TABLE]
Cases (2.1) and (2.2) yield
[TABLE]
which concludes the consideration of case (2).
In case (3) observe that the number of summands is of order , such that together with Lemma 5.5 (2) (i) we can conclude that
[TABLE]
Finally, combining equations (5.20), (5.3.3) and (5.3.3) yields the result.
5.3.4 Proof of (4.9)
We have by (4.3) that
[TABLE]
Next,
[TABLE]
Similar arguments as in the proof of (4.4) yield
[TABLE]
where we have used that |\frac{1}{n-r_{n}}-\frac{1}{n}|=O\Big{(}\frac{r_{n}}{n^{2}}\Big{)} and . Next,
[TABLE]
Note that by the stationarity of the process ,
[TABLE]
Similarly as for we obtain
[TABLE]
and analogously,
[TABLE]
Altogether,
[TABLE]
This concludes the proof of (4.9).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ahdesmäki et al., (2005) Ahdesmäki, M., Lähdesmäki, H., Pearson, R., Huttunen, H., and Yli-Harja, O. (2005). Robust detection of periodic time series measured from biological systems. BMC bioinformatics , 6(1):1.
- 2Anderson, (1971) Anderson, T. W. (1971). The Statistical Analysis of Time Series . John Wiley and Sons, Inc.
- 3Arcones and Yu, (1994) Arcones, M. A. and Yu, B. (1994). Central limit theorems for empirical and u-processes of stationary mixing sequences. Journal of Theoretical Probability , 7(1):47–71.
- 4Berbee, (1979) Berbee, H. C. (1979). Random walks with stationary increments and renewal theory. MC Tracts , 112:1–223.
- 5Birr et al., (2014) Birr, S., Volgushev, S., Kley, T., Dette, H., and Hallin, M. (2014). Quantile spectral analysis for locally stationary time series. ar Xiv preprint ar Xiv:1404.4605 .
- 6Blomqvist, (1950) Blomqvist, N. (1950). On a measure of dependence between two random variables. The Annals of Mathematical Statistics , 21(4):593–600.
- 7Bradley, (2005) Bradley, R. C. (2005). Basic properties of strong mixing conditions. a survey and some open questions. Probability Surveys , 2:107–144.
- 8Brillinger, (1975) Brillinger, D. R. (1975). Time series: Data Analysis and Theory . Holt, Rinehart and Winston, Inc.
