Ordinal Patterns in Long-Range Dependent Time Series
Annika Betken, Jannis Buchsteiner, Herold Dehling, Ines M\"unker,, Alexander Schnurr, Jeannette H.C. Woerner

TL;DR
This paper investigates the ordinal structure of long-range dependent time series using ordinal patterns, providing estimators, limit theorems, and simulations to understand their behavior, especially for high Hurst parameters.
Contribution
It introduces new estimators for ordinal pattern probabilities and establishes limit theorems, including for high Hurst parameters, extending existing theoretical results.
Findings
Limit theorems for ordinal pattern probabilities in different settings
Distributional results involving Rosenblatt distribution for certain cases
Simulations confirming theoretical predictions
Abstract
We analyze the ordinal structure of long-range dependent time series. To this end, we use so called ordinal patterns which describe the relative position of consecutive data points. We provide two estimators for the probabilities of ordinal patterns and prove limit theorems in different settings, namely stationarity and (less restrictive) stationary increments. In the second setting, we encounter a Rosenblatt distribution in the limit. We prove more general limit theorems for functions with Hermite rank 1 and 2. We derive the limit distribution for an estimation of the Hurst parameter if it is higher than 3/4. Thus, our theorems complement results for lower values of which can be found in the literature. Finally, we provide some simulations that illustrate our theoretical results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Ordinal Patterns in Long-Range Dependent Time Series
Annika Betken
,
Jannis Buchsteiner
,
Herold Dehling
,
Ines Münker
,
Alexander Schnurr
and
Jeannette H.C. Woerner
Fakultät für Mathematik, Ruhr-Universität Bochum, 44780 Bochum, Germany
Fakultät für Mathematik, Ruhr-Universität Bochum, 44780 Bochum, Germany
Fakultät für Mathematik, Ruhr-Universität Bochum, 44780 Bochum, Germany
Department of Mathematics, University of Siegen, 57068 Siegen, Germany
Department of Mathematics, University of Siegen, 57068 Siegen, Germany
Department of Mathematics, TU Dortmund, 44221 Dortmund
Abstract.
We analyze the ordinal structure of long-range dependent time series. To this end, we use so called ordinal patterns which describe the relative position of consecutive data points. We provide two estimators for the probabilities of ordinal patterns and prove limit theorems in different settings, namely stationarity and (less restrictive) stationary increments. In the second setting, we encounter a Rosenblatt distribution in the limit. We prove more general limit theorems for functions with Hermite rank 1 and 2. We derive the limit distribution for an estimation of the Hurst parameter if it is higher than 3/4. Thus, our theorems complement results for lower values of which can be found in the literature. Finally, we provide some simulations that illustrate our theoretical results.
Key words and phrases:
Hurst index, limit theorems, long-range dependence, ordinal patterns
This research was supported in part by the German Research Foundation (DFG) through Collaborative Research Center SFB 823 Statistical Modelling of Nonlinear Dynamic Processes, Research Training Group RTG 2131 High-dimensional Phenomena in Probability - Fluctuations and Discontinuity and the project * Ordinal-Pattern-Dependence: Grenzwertsätze und Strukturbrüche im langzeitabhängigen Fall mit Anwendungen in Hydrologie, Medizin und Finanzmathematik* (SCHN 1231/3-1).
1. Introduction
Originally, ordinal patterns have been introduced to analyze long and noisy time series. They have proved to be useful in various contexts such as sunspot numbers (Bandt and Shiha (2007)), EEG data (Keller et al. (2015)), speech signals (Bandt (2005)) and chaotic maps which appear in the theory of dynamical systems (Bandt and Pompe (2002)). Further applications include the approximation of the Kolmogorov-Sinai entropy (Sinn et al. (2012)). Recently, ordinal patterns have been used to detect and to model dependence structures between time series; see Schnurr (2014). Limit theorems for the parameters under consideration have been proved in the short-range dependent setting in Schnurr and Dehling (2017).
In the present paper we will investigate ordinal patterns in the long-range dependent setting. To the best of our knowledge Sinn and Keller (2011) is the only article which explicitly deals with the interplay between ordinal patterns and the Hurst parameter . The authors estimate this parameter of a fractional Brownian motion restricting their considerations to . An overview and a comparison of various other techniques for estimating the Hurst parameter is given in Taqqu et al. (1995) and Rea et al. (2009). None of the therein considered methods requires a restriction on the range of admissible values for . Nonetheless, graphical methods that are used to estimate the Hurst parameter such as the aggregated variance method or the R/S method (Mandelbrot and Wallis (1969), Mandelbrot (1975) and Mandelbrot and Taqqu (1979)) are known to be biased. Estimators operating in the frequency domain of time series, such as the Whittle estimator, which are usually based on an estimation of the spectral density by the periodogram, often make parametric assumptions on the spectral density of the data-generating process. Semiparametric alternatives such as the GPH estimator (Geweke and Porter-Hudak (1983)) and the local Whittle estimator (Künsch (1987), Robinson (1995)) require the choice of a bandwidth parameter denoting the number of Fourier frequencies incorporated in the estimation of the spectral density by the periodogram. The choice of this tuning parameter is crucial to the performance of semiparametric estimates, but difficult to select in practice. For the local Whittle estimator the selection of the bandwidth has been addressed by several authors; see for example Henry (2001), Delgado and Robinson (1996) and Henry and Robinson (1996). A different approach to estimate the Hurst parameter is to apply variational methods and techniques from stochastic analysis as for example derived in coeurjolly:2001 and istas:1997. For an ordinal-pattern based estimation of the Hurst parameter, the asymptotic distribution of the estimator is derived on the basis of limit theorems for short-range dependent time serie in Sinn and Keller (2011). Complementing the results of Sinn and Keller (2011), we derive the limit distribution for the estimator if .
In Fischer et al. (2017) the authors used ordinal patterns in the context of hydrological data. It is a well known fact that hydrological data is often long-range dependent. In this case, the limit theorems presented in Schnurr and Dehling (2017) are no longer valid. In the present paper we close this gap and provide limit theorems in the long-range dependent setting.
For let denote the set of permutations of , which we write as -tuples containing each of the numbers exactly one time. By the ordinal pattern of order we refer to the permutation
[TABLE]
which satisfies
[TABLE]
Given a time series , we consider the relative frequency
[TABLE]
of an ordinal pattern as a natural estimator for the probability
[TABLE]
Sinn and Keller (2011) show that Rao-Blackwellization leads to an estimator with lower risk and therefore better statistical properties.
In this article, both estimators are studied. Confirming the results of Sinn and Keller (2011), we show that and are consistent estimators; see Proposition 5.1. We consider separately the case of a stationary time series and the case of a time series with stationary increments. While the asymptotic distribution of can be derived from a limit theorem for functions with Hermite rank , the limit behaviour of is derived from corresponding results for functions with Hermite rank . Along the way we explicitly calculate the asymptotic distribution of partial sums of the form where has Hermite rank or Hermite rank and is a stationary long-range dependent Gaussian process.
The paper is organized as follows: in the next section we introduce the mathematical framework. In Section 3 we present the main results, namely the asymptotic properties of two estimators of ordinal pattern probabilities. In Section 4, on the basis of these considerations, the asymptotic distribution of an estimator for the Hurst parameter based on ordinal patterns is derived. The detailed proofs of more general limit theorems for functions with Hermite rank and , that constitute the theoretical background of the results in Section 3 and 4, are given in Section 5. In the final section a simulation study is presented.
2. Mathematical Framework
Let be a stationary standard Gaussian process with autocovariance function
[TABLE]
where is a function, slowly varying at infinity (see Bingham et al. (1987), p.6), and . Such a process is called long-range dependent. For we consider the -valued process given by
[TABLE]
that is, we consider overlapping finite sequences of the original process. For , , the corresponding cross-covariance function satisfies
[TABLE]
and, since is a slowly varying function, we thus obtain
[TABLE]
for all . Consequently, is multivariate long-range dependent in the sense of Arcones (1994), Section 3, if . If , we speak of short-range dependence.
We recall the concept of Hermite expansion. Let denote the Hermite polynomial of order given by
[TABLE]
and define the multivariate Hermite polynomial by
[TABLE]
The collection forms an orthogonal basis of , where denotes the -dimensional standard normal distribution; see Section 3.2 in Beran et al. (2013). Thus, for any square-integrable the following -identity holds:
[TABLE]
where . The Hermite coefficients are given by the inner product, that is . The starting index
[TABLE]
is called the Hermite rank of . Since the left-hand side in (2) is centered, we have . In contrast to (1) the definition of multivariate Hermite polynomials with respect to is more complicated; see Beran et al. (2013), section 3.2. The Hermite rank is defined analogously
[TABLE]
where .
The Hermite expansion in (2) is crucial to determining the asymptotics of partial sums of the type
[TABLE]
where satisfies .
3. Ordinal Patterns
In this section we introduce the concept of ordinal pattern analysis and present asymptotic distributions for estimators of ordinal pattern probabilities, where functions with different Hermite ranks show up. We also provide examples for the calculation of the coefficients specifying the limiting distributions of these estimators for certain ordinal patterns. For detailed proofs of the given theorems the reader is referred to Section 5.
Definition 3.1**.**
Let denote the set of permutations of , which we write as -tuples containing each of the numbers exactly one time. By the ordinal pattern of order we refer to the permutation
[TABLE]
which satisfies
[TABLE]
and if for .
The latter is introduced in order to deal with ties which do not occur in our simulation study, but which might occur when dealing with real data.
Remark 3.2**.**
Naturally, ordinal patterns are closely linked to the ranks of observations. Given observations , we define the rank of by
[TABLE]
Note that if for all , then
[TABLE]
Thus, ranks provide a complete description of the order structure of the vector equivalent to the description by ordinal patterns.
In this paper we are interested in estimating the probability for a given time series and therefore define the estimator
[TABLE]
for a time series .
We will see later (Remark 3.14) that the assumption that the time series is stationary yields trivial limits. Therefore, we relax this assumption and use a helpful relation that was derived in Sinn and Keller (2011). They have shown that the estimator above is uniquely determined by the increments of this process. Let us consider for .
For a vector define
[TABLE]
Then, it holds that
[TABLE]
since ordinal patterns are not affected by monotone transformations.
In terms of random vectors we hence arrive at
[TABLE]
In the following, we will study under which assumptions on the underlying time series we can derive an asymptotic result for the estimator . Since regarding as a stationary time series is not interesting due to the degenerate limit, we relax this assumption as follows: let be a (possibly non-stationary) stochastic process and let denote the corresponding increment process given by for . We assume that is a stationary standard Gaussian process with autocovariance function
[TABLE]
where is a function, slowly varying at infinity, and .
We now rewrite the estimator in terms of the increment variables following the considerations in (4):
[TABLE]
We will show that the relative frequency of any ordinal pattern is a consistent estimator for the corresponding probability.
Theorem 3.3**.**
Suppose that is a stationary ergodic process. Then, is a consistent estimator of . More precisely,
[TABLE]
almost surely.
3.1. Limit distribution of
At first we need to determine the Hermite rank of the estimator. Here, and in what follows, proofs are postponed to Section 5.
Lemma 3.4**.**
Let be a stationary standard normal Gaussian process and let . Then, for any , the Hermite rank of
[TABLE]
is equal to 1.
We now give the asymptotic distribution of the estimator and in doing so, we will take a closer look at the Hermite coefficients which determine the limit variance and hence the limit distribution.
Theorem 3.5**.**
Let be a stochastic process and let denote the increment process of given by for . Assume that is a stationary, long-range dependent standard Gaussian process with autocovariance function . Then,
[TABLE]
where and where the vector is given by
[TABLE]
with defined by
[TABLE]
Thus, in order to compute the limit variance of , we have to calculate the constants for . We can reduce the number of calculations by making use of the time and space symmetry of stationary multivariate normal random vectors. For a normal random vector these are given by
[TABLE]
Following Sinn and Keller (2011), p. 1784, we define two mappings:
[TABLE]
Graphically, the mapping can be considered as space reversal, i.e., as the reflection of on a horizontal line, while can be considered as time reversal, i.e., as the reflection of on a vertical line.
For each , we define
[TABLE]
It is easily seen that the set is closed under and , since and . This yields a partition of into sets each having either two or four elements, depending on whether holds for the considered .
In Sinn and Keller (2011), p.1786 and Lemma 1, it is shown that with respect to ordinal patterns the above considerations yield
[TABLE]
Both equations follow from the space and time symmetry of the multivariate normal distribution. More precisely, (7) holds since ordinal patterns are not affected by monotone transformations. For we have
[TABLE]
We compute the limit variance for ordinal patterns of lengths and , i.e., we need to study increments of length and . As it is common in the literature, we restrict ourselves to small in the present article. Unfortunately, the computations for larger values of exceed the computing capacity of Mathematica.
Given the symmetry relations in (6) and (7), we only need to calculate the Hermite coefficients of the estimator for one pattern of each reversion group. Regarding it is sufficient to choose . Regarding we can partition this set into the two subsets and . In the following we will study the Hermite coefficients of for and so that we can reduce the number of lengthy calculations since we only need to consider two ordinal patterns instead of six.
Example 3.6** (Ordinal patterns of length ).**
In the case there are only two possible patterns: and the corresponding spatial (or time) reverse . We focus on . This pattern corresponds to the event . Hence, we consider
[TABLE]
Correspondingly, we obtain for since this is the spatial reversion of . Thus, for these two ordinal patterns we arrive at a limit distribution of given by , where .
We continue with the calculation of the limit variances in the case . The integrals under consideration were solved by using Mathematica as well as a lengthy calculations that make use of the Cholesky decomposition (cf. the Appendix).
Example 3.7** (Ordinal patterns of length ).**
First, we study the limit variance for . In this case, has two elements. Note that . Due to the symmetry of the bivariate normal distribution, we obtain , so that we only need to calculate
[TABLE]
where denotes the joint density of . Hence,
[TABLE]
where are the entries of given by
[TABLE]
Again, we obtain the limit variance which is here more surprising than in the case because the result is independent of . For the space reverse pattern we apply (6) and obtain leading to the same limit variance. It is an interesting question whether it is just a coincidence that this variance is independent of the covariance between the increments. The answer turns out to be yes, since the dependence is reflected in the limit variance of the pattern .
Note that . As a result, we have
[TABLE]
where denotes the joint density of . As a result, we obtain
[TABLE]
The above expression depends on . Due to space and time symmetry discussed in (6) and (7) all permutations that belong to the reversion group of , i.e., and , lead to the same limit distribution for , namely
[TABLE]
3.2. Limit distribution of an improved estimator based on Rao-Blackwellization
In the previous section we considered the natural estimator for the frequency of a certain ordinal pattern. However, in Sinn and Keller (2011) it is shown that the estimator which results from averaging the estimates of the same reversion class has better statistical properties. The corresponding estimator is therefore defined by
[TABLE]
where denotes the cardinality of the set .
Recalling that , we are, in particular, interested in the function defined by
[TABLE]
In order to specify the limit distribution of , we need to determine the Hermite rank of this function. For this, note that Sinn and Keller (2011), p. 1786, show that has Hermite rank .
For a multivariate random vector define
[TABLE]
Analogously, we obtain
[TABLE]
so that alltogether we derive
[TABLE]
With this result we can simplify the second order Hermite coefficients for the improved estimator
[TABLE]
Analogously, we obtain
[TABLE]
Hence, we can uniquely determine the second order Hermite coefficients of the improved estimator by calculating the second order Hermite coefficients for only one pattern that belongs to the considered reversion group . By following the symmetry properties discussed above we derive for the special case
[TABLE]
for all .
The second order Hermite coefficients of the improved estimator are equal to the second order Hermite coefficients of .
We use this result to determine the Hermite rank of the function defined in (8), for details see Section 5, and to simplify the calculations concerning the parameters determining the variance in the next Theorem 3.10.
Lemma 3.8**.**
The function has Hermite rank .
Remark 3.9**.**
By a similar calculation we obtain that for all for the fixed pattern in the setting above.
Following the above Lemma, we derive the asymptotic distribution of the new estimator:
Theorem 3.10**.**
Let be a stochastic process and let denote the process of increments of given by for . Assume that is a stationary, long-range dependent standard Gaussian process with autocovariance function . Then, if ,
[TABLE]
with , and
[TABLE]
Remark 3.11**.**
For , the asymptotic distribution of is derived in Keller and Sinn (2005), Theorem 7. In this case, it is Gaussian.
For small we calculate the matrix of coefficients explicitly:
Example 3.12** (The case ).**
Since we are interested in increments with length , we have to study ordinal patterns of length . Regarding we derive the event and therefore
[TABLE]
So in the trivial case (only one increment variable) we derive a degenerate limit distribution again.
For increments of length , we used Mathematica to calculate the Hermite coefficients.
Example 3.13** (The case ).**
First, we consider the pattern and the corresponding event . We know that , , and by (9) that since . We have
[TABLE]
and
[TABLE]
This yields
[TABLE]
For the left-hand side in (10) converges in distribution to .
Consider the pattern and the corresponding event . It holds that
[TABLE]
and
[TABLE]
Since the reversion group of this pattern has four elements we also need to calculate
[TABLE]
Altogether we arrive at
[TABLE]
For the left-hand side in (10) converges in distribution to .
Remark 3.14**.**
The reader might wonder which limit theorems one can derive in the special case that it is not only the increment process which is stationary but the time series itself. We have to determine the Hermite rank of the estimator in this setting and we obtain that for any the Hermite rank of the function , defined by
[TABLE]
is equal to 1 (for details, see Section 5).
We get the following asymptotic result concerning the ordinal pattern probability estimator in this modified setting:
[TABLE]
where denotes the Dirac measure in [math]. In this special case, the limit distribution for is trivial.
However, taking the classical rate of convergence , we will get a non-trivial Gaussian central limit theorem as explained in section 2.1.
4. Estimation of the Hurst parameter
Sinn and Keller (2011) derive an estimator for the Hurst parameter based on the improved estimator for ordinal pattern probabilites . They show asymptotic normality of this estimator in the case . In order to obtain the asymptotic distribution for , we briefly describe the setting that was developed in that article. The idea is to determine the probability of changes in the “up-and-down” behaviour of the process . Since we need to use orthant probabilites of the normal distribution, we restrict ourselves to the case here.
To capture this mathematically, we define
[TABLE]
with .
Therefore, we obtain
[TABLE]
where is the covariance function of the stationary and long-range dependent increment process of as defined above; see Kotz et al. (2004), p.92. Since depends on the long-range dependence parameter , which we can express as in terms of the Hurst parameter, we will write in the following.
In order to estimate this probability, we choose the relative frequency as an estimator:
[TABLE]
with . We want to estimate the Hurst parameter in the case that is fractional Gaussian noise and hence is fractional Brownian motion. The correlation function of fractional Gaussian noise is given by
[TABLE]
such that . Therefore, we obtain
[TABLE]
since for . The probability of changes in the up-and-down-behaviour gets smaller if the Hurst parameter gets larger, as expected intuitively due to the persistent behaviour of long-range dependent time series. We calculate the inverse of by
[TABLE]
so that is satisfied
The Zero-Crossing estimator of the Hurst Parameter is then defined by
[TABLE]
In Sinn and Keller (2011), Corollary 11, it is shown that is a strongly consistent and asymptotically unbiased estimator of the Hurst Parameter, as well as it is asymptotically normal if . Using Theorem 3.10 we can complement their result by the following theorem.
Theorem 4.1**.**
If ,
[TABLE]
Proof.
Since , it follows by Theorem 3.10 and Example 3.13 that
[TABLE]
where with
[TABLE]
for . Therefore, and according to Example 3.13, we arrive at
[TABLE]
We also know that and since
[TABLE]
we get (see Beran et al. (2013), p. 34) with meaning that . All in all, it follows that
[TABLE]
We have and . Due to for , exists and does not equal zero for . Applying Theorem 3 in Van der Vaart (2000) we arrive at the above limit. ∎
5. Proofs
In this section the proofs of the results derived in Section 3 are presented. We are able to give these results in a more general way than needed in the context of ordinal patterns and therefore consider a larger class of functions. In the following we consider the asymptotic behaviour of the partial sums
[TABLE]
where satisfies . A first result on the asymptotic behaviour of these partial sums, which includes the statement of Theorem 3.3, is given by the following proposition that can be derived from Birkhoff’s ergodic theorem; see also Sinn and Keller (2011).
Proposition 5.1**.**
Suppose that is a stationary ergodic process, and that is a measurable function such that . Then,
[TABLE]
almost surely, as .
Proof.
Ergodicity of the process means that the shift operator
[TABLE]
defined by , is an ergodic transformation on the sequence space , equipped with the product -field and the probability measure . Thus, by Birkhoff’s ergodic theorem, we obtain for any integrable function
[TABLE]
almost surely. We now apply the ergodic theorem to the function , defined by
[TABLE]
With this choice of , we obtain and , and thus
[TABLE]
for -almost every sequence . Thus, by definition of , we find
[TABLE]
almost surely. ∎
Remark 5.2**.**
A stationary Gaussian process with autocovariance function such that if is mixing and hence ergodic; see Samorodnitsky (2007), pp. 43, 46. Thus, we may apply the above results to such Gaussian processes.
In general, in order to derive the limit distributions of partial sums given in (11), a careful analysis of the Hermite rank of the considered function is crucial. First, we present two lemmas that are helpful tools to determine the Hermite rank of a function and later on we give the proofs of Lemma 3.4 and Lemma 3.8, in which we deal with the heuristic estimator of ordinal pattern probabilities and the improved estimator based on the Rao-Blackwellization, respectively. Furthermore, we give the justification for the Hermite rank of the estimator considered in Remark 3.14.
It is well known that for we have and that in general; see Beran et al. (2013), Lemma 3.7. The last fact is disadvantageous since determining is usually much more complicated than determining . However, it is possible to show that under a mild additional assumption is bounded by ; see Lemma 5.4.
Lemma 5.3**.**
Let be a sequence of measurable functions in , and let be another function in this space such that in the metric of this space. Furthermore, let be a positive definite matrix such that is positive semidefinite. In this case, converges to in .
Proof.
Let be the Radon-Nikodym density of with respect to such that we have
[TABLE]
Hence, proving Lemma 5.3 boils down to the boundedness of which is obtained in the above setting by an elementary calculation: let and denote the density of and , respectively. Then,
[TABLE]
∎
Given the above result, we arrive at the following upper bound for :
Lemma 5.4**.**
Let be square-integrable with respect to and let be a positive definite covariance matrix such that is positive semidefinite. Then,
[TABLE]
Remark 5.5**.**
Note that, for ,
[TABLE]
With denoting the smallest eigenvalue of , we have
[TABLE]
Given that , and thus , are positive definite matrices, so that we can choose such that is positive semidefinite. Since ordinal patterns are not affected by scaling, we may for this reason assume that is positive semidefinite.
Proof.
Expanding both and in Hermite polynomials with respect to yields
[TABLE]
with . By uniqueness of the Hermite decomposition we have , which completes the proof. ∎
Proof of Lemma 3.4:
Proof.
Since ordinal patterns are not affected by scaling, we may assume that is positive semidefinite. According to Lemma 5.4 it suffices to show for some independent standard normal random variables and some . For simplicity, we regard the pattern which corresponds to the event . Hence, we arrive at
[TABLE]
It follows by the same reasoning that none of the expected values that correspond to the other ordinal patterns equals zero. ∎
Proof of Lemma 3.8:
Proof.
For we have ; see Beran et al. (2013), Lemma 3.7. According to Lemma 5.4, we have
[TABLE]
As a result, it is sufficient to show that , such that we may conclude .
To this end, let be a standard Gaussian random vector (i.e. with autocovariance matrix ). Following the arguments above, we only need to consider the second order Hermite coefficients of for a fixed pattern :
[TABLE]
For simplicity we regard . Note that for this pattern it suffices to show to prove , since in this case for For
[TABLE]
∎
Proof of the determination of the Hermite rank of the considered estimator in Remark 3.14:
Proof.
Let be a stationary, long-range dependent, standard normal Gaussian process and let . By Lemma 5.4 it is enough to show that for some independent standard normal random variables and some . Without loss of generality let and set . This yields
[TABLE]
since we integrate a strictly positive function. Hence, for any the Hermite rank of the function , defined by
[TABLE]
is equal to 1. ∎
We have finished our preparations and are now able to turn to the limit theorems for the partial sums in (11) for function with Hermite rank and . In the following, we will assume without loss of generality that
[TABLE]
We want to apply the results of Arcones (1994) which hold for partial sums of functions of -valued random vectors that have a multivariate standard normal distribution; see also Major (2019) for an alternative approach. Thus, we need to transform the vector accordingly. Let denote the covariance matrix of the vector . Observe that is a Toeplitz matrix whose entries are determined by the autocovariance function of the process , i.e.,
[TABLE]
The Cholesky decomposition yields
[TABLE]
where is an upper triangular matrix. Thus, there exists a standard normally distributed random vector such that
[TABLE]
We can rewrite the partial sum in (3) in terms of the random vectors as follows:
[TABLE]
where is defined by
[TABLE]
In order to characterize the asymptotic distribution of the considered partial sum process, we apply Theorem 6 of Arcones (1994). Employing the special structure of we obtain explicit representations of the limit distributions for the cases Hermite rank equal to and .
5.1. Limit theorems for functions with Hermite rank 1
First we consider the asymptotic behaviour of function with Hermite rank . Note that this Theorem implies the statement of Theorem 3.5.
Theorem 5.6**.**
Let be a stationary, long-range dependent standard Gaussian process with autocovariance function and let be a function with Hermite rank satisfying . Then,
[TABLE]
where and where with .
Proof.
Given that the function has Hermite rank , the limit behaviour corresponds to the asymptotic behaviour of the first order term in the Hermite expansion of . The Hermite rank of with respect to is the same as the Hermite rank of with respect to ; see Beran et al. (2013), Lemma 3.7. Since this first order term is given by
[TABLE]
with , since . It follows by stationarity and by definition of the process that the coefficient in the Hermite expansion (15) corresponds to
[TABLE]
We can thus express the vector of coefficients as follows:
[TABLE]
where is the vector of inner products of the random variables with , i.e.,
[TABLE]
According to the results of Arcones (1994), we know that the partial sums are dominated by the corresponding partial sums of the first order term in the Hermite expansion, i.e., that
[TABLE]
where for a sequence of random variables we write if .
With the notations introduced above, we obtain
[TABLE]
where the vector is given by
[TABLE]
Thus, we obtain
[TABLE]
The distribution of the partial sum on the right-hand side can be calculated exactly, as this is a partial sum of normal random variables. ∎
In the following, we study partial sums of functions of increments of a stationary long-range dependent Gaussian process of the following type
[TABLE]
This is a special case of partial sums of the type , where
[TABLE]
Functions of this kind appear, e.g. when studying ordinal patterns, cf. Section 3, considerations in (4). Therefore the following lemma combined with Theorem 5.6 yield the justification for the asymptotic result derived in Remark 3.14.
Lemma 5.7**.**
If can be written as a function of the increments, we have
[TABLE]
Proof.
We use a well-known fact about Gaussian random variables: Let be a vector of independent standard normally distributed random variables , , and let and be two matrices. Then, the random vectors and are independent, if and only if each of the rows of is orthogonal to each of the rows of , i.e., when .
We then use the representation of that we derived in the course of the proof of Theorem 5.6, namely
[TABLE]
where , and where is the matrix defined by
[TABLE]
Let be a positive definite symmetric matrix such that and let be its inverse. Then, has a -variate standard normal distribution. With this notation, we can rewrite the above expression for as follows:
[TABLE]
If ,
[TABLE]
Now we can apply the initial remark to the vectors and . We have
[TABLE]
and thus the vectors and are independent. Hence,
[TABLE]
since . ∎
Remark 5.8**.**
Lemma 5.7 implies that the limit in Theorem 2.3 is trivial if the function can be considered as a function of the increment process of a stationary, long-range dependent Gaussian process . An explanation for this phenomenon results from the observation that the increments of long-range dependent time series do not display characteristic features of long-range dependence. To see this, let denote the spectral density of the time series , i.e. is a non-negative function satisfying
[TABLE]
By assumption, we have
[TABLE]
where is a function that is slowly varying at infinity. If, additionally, is quasi-monotone, it follows that
[TABLE]
for some function that is slowly varying at zero; see Pipiras and Taqqu (2017), p. 19. For the increment process defined by , it then holds that
[TABLE]
For this reason, corresponds to the spectral density of the process . Note that
[TABLE]
with
[TABLE]
slowly varying at zero. It follows that
[TABLE]
i.e., the increment process is antipersistent and, in particular, short-range dependent; see Pipiras and Taqqu (2017), p. 31.
This finding coincides with results on limit theorems for discretely observed processes based on fractional Brownian motion, where the application of linear difference filters leads to a smaller exponent in the autocovariance function, cf. coeurjolly:2001, istas:1997. In our setting this would mean that considering differences of the stationary, long-range dependent process would lead to a short-range dependent process and hence to a Gaussian central limit theorem with a different normalizing constant, namely .
5.2. Limit theorems for functions with Hermite rank 2
We continue to study the asymptotic behaviour of the partial sums in (3) for a function with Hermite rank and therefore obtain the statement of Theorem 3.10 along the way.
Theorem 5.9**.**
Let be a stationary, long-range dependent standard Gaussian process with autocovariance function , and let be a function with Hermite rank satisfying . Then, if ,
[TABLE]
*with ,
and .*
Remark 5.10**.**
The extension of the theorem above for is given in Arcones (1994), Theorem 4.
Proof.
Recall that for an upper triangular matrix with and . and a multivariate standard normally distributed vector . Since the Hermite rank of , defined by , equals , the partial sums are dominated by the corresponding partial sums of the second order term in the Hermite expansion, i.e., that
[TABLE]
see Theorem 6 in Arcones (1994). Note that
[TABLE]
Since the left-hand side of the above equality is centered to mean zero,
[TABLE]
With , where , it follows that
[TABLE]
where denotes the -th entry of the vector , , since .
Note that . As a result,
[TABLE]
With it follows that
[TABLE]
All in all, we arrive at
[TABLE]
Note that
[TABLE]
Define the sample covariance at lag by
[TABLE]
Considering both summands separately, we arrive at
[TABLE]
and
[TABLE]
All in all, it follows that
[TABLE]
If , it follows by Section 4.4.1.3 in Beran et al. (2013) that
[TABLE]
where is a Rosenblatt process with parameter and .
Therefore, the considered expression converges in distribution to
[TABLE]
∎
Hence, we are able to characterize the limit distribution of the partial sums in (11) for functions with Hermite rank and .
6. Simulation study
We simulate paths of fractional Gaussian noise (by the command “simFGN0” from the RPackage “longmemo”) with sample size for different values of to compare the distribution of the estimators , and with the theoretical results derived above. We standardized the estimators following the normalization constants given in Theorem 3.5 and Theorem 3.10. The results depending on the long-range dependence parameter are displayed in Figure 3 and in Figure 4.
In Figure 5, the histograms and kernel density estimations of the estimator of the Hurst parameter are given, standardized by the normalizing constants we derived in Theorem 4.1.
Acknowledgments: We would like to thank two anonymous referees for their helpful comments.
7. Appendix
Calculation of the Hermite coefficients of for for the pattern ; cf. Example 3.7.
Since we look at , the covariance matrix of is given by
[TABLE]
The Cholesky decomposition has the following form:
[TABLE]
Note that , where has a bivariate standard normal distribution. Following Theorem 3.5, we need to calculate , where . Since
[TABLE]
we need to determine to calculate the variance in the limit distribution.
We consider . From the Cholesky decomposition it follows that and and therefore and . For this choice of we also know by (6) and (7) that and hence we arrive at
[TABLE]
Therefore, it is sufficient to only determine . For this, we rewrite
[TABLE]
Hence, we need to determine
[TABLE]
Finally, we obtain
[TABLE]
As a result, we confirm the result from Example 3.7 for the pattern . For , the analytical calculations work analogously.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Arcones (1994) Miguel A. Arcones. Limit Theorems for Nonlinear Functionals of a Stationary Gaussian Sequence of Vectors. The Annals of Probability , 22(4):2242 – 2274, 1994.
- 2Bandt (2005) Christoph Bandt. Ordinal time series analysis. Ecological modelling , 182(3-4):229–238, 2005.
- 3Bandt and Pompe (2002) Christoph Bandt and Bernd Pompe. Permutation Entropy: A Natural Complexity Measure for Time Series. Physical review letters , 88(17):174102–1 – 174102–4, 2002.
- 4Bandt and Shiha (2007) Christoph Bandt and Faten Shiha. Order Patterns in Time Series. Journal of Time Series Analysis , 28(5):646 – 665, 2007.
- 5Beran et al. (2013) Jan Beran, Yuanhua Feng, Sucharita Ghosh, and Rafał Kulik. Long-Memory Processes . Springer-Verlag Berlin Heidelberg, 2013.
- 6Bingham et al. (1987) Nicholas H. Bingham, Charles M. Goldie, and Jozef L. Teugels. Regular Variation . Cambridge University Press, 1987.
- 7Delgado and Robinson (1996) Miguel A. Delgado and Peter M. Robinson. Optimal spectral bandwidth for long memory. Statistica Sinica , 6:97 – 112, 1996.
- 8Fischer et al. (2017) Svenja Fischer, Andreas Schumann, and Alexander Schnurr. Ordinal pattern dependence between hydrological time series. Journal of Hydrology , 548:536 – 551, 2017.
