An Independence Test Based on Recurrence Rates
Juan Kalemkerian, Diego Fern\'andez

TL;DR
This paper introduces a new independence test based on recurrence rates and a Cramér-von Mises type functional, demonstrating strong asymptotic properties and higher power compared to existing tests, applicable to both discrete and continuous time series.
Contribution
The paper presents a novel independence test leveraging recurrence rates and a U-process, with proven asymptotic distribution, consistency, and superior power in various scenarios.
Findings
Test shows good behavior under multiple alternatives.
Higher power compared to traditional independence tests.
Applicable to both discrete and continuous time series.
Abstract
A new test of independence between random elements is presented in this article. The test is based on a functional of the Cram\'{e}r-von Mises type, which is applied to a -process that is defined from the recurrence rates. Theorems of asymptotic distribution under and consistency under a wide class of alternatives are obtained. The results under contiguous alternatives are also shown. The test has a very good behaviour under several alternatives, which shows that in many cases there is clearly larger power when compared to other tests that are widely used in literature. In addition, the new test could be used for discrete or continuous time series.
| Test | HHG | DCOV | HSIC | PSK | N(1,1) | N(0,1) | N(1,4) | |
|---|---|---|---|---|---|---|---|---|
| Parabola | 0.791 | 0.522 | 0.733 | 0.103 | 0.824 | 0.831 | 0.814 | 0.817 |
| 2 parabolas | 0.962 | 0.204 | 0.849 | 0.194 | 1.000 | 1.000 | 1.000 | 1.000 |
| Circle | 0.646 | 0.051 | 0.488 | 0.096 | 0.923 | 0.716 | 0.947 | 0.823 |
| Diamond | 0.283 | 0.030 | 0.262 | 0.016 | 0.422 | 0.139 | 0.477 | 0.395 |
| W-shape | 0.908 | 0.569 | 0.856 | 0.179 | 0.788 | 0.887 | 0.782 | 0.874 |
| 4 clouds | 0.052 | 0.053 | 0.053 | 0.046 | 0.052 | 0.052 | 0.051 | 0.051 |
| Test | HHG | DCOV | HSIC | PSK | N(1,1) | N(0,1) | N(1,4) | |
|---|---|---|---|---|---|---|---|---|
| Parabola | 0.983 | 0.854 | 0.957 | 0.114 | 0.979 | 0.983 | 1.000 | 0.975 |
| 2 parabolas | 1.000 | 0.354 | 0.997 | 0.198 | 1.000 | 1.000 | 1.000 | 1.000 |
| Circle | 0.985 | 0.075 | 0.914 | 0.008 | 0.999 | 0.997 | 1.000 | 0.995 |
| Diamond | 0.664 | 0.048 | 0.545 | 0.013 | 0.836 | 0.630 | 0.884 | 0.761 |
| W-shape | 0.999 | 0.935 | 0.988 | 0.077 | 0.989 | 0.998 | 0.987 | 0.979 |
| 4 clouds | 0.050 | 0.047 | 0.048 | 0.046 | 0.512 | 0.055 | 0.054 | 0.051 |
| Test | HHG | DCOV | HSIC | PSK | N(1,1) | N(0,1) | N(1,4) | |
|---|---|---|---|---|---|---|---|---|
| Parabola | 1.000 | 0.994 | 1.000 | 0.105 | 1.000 | 1.000 | 1.000 | 1.000 |
| 2 parabolas | 1.000 | 0.700 | 1.000 | 0.201 | 1.000 | 1.000 | 1.000 | 1.000 |
| Circle | 1.000 | 0.196 | 0.999 | 0.004 | 0.999 | 1.000 | 1.000 | 1.000 |
| Diamond | 0.948 | 0.096 | 0.853 | 0.003 | 0.836 | 0.953 | 1.000 | 0.999 |
| W-shape | 1.000 | 0.999 | 1.000 | 0.085 | 0.988 | 1.000 | 1.000 | 1.000 |
| 4 clouds | 0.047 | 0.047 | 0.047 | 0.049 | 0.051 | 0.049 | 0.055 | 0.057 |
| Test | HHG | DCOV | HSIC | N(1,1) | N(1,4) | N(0,4) | N(2,4) | |
|---|---|---|---|---|---|---|---|---|
| Log | 0.594 | 0.154 | 0.610 | 0.710 | 0.759 | 0.321 | 0.885 | 0.813 |
| Epsilon | 0.784 | 0.226 | 0.484 | 0.470 | 0.576 | 0.194 | 0.749 | 0.858 |
| Quadratic | 0.687 | 0.302 | 0.530 | 0.197 | 0.155 | 0.170 | 0.147 | 0.144 |
| 2D-indep | 0.161 | 0.175 | 0.403 | 0.177 | 0.264 | 0.106 | 0.263 | 0.112 |
| Test | HHG | DCOV | HSIC | N(1,1) | N(1,4) | N(0,4) | N(2,4) | |
|---|---|---|---|---|---|---|---|---|
| Log | 0.936 | 0.386 | 0.958 | 0.998 | 0.999 | 1.000 | 1.000 | 0.995 |
| Epsilon | 0.969 | 0.298 | 0.689 | 0.895 | 0.967 | 0.968 | 0.999 | 0.984 |
| Quadratic | 0.934 | 0.485 | 0.904 | 0.362 | 0.293 | 0.315 | 0.733 | 0.236 |
| 2D-indep | 0.27 | 0.359 | 0.798 | 0.281 | 0.219 | 0.261 | 0.198 | 0.172 |
| Test | HHG | DCOV | HSIC | N(1,1) | N(1,4) | N(0,4) | N(2,4) | |
|---|---|---|---|---|---|---|---|---|
| Log | 1.000 | 0.793 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| Epsilon | 0.999 | 0.382 | 0.896 | 0.998 | 1.000 | 1.000 | 1.000 | 1.000 |
| Quadratic | 0.996 | 0.725 | 0.971 | 0.595 | 0.545 | 0.535 | 0.480 | 0.416 |
| 2D-indep | 0.544 | 0.751 | 0.993 | 0.489 | 0.348 | 0.466 | 0.263 | 0.284 |
| AR | 0.350 | 0.214 | 0.772 | 0.051 | |
| AR | 0.592 | 0.402 | 0.962 | 0.050 | |
| AR | 0.999 | 0.698 | 1.000 | 0.046 | |
| AR | 1.000 | 0.903 | 1.000 | 0.035 | |
| AR | 1.000 | 0.998 | 1.000 | 0.053 | |
| AR | 1.000 | 1.000 | 1.000 | 0.039 | |
| ARMA | 0.817 | 0.323 | 0.925 | 0.057 | |
| ARMA | 0.986 | 0.566 | 0.996 | 0.047 | |
| ARMA | 1.000 | 0.921 | 1.000 | 0.051 |
| 0.770 | 0.519 | 0.402 | 0.060 | ||
| 0.924 | 0.752 | 0.656 | 0.052 | ||
| 0.994 | 0.923 | 0.839 | 0.040 | ||
| 0.732 | 0.550 | 0.366 | 0.039 | ||
| 0.883 | 0.805 | 0.586 | 0.040 | ||
| 0.987 | 0.930 | 0.804 | 0.051 |
| FOU | FOU | |||
| 0.775 | 0.183 | 0.053 | ||
| 0.906 | 0.541 | 0.046 | ||
| 0.986 | 0.880 | 0.056 | ||
| 0.380 | 0.106 | 0.045 | ||
| 0.516 | 0.282 | 0.039 | ||
| 0.707 | 0.542 | 0.042 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Dynamics and Fractals · Bayesian Methods and Mixture Models · Financial Risk and Volatility Modeling
An Independence Test Based on Recurrence Rates
Juan Kalemkerian
Universidad de la República, Facultad de Ciencias
Diego Fernández
Universidad de la República, Facultad de Ciencias Económicas
y Administración
Abstract
A new test of independence between random elements is presented in this article. The test is based on a functional of the Cramér-von Mises type, which is applied to a -process that is defined from the recurrence rates. Theorems of asymptotic distribution under and consistency under a wide class of alternatives are obtained. The results under contiguous alternatives are also shown. The test has a very good behaviour under several alternatives, which shows that in many cases there is clearly larger power when compared to other tests that are widely used in literature. In addition, the new test could be used for discrete or continuous time series.
**Keywords: ** independence tests, recurrence rates, U-process. 62H15, 62H20
1 Introduction
Let i.i.d. sample of and , where and are metric spaces. When we have the following hypothesis test: and are independent random elements, we are under the so called independent tests. The independence tests have been developed in the first instance for the case, based on the pioneering work of Galton [10] and Pearson [23] (this is the famous correlation test, which is widely used today). The limitations of this hypothesis test are well known and they have motivated several different proposals in this topic, such as the classical rank test (e.g. Spearman,[24], Kendall, [19] or Blomqvist, [6]). Another classic and intuitive result can be found in Hoeffding [15], where the test statistic is defined by , although it is not widely used. Independence between random vectors is addressed for the first time in Wilks [27]. Genest and Rémillard [16] propose a test based on copulas for continuous random variables. Kojadinovic and Holmes [18], generalize this result for random vectors using a Cramér-von Mises type statistic. Bilodeau and Lafaye de Micheaux [5], propose a test of independence between random vectors, each of which has a normal marginal distribution. Continuing in some sense this work, Beran et al. [4] propose a universally consistent test for random vectors, from empirical multidimensional distributions. Gretton et al. [12] propose a universally consistent test based on Hilbert-Schmidt norms. Another consistent test is proposed by Székely et al. [25, 26], which defines the concept of distance covariance. This test has its origin in [3] and it has since become very popular. It has been used and has had a considerable impact from the moment that it was proposed. More recently, Heller et al. [13] propose a test that in many cases has much more powerfull than the distance covariance test. In his monograph, Boglioni [7] compares several alternatives of these tests by means of intense work of power calculations. Because the tests proposed in Beran et al. [4] and Heller et al. [13] have very good performance under several alternatives, in Section 4 we will compare them with the test that we propose in our work.
Starting from another point of view, Eckman et al. [9] introduce the recurrence plot (RP). This is a very important graphical tool to understand the dynamics of a time series in high dimension. Eckman et al.’s [9] generated an appreciable amount of work and is currently applied in many different areas in which mathematical models are used, whether probabilistic or deterministic. The RP is a graphical tool that shows the recurrence in a time series and it is constructed using the recurrence matrix as defined by , where is an appropriate parameter. The objective of this tool is to determine the patterns in a time series. The choice of is a key point to detect patterns and several suggestions have been made on how to appropriately find it. Marwan [21] gives a historical review of recurrence plots techniques, together with everything developed from them. However, the potential of these techniques has not yet been studied in depth from the point of view of mathematical statistics.
The main objective of this article is to propose a hypothesis test to detect dependence between two random elements, and , based on recurrence rates by using the information of and for any values of and One advantage of our test is that instead of choosing appropriate values of and , we use the information generated by both samples for all of the possible values of and . In our test, and can take values in any metric space. Therefore, our test can be used to test if and are independent in the case where and are random variables, random vectors or time series. We can then replace the norms by distances.
The rest of this paper is organized as follows. In Section 2, we give the definitions of recurrence rates for for and for joint and we propose the statistical procedure to make the decision between vs The statistics are based on a functional of the Cramér von-Mises type applied to a -process defined from the recurrence rates of and We also give the theoretical results, which are the asymptotic distribution and consistency of the test statistic (Subsection 2.1), and the behavior under contiguous alternatives (Subsection 2.2). In Section 3, we describe how the test can be implemented, including a formula to obtain the statistic for the test. In Section 4, we use simulations to show the performance of the test against others by power comparison in the cases where and are random variables or random vectors. We also compute power in the case where and are discrete and continuous time series. Like Heller et al.’s [13] test, our test is based on distances between the elements of the sample. Likewise, our test had very good performance under several alternatives. Our concluding remarks are given in Section 5. Appendix gives the proofs of the results that are established in Section 2.
2 Test approach and theoretical results
Given i.i.d. sample of where where and are metric spaces, and given To simplify the notation and without risk of confusion, we will use the same letter for the distance function in both metric spaces and .
We define the recurrence rate for the sample of and as
[TABLE]
[TABLE]
respectively, and the joint recurrence rate for as
[TABLE]
We define the probability that the distance between any two elements of the sample is less than Similarly, we define the probability between three points as and analogously and
We also need to define
The strong law of large numbers for -statistics ([14]) allows us to affirm that for any ,
[TABLE]
We want to test and are independent, against does not hold.
If is true, then for all , and we expect that if is large, for any Then, we propose to build the test statistic, to work with the process where
[TABLE]
Therefore, it is natural to reject when where
[TABLE]
where is a constant and is a distribution function.
Throughout this work, we use the notation and for distribution and density function of random variable respectively, and for each , the set
[TABLE]
Now we will formulate the asymptotic results of our test statistic. First, we will show a result that guarantees the asymptotic distribution of under . We will also present a result that establishes a consistency of our test under a wide class of alternatives. Second, we will analyze the asymptotic bias when we consider contiguous alternatives.
2.1 Asymptotic results under and consistency
We start with the next lemma, in which we obtain the formula for the asymptotic autocovariance function of the process under .
Lemma 1**.**
Given and i.i.d. in where and are independent, then
[TABLE]
[TABLE]
The following lemma will be useful to reduce asymptotic convergence of the process to the convergence of an approximate process that we will call and is defined as follows
[TABLE]
[TABLE]
Lemma 2**.**
Given i.i.d. in , then
[TABLE]
where
[TABLE]
To obtain the weak convergence of the process to a centered Gaussian process (therefore the asymptotic distribution of the statistics defined in (3) is determined), we will use Theorem 4.10 obtained by Arcones & Giné [1]:
Let be a probability space, and for all , are i.i.d. sequence with Given , let be a class of measurable functions on the -process based on and indexed by is
[TABLE]
where .
Given , assume that exists , such that and for all
[TABLE]
[TABLE]
**Theorem (Arcones Giné 1993) **
[TABLE]
If
[TABLE]
then
[TABLE]
*where is the Brownian bridge associated with *
Convergence in the space , is in the sense of Hoffmann-Jørgensen, see ([11]).
Theorem 3**.**
Given i.i.d. in If the distribution functions of and are continuous, then
[TABLE]
where is a centered Gaussian process.
Remark 1**.**
Observe that our process lies in (because is a probability measure). Therefore, our test statistic is , thus, the functional is continuous.
Remark 2**.**
Given and i.i.d. sample of where the marginals are independent. Then
[TABLE]
where
[TABLE]
[TABLE]
If and are not independent, then our test is consistent.
Theorem 4**.**
Given i.i.d. in . If , for all and , are continuous and not independent random variables, then as
The next corollary follows from Theorem 4.
Corollary 1**.**
If , where and are not independent, and , for all then as .
Remark 3**.**
Consider , in i.i.d. with joint density and joint distribution such that and are independent.
Then
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Similarly,
[TABLE]
[TABLE]
Then, for all
Of course, it could happen that condition for all is fulfilled, and nevertheless and are not independent. This is the restricted type of distributions that do not satisfy the conditions of our consistency theorem.
2.2 Contiguous alternatives
In this subsection we will analyze the behavior of this test under contiguous alternatives.
More explicitly, given i.i.d. in , consider
[TABLE]
(i.e. and are independent), vs
[TABLE]
where is a constant such that be a density, and the functions verify the conditions (i) and (ii) that are given below:
Define for , the distribution function of under analogously define
- (i)
Exists a function such that for all
- (ii)
Exists such that ,
It can be proven that conditions (i) and (ii) imply contiguity (Cabaña [8]).
The coefficient is introduced so that The function is called asymptotic drift.
We will show in the following lines that under the process has the same asymptotic limit as under plus a deterministic drift.
We use the notation and for the expectation value of , and the probability of the set under respectively. Analogously we use and under
Proposition 1**.**
[TABLE]
Under
[TABLE]
where
[TABLE]
and
With a little more work, using the Le Cam third lemma (Le Cam & Yang, [20] and Oosterhoff & Van Zwet, [22]) it is possible to prove that under ,
[TABLE]
where is the limit process under and
[TABLE]
Therefore, under
[TABLE]
3 Implementation of the test
3.1 and are random variables
In the case where and are continuous random variables, we observe that and are independent; it is equivalent to say that and are independent, where and are the distribution functions of and , respectively. If we apply the test procedure to and , then we have the advantage that now the variables are on the same scale and each has a normal centered distribution that approximates to the hypotheses of Remark 2. In addition, in this case the formula (11) for is completely determined. Another additional advantage is that under ( and are independent and ), for small values of , we can calculate the critical values at or another level because we will know the distribution of under Where and are random vectors, the same transformation can be applied in each coordinate. To give an idea of the variability of the process , in Figure 1 we show the values of for different values of The maximum is and is reached in .
3.2 General case
As happens in many statistical applications, we are able to have a moderately small sample size. However, an erroneous decision can be made if the researcher uses the p-value (or the critical value) obtained through the asymptotic distribution to make the decision in the hypothesis test. Therefore, when we have a sample of size , it is preferable to estimate the p-value (or the critical value) by estimating the distribution of the for this value of Moreover, in our test, the asymptotic distribution is difficult to obtain because we need to conduct several simulations of a centered continuous Gaussian processes indexed in . We then need to calculate the integral in
To calculate the p-value or the critical value of the test for fixed we can proceed as explained in the following lines. Fixed , if is true, we do not know the distribution of but given the observed value from our sample that we call , we could generate, by a permutation procedure, a large sample of with which we can estimate . Given i.i.d. sample of Observe that the distribution of depends of the joint distribution of If is true, and if we consider any permutation of the index set, then the joint distribution of and the joint distribution of are the same. Consider the set of all the permutation Suppose that the sample is fixed and consider defined by with probability for each If we take i.i.d. sample of , we can estimate the value of simply by using for large enough. Define the random variables for . Observe that is distributed as Bin for each Then
[TABLE]
converges as to a.s. If we now consider that are random elements that can take an expected value, and we obtain (using dominated convergence) , then is an asymptotically unbiased estimator of
3.3 A simple method to choose the weight function
The performance of our test depends on the choice of the weight function. The weight function can be chosen by the researcher in each particular case. According to Theorem 4, we can use any function such that where for any . It would be interesting to study some kind of optimality in the choice of the function, under certain kind of alternatives. Consequently, we propose a simple method to chose the function. As will be seen in the next section, this simple choice of , has very good performance under the alternatives studied in this work.
Define , where and are Gaussian densities. In the case of we can use and . The values of and can easily be estimated by the sample with . We can proceed similarly with the election of and for the density . In this way, we give more weight in the neighbourhoods of the average distance between two independent observations and for , and analogously for . Meanwhile, observe that we can avoid the problem of choosing , if we use to test independence because all of the theoretical results obtained in this work for are still valid for .
3.4 Computing the statistic
In this subsection we will see how to calculate the statistic . We will consider the case in which where and are density functions with and their respective distribution functions.
[TABLE]
[TABLE]
[TABLE]
[TABLE]
To simplify the notation and for the rest of this section, we will call We will also index with in the form . Analogously, we use the same indexes as , to the values . We will also call to the order statistics of , and analogously .
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Analogously
[TABLE]
Then
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Then
[TABLE]
where , and are given in the formulas (15), (14) and (16) respectively.
4 A simulation study
In this section we will compare the performance of our test with respect to other recently proposed tests that have good performance. Tables 1 to 6 show the power of our test for different functions and also for other tests, for , and sample sizes. All power calculations that we have considered have been calculated at the significance level of . The calculations were made using (17) and taking as a function of weights where is the density function of a random variable for some values of and , except for the last column, where we take the functions and suggested in Subsection 3.3. We will compare the power of our test with respect to the test proposed in Heller et al. [13] (which we will call HHG), the test of covariance distance proposed in Székely et al. [25] (which we will call DCOV) and the test proposed in Gretton et al. [12] (which we will call HSIC). In Subsection 4.1 we will consider the case in which and are random variables; that is, . Meanwhile, in Subsection 4.2 we consider examples in dimensions greater than two. Lastly, in Subsection 4.3 we simulate discrete and continuous time series for certain alternatives and representspower as a function of sample size. In this case, we take the functions and suggested in Subsection 3.3.
4.1 and are random variables
Table 3 considers Heller et al.’s [13] tests, which are called “Parabola”, “Two parabolas”, “Circle”, “Diamond”, “W-shape” and “Four independent clouds” and which are defined as follows:
Parabola:
Two parabolas: with probability and with probability
Circle: , ,
Diamond: independent, for
W-shape: , independent. and
Four independent clouds: with probability , with probability and with probability , with probability , where are independent.
Observe that in “Four independent clouds”, is true, and the power in all the cases should be around . In all cases, the critical values of our test were calculated through 50000 replications and the power of all of the tests considered from 10000 replications. The first three columns of Table 1 give the power of the HHG, DCV and HSIC tests. Column 4 gives the maximum power among the classic correlation test: Pearson, Spearman and Kendall, which we call PSK. Columns 5, 6 and 7 give the power of our test for different function considered in the weight function . In column 8, we use the function and proposed in Subsection 3.3, analogously in Table 2 and Table 3. Figure 2 give us simulations of the alternatives considered in this subsection.
4.2 and are random vectors
In our test, the distance considered for the calculations of recurrences measures is given for the Euclidean norm. Because the Euclidean distance increases with the dimension, the densities of and were aggregated in the columns 6 and 7. In this subsection, we consider the last two alternatives in Table 3, and in Table 4 of Heller et al. [13], which we will call “Logarithmic”, “Epsilon” and “Quadratic” tests and which are defined as follows:
Logarithmic: where are independent, for
Epsilon: where are independent, for
Quadratic: where are independent, for all
We also add the alternatives considered in Boglioni, which are called “2D-pairwise independent” and are defined as follows:
2D-pairwise independent: independent, where
In all cases, the critical values of our test were calculated through 50000 replications and the power of all of the tests were considered from 10000 replications.
To have an idea of the size of the test for random vectors, we have simulated using and proposed in Subsection 3.3. The power of the test were , and for sample sizes of and , respectively.
4.3 and are time series
In this subsection, we consider the case in which and are time series. In all cases and are time series of length and the power (due to the computational cost) were calculated by a permutation method for replications (Table 7 and Table 8) and replications (Table 9). All the power were calculated using and proposed in Subsection 4.3. The power for different alternatives and sample sizes in the discrete case are given in Table 7. The AR and AR means that the time series is an AR with parameter and , respectively. The case called ARMA, is an ARMA model with parameters and . In column 4 of Table 7, represents a white noise where is the standard deviation of . In Table 7 and Table 8, and are independent white noises with . In Table 8 are given the power for different alternatives and sample sizes in the continuous case. In this table, represents that is a Brownian motion with observed in (at times ) and is a fractional Brownian motion with Hurst parameter . Finally, Table 9 shows the power for cases in which the dependency between and is more difficult to detect. In these cases, is a fractional Ornstein-Uhlenbeck process driven by a () for and , which we call and , respectively. A particular linear combination of , which we call , and whose definition and theoretical developed is found in [17], is a particular case of the models proposed in [2]. Table 9 considers the parameters (column 3) and (column 4). More explicitly, in column 3 (where is a fBm), and in column 4 (where is a fBm). To give an idea of the size of the test, in column 5 is a independent of .
5 Conclusions
In this work we have presented a new test of independence between two random elements lying in metric spaces. Our test is based on percentages of recurrences for which we need, for each sample, only the information obtained by the distance between points. We have obtained the asymptotic distribution of our statistic and we have shown that the limit distribution under contiguous alternatives has a bias. We have also proven the consistency of the test for a wide class of alternatives, which include the particular case in which follows a multivariate normal distribution. The performance of the test measured through the calculation of power through several alternatives has shown very good results, clearly improving on others in many cases for different dimensions of the spaces. In future work, we think that the result can be generalized to the case in which there is some kind of dependence between the observation of the sample. In addition, the work of the simulations should be expanded and deepened.
Acknowledgments
Our gratitude to José Rafael León, Ricardo Fraiman, Ernesto Mordecki and Jorge Graneri for their comments that were very useful in the preparation of this work. Also the editor of the journal and the two anonymous referees for their enriching comments.
6 Proofs
Proof of Lemma 1.
[TABLE]
Observe that as i.i.d, then
[TABLE]
for all such that Therefore
[TABLE]
[TABLE]
Analogously, and Given that and are independent, then
[TABLE]
Thus,
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Decomposing (19) in the terms in which are pairwise different, and has three elements, and using that the random vectors are i.i.d, we obtain that (19) is equal to
[TABLE]
[TABLE]
Analogously
[TABLE]
Similarly, using that the random vectors are i.i.d. and also that and are independent,
[TABLE]
[TABLE]
[TABLE]
With the same technique as in (20) and (21), we obtain
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Therefore
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Putting (20), (21) and (22) in (18), we obtain that (18) is equal to
[TABLE]
[TABLE]
[TABLE]
Then
[TABLE]
[TABLE]
∎
Proof of Lemma 2.
[TABLE]
[TABLE]
[TABLE]
[TABLE]
where
[TABLE]
Then, is equal to
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Now, we decompose
[TABLE]
[TABLE]
[TABLE]
and substituting in (23) we obtain that (23) is equal to
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Observe that (24) it is bounded between [math] and
[TABLE]
∎
Proof of Theorem 3.
[TABLE]
Every continuous function with finit limits as is uniformly continuous. Therefore given , exist such that and for all such that , where and are the distribution functions of and respectively. If is true, consider for each the functions defined by
[TABLE]
where and and consider the family . To simplify the notation, we call throughout the demonstration.
Observe that
[TABLE]
then the process is an process of order
To obtain the convergence, according to Arcones & Giné’s Theorem 4.10, it is enough to prove that
[TABLE]
If then for all and Then , satisfied (6) Thus, , therefore
If , we take such that , then we partition into subintervals of the form such that where is interpreted as Define the following functions
[TABLE]
and
[TABLE]
Observe that for each there exists such that and
Then
[TABLE]
Thus and where and for Also
[TABLE]
Define the sets , then
[TABLE]
[TABLE]
[TABLE]
Analogously,
[TABLE]
putting (27) and (26) in (25) we obtain that
Lastly, observe that the cardinal of and is , then
[TABLE]
∎
Proof of Theorem 4.
[TABLE]
Define Then, exist, such that , thus exist and such that and for all Then, as
[TABLE]
Now, using that we obtain that
[TABLE]
[TABLE]
Thus
[TABLE]
∎
Proof of Corollary 1.
[TABLE]
Because all of the norms in and are equivalent, it is enough to give the proof for the Euclidean norm case. We use that if has centered normal bivariate distribution, then
Let us call and Then
[TABLE]
If and are not independent, then and exist such that then then and are not independent, therefore and are not independent, and then exist and positive numbers such that If we apply this argument for and instead and , then we obtain that
[TABLE]
Lastly, the result follows from Theorem 2. ∎
Proof of Proposition 1.
[TABLE]
[TABLE]
Define then (28) is equal to
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
where for all and is a constant.
[TABLE]
[TABLE]
[TABLE]
Therefore
[TABLE]
[TABLE]
[TABLE]
Then
[TABLE]
as ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Arcones, M. A. and Giné, E. (1993). Limit Theorems for U-Processes. The Annals of Probability Vol 21-3, 1494-1542.
- 2[2] Arratia, A., Cabaña, A. & Cabaña, E., (2016). A construction of Continuous time ARMA models by iterations of Ornstein-Uhlenbeck process, SORT Vol 40 (2) 267-302.
- 3[3] Bakirov, N. K., Rizzo, M. L., and Székely, G. J. (2006). A multivariate non- parametric test of independence. Journal of Multivariate Analysis, 97(8):1742-1756.
- 4[4] Beran, R., Bilodeau, M., and Lafaye de Micheaux, P. (2007). Nonparametric tests of independence between random vectors. Journal of Multivariate Analysis. 98(9):1805–1824.
- 5[5] Bilodeau, M. and Lafaye de Micheaux, P. (2005). A multivariate empirical caracteristic function test of independence with normal marginals. Journal of ultivariate Analysis, 95(2):345–369.
- 6[6] Blomqvist, N. (1950). On a measure of dependence between two random variables. The Annals of Mathematical Statistics 593-600.
- 7[7] Boglioni, G. (2016). A consistent test of independence between random vectors. https://papyrus.bib.umontreal.ca/xmlui/bitstream/handle/1866/18773/Bo glioni_Beaulieu_Guillaume_ 2016_memoire.pdf?sequence=2.
- 8[8] Cabaña, E. M. (1997). Contiguidad, pruebas de ajuste y Procesos Empí ricos Transformados. Décima escuela venezolana de Matemáticas.
