An Independence Test Based on Recurrence Rates

Juan Kalemkerian; Diego Fern\'andez

arXiv:1908.03305·math.ST·August 12, 2019·J. Multivar. Anal.

An Independence Test Based on Recurrence Rates

Juan Kalemkerian, Diego Fern\'andez

PDF

Open Access

TL;DR

This paper introduces a new independence test based on recurrence rates and a Cramér-von Mises type functional, demonstrating strong asymptotic properties and higher power compared to existing tests, applicable to both discrete and continuous time series.

Contribution

The paper presents a novel independence test leveraging recurrence rates and a U-process, with proven asymptotic distribution, consistency, and superior power in various scenarios.

Findings

01

Test shows good behavior under multiple alternatives.

02

Higher power compared to traditional independence tests.

03

Applicable to both discrete and continuous time series.

Abstract

A new test of independence between random elements is presented in this article. The test is based on a functional of the Cram\'{e}r-von Mises type, which is applied to a $U$ -process that is defined from the recurrence rates. Theorems of asymptotic distribution under $H_{0},$ and consistency under a wide class of alternatives are obtained. The results under contiguous alternatives are also shown. The test has a very good behaviour under several alternatives, which shows that in many cases there is clearly larger power when compared to other tests that are widely used in literature. In addition, the new test could be used for discrete or continuous time series.

Tables9

Table 1. Table 1: Power comparison for the different test for sample size of n = 30 𝑛 30 n=30 .

Test	HHG	DCOV	HSIC	PSK	N(1,1)	N(0,1)	N(1,4)	$g_{1}, g_{2}$
Parabola	0.791	0.522	0.733	0.103	0.824	0.831	0.814	0.817
2 parabolas	0.962	0.204	0.849	0.194	1.000	1.000	1.000	1.000
Circle	0.646	0.051	0.488	0.096	0.923	0.716	0.947	0.823
Diamond	0.283	0.030	0.262	0.016	0.422	0.139	0.477	0.395
W-shape	0.908	0.569	0.856	0.179	0.788	0.887	0.782	0.874
4 clouds	0.052	0.053	0.053	0.046	0.052	0.052	0.051	0.051

Table 2. Table 2: Power comparison for the different test for sample size of n = 50 𝑛 50 n=50 .

Test	HHG	DCOV	HSIC	PSK	N(1,1)	N(0,1)	N(1,4)	$g_{1}, g_{2}$
Parabola	0.983	0.854	0.957	0.114	0.979	0.983	1.000	0.975
2 parabolas	1.000	0.354	0.997	0.198	1.000	1.000	1.000	1.000
Circle	0.985	0.075	0.914	0.008	0.999	0.997	1.000	0.995
Diamond	0.664	0.048	0.545	0.013	0.836	0.630	0.884	0.761
W-shape	0.999	0.935	0.988	0.077	0.989	0.998	0.987	0.979
4 clouds	0.050	0.047	0.048	0.046	0.512	0.055	0.054	0.051

Table 3. Table 3: Power comparison for the different test for sample size of n = 80 𝑛 80 n=80 .

Test	HHG	DCOV	HSIC	PSK	N(1,1)	N(0,1)	N(1,4)	$g_{1}, g_{2}$
Parabola	1.000	0.994	1.000	0.105	1.000	1.000	1.000	1.000
2 parabolas	1.000	0.700	1.000	0.201	1.000	1.000	1.000	1.000
Circle	1.000	0.196	0.999	0.004	0.999	1.000	1.000	1.000
Diamond	0.948	0.096	0.853	0.003	0.836	0.953	1.000	0.999
W-shape	1.000	0.999	1.000	0.085	0.988	1.000	1.000	1.000
4 clouds	0.047	0.047	0.047	0.049	0.051	0.049	0.055	0.057

Table 4. Table 4: Power comparison for the different test for sample size of n = 30 𝑛 30 n=30 .

Test	HHG	DCOV	HSIC	N(1,1)	N(1,4)	N(0,4)	N(2,4)	$g_{1}, g_{2}$
Log	0.594	0.154	0.610	0.710	0.759	0.321	0.885	0.813
Epsilon	0.784	0.226	0.484	0.470	0.576	0.194	0.749	0.858
Quadratic	0.687	0.302	0.530	0.197	0.155	0.170	0.147	0.144
2D-indep	0.161	0.175	0.403	0.177	0.264	0.106	0.263	0.112

Table 5. Table 5: Power comparison for the different test for sample size of n = 50 𝑛 50 n=50 .

Test	HHG	DCOV	HSIC	N(1,1)	N(1,4)	N(0,4)	N(2,4)	$g_{1}, g_{2}$
Log	0.936	0.386	0.958	0.998	0.999	1.000	1.000	0.995
Epsilon	0.969	0.298	0.689	0.895	0.967	0.968	0.999	0.984
Quadratic	0.934	0.485	0.904	0.362	0.293	0.315	0.733	0.236
2D-indep	0.27	0.359	0.798	0.281	0.219	0.261	0.198	0.172

Table 6. Table 6: Power comparison for the different test for sample size of n = 80 𝑛 80 n=80 .

Test	HHG	DCOV	HSIC	N(1,1)	N(1,4)	N(0,4)	N(2,4)	$g_{1}, g_{2}$
Log	1.000	0.793	1.000	1.000	1.000	1.000	1.000	1.000
Epsilon	0.999	0.382	0.896	0.998	1.000	1.000	1.000	1.000
Quadratic	0.996	0.725	0.971	0.595	0.545	0.535	0.480	0.416
2D-indep	0.544	0.751	0.993	0.489	0.348	0.466	0.263	0.284

Table 7. Table 7: Power for the case of discrete time series and different sample sizes.

$n$	$X$	$Y = X^{2} + 3 ε$	$Y = \sqrt{\| X \|} + Z$	$Y = ε X$	$Y = ε$
$30$	AR $(0, 1)$	0.350	0.214	0.772	0.051
$50$	AR $(0, 1)$	0.592	0.402	0.962	0.050
$100$	AR $(0, 1)$	0.999	0.698	1.000	0.046
$30$	AR $(0, 9)$	1.000	0.903	1.000	0.035
$50$	AR $(0, 9)$	1.000	0.998	1.000	0.053
$100$	AR $(0, 9)$	1.000	1.000	1.000	0.039
$30$	ARMA $(2, 1)$	0.817	0.323	0.925	0.057
$50$	ARMA $(2, 1)$	0.986	0.566	0.996	0.047
$100$	ARMA $(2, 1)$	1.000	0.921	1.000	0.051

Table 8. Table 8: Power for the case of continuous time series and different sample sizes.

$n$	$X$	$Y = X^{2} + 3 ε$	$Y = \sqrt{\| X \|} + ε$	$Y = ε X + 3 ε^{'}$	$Y = ε$
$30$	$B m$	0.770	0.519	0.402	0.060
$50$	$B m$	0.924	0.752	0.656	0.052
$80$	$B m$	0.994	0.923	0.839	0.040
$30$	$f B m$	0.732	0.550	0.366	0.039
$50$	$f B m$	0.883	0.805	0.586	0.040
$80$	$f B m$	0.987	0.930	0.804	0.051

Table 9. Table 9: Power where the dependence is between a fractional Brownian motion and its associated FOU and FOU(2), for the cases H = 0.5 𝐻 0.5 H=0.5 ( B m 𝐵 𝑚 Bm ) and H = 0.7 𝐻 0.7 H=0.7 ( f B m 𝑓 𝐵 𝑚 fBm ).

$n$	$X$	$Y =$ FOU	$Y =$ FOU $(2)$	$Y = B m$
$30$	$B m$	0.775	0.183	0.053
$50$	$B m$	0.906	0.541	0.046
$80$	$B m$	0.986	0.880	0.056
$30$	$f B m$	0.380	0.106	0.045
$50$	$f B m$	0.516	0.282	0.039
$80$	$f B m$	0.707	0.542	0.042

Equations280

R R_{n}^{X} (r) := \frac{1}{n ^{2} - n} i \neq = j \sum 1_{{d (X_{i}, X_{j}) < r}}

R R_{n}^{X} (r) := \frac{1}{n ^{2} - n} i \neq = j \sum 1_{{d (X_{i}, X_{j}) < r}}

R R_{n}^{Y} (s) := \frac{1}{n ^{2} - n} i \neq = j \sum 1_{{d (Y_{i}, Y_{j}) < s}}

R R_{n}^{Y} (s) := \frac{1}{n ^{2} - n} i \neq = j \sum 1_{{d (Y_{i}, Y_{j}) < s}}

R R_{n}^{X, Y} (r, s) := \frac{1}{n ^{2} - n} i \neq = j \sum 1_{{d (X_{i}, X_{j}) < r, d (Y_{i}, Y_{j}) < s}} .

R R_{n}^{X, Y} (r, s) := \frac{1}{n ^{2} - n} i \neq = j \sum 1_{{d (X_{i}, X_{j}) < r, d (Y_{i}, Y_{j}) < s}} .

R R_{n}^{X} (r) \to a . s . p_{X} (r), R R_{n}^{Y} (s) \to a . s . p_{Y} (s) and R R_{n}^{X, Y} (r, s) \to a . s . p_{X, Y} (r, s) .

R R_{n}^{X} (r) \to a . s . p_{X} (r), R R_{n}^{Y} (s) \to a . s . p_{Y} (s) and R R_{n}^{X, Y} (r, s) \to a . s . p_{X, Y} (r, s) .

E_{n} (r, s) := n (R R_{n}^{X, Y} (r, s) - R R_{n}^{X} (r) R R_{n}^{Y} (s)) .

E_{n} (r, s) := n (R R_{n}^{X, Y} (r, s) - R R_{n}^{X} (r) R R_{n}^{Y} (s)) .

T_{n} := n \int_{0}^{+ \infty} \int_{0}^{+ \infty} (R R_{n}^{X, Y} (r, s) - R R_{n}^{X} (r) R R_{n}^{Y} (s))^{2} d G (r, s)

T_{n} := n \int_{0}^{+ \infty} \int_{0}^{+ \infty} (R R_{n}^{X, Y} (r, s) - R R_{n}^{X} (r) R R_{n}^{Y} (s))^{2} d G (r, s)

I_{m}^{n} := {(i_{1}, ..., i_{m}) : i_{j} \neq = i_{k} for all j \neq = k, and i_{j} \in {1, ..., n} for all j = 1, ..., m} .

I_{m}^{n} := {(i_{1}, ..., i_{m}) : i_{j} \neq = i_{k} for all j \neq = k, and i_{j} \in {1, ..., n} for all j = 1, ..., m} .

n \to + \infty lim COV (E_{n} (r, s), E_{n} (r^{'}, s^{'})) =

n \to + \infty lim COV (E_{n} (r, s), E_{n} (r^{'}, s^{'})) =

4 (p_{X}^{(3)} (r \land r^{'}) - p_{X} (r) p_{X} (r^{'})) (p_{Y}^{(3)} (s \land s^{'}) - p_{Y} (s) p_{Y} (s^{'})) .

4 (p_{X}^{(3)} (r \land r^{'}) - p_{X} (r) p_{X} (r^{'})) (p_{Y}^{(3)} (s \land s^{'}) - p_{Y} (s) p_{Y} (s^{'})) .

E_{n}^{'} (r, s) := \frac{n}{n ( n - 1 ) ( n - 2 ) ( n - 3 )} \times

E_{n}^{'} (r, s) := \frac{n}{n ( n - 1 ) ( n - 2 ) ( n - 3 )} \times

(i, j, k, h) \in I_{4}^{n} \sum (1_{{d (X_{i}, X_{j}) < r, d (Y_{i}, Y_{j}) < s}} - 1_{{d (X_{i}, X_{j}) < r, d (Y_{h}, Y_{k}) < s}}) .

(i, j, k, h) \in I_{4}^{n} \sum (1_{{d (X_{i}, X_{j}) < r, d (Y_{i}, Y_{j}) < s}} - 1_{{d (X_{i}, X_{j}) < r, d (Y_{h}, Y_{k}) < s}}) .

E_{n} (r, s) = n (R R_{n}^{X, Y} (r, s) - R R_{n}^{X} (r) R R_{n}^{Y} (s)) = E_{n}^{'} (r, s) - H_{n} (r, s)

E_{n} (r, s) = n (R R_{n}^{X, Y} (r, s) - R R_{n}^{X} (r) R R_{n}^{Y} (s)) = E_{n}^{'} (r, s) - H_{n} (r, s)

0 \leq H_{n} (r, s) \leq \frac{4}{n} for all r, s > 0.

0 \leq H_{n} (r, s) \leq \frac{4}{n} for all r, s > 0.

U_{m}^{n} (f) = \frac{( n - m ) !}{m !} (i_{1}, ..., i_{m}) \in I_{m}^{n} \sum f (X_{i_{1},} X_{i_{2}}, ..., X_{i_{m}})

U_{m}^{n} (f) = \frac{( n - m ) !}{m !} (i_{1}, ..., i_{m}) \in I_{m}^{n} \sum f (X_{i_{1},} X_{i_{2}}, ..., X_{i_{m}})

f \in F, exists l_{f} \in L and u_{f} \in U where l_{f} \leq f \leq u_{f} a . s . and E (u_{f} - l_{f})^{2} < ε^{2} .

f \in F, exists l_{f} \in L and u_{f} \in U where l_{f} \leq f \leq u_{f} a . s . and E (u_{f} - l_{f})^{2} < ε^{2} .

N_{[]}^{(2)} (ε, F, P^{m}) = min {υ : (\ref e n t r o p ia) holds} .

N_{[]}^{(2)} (ε, F, P^{m}) = min {υ : (\ref e n t r o p ia) holds} .

\int_{0}^{+ \infty} (lo g N_{[]}^{(2)} (ε, F, P^{m}))^{1/2} d ε < + \infty

\int_{0}^{+ \infty} (lo g N_{[]}^{(2)} (ε, F, P^{m}))^{1/2} d ε < + \infty

L (n (U_{m}^{n} - P^{m}) f) \to w L (m G_{p} \circ P^{m - 1} f) in l^{\infty} (F)

L (n (U_{m}^{n} - P^{m}) f) \to w L (m G_{p} \circ P^{m - 1} f) in l^{\infty} (F)

{E_{n} (r, s) - E (E_{n} (r, s))}_{r, s > 0} w {E (r, s)}_{r, s > 0}

{E_{n} (r, s) - E (E_{n} (r, s))}_{r, s > 0} w {E (r, s)}_{r, s > 0}

n (R R_{n}^{X, Y)} (r, s) - R R_{n} (r) R R_{n}^{Y} (s)) \to w N (0, σ_{X, Y}^{2} (r, s))

n (R R_{n}^{X, Y)} (r, s) - R R_{n} (r) R R_{n}^{Y} (s)) \to w N (0, σ_{X, Y}^{2} (r, s))

σ_{X, Y}^{2} (r, s) = 4 (\int_{- \infty}^{+ \infty} (ϕ (x + r) - ϕ (x - r))^{2} φ (x) d x - (2 ϕ (r / 2) - 1)^{2}) \times

σ_{X, Y}^{2} (r, s) = 4 (\int_{- \infty}^{+ \infty} (ϕ (x + r) - ϕ (x - r))^{2} φ (x) d x - (2 ϕ (r / 2) - 1)^{2}) \times

(\int_{- \infty}^{+ \infty} (ϕ (x + s) - ϕ (x - s))^{2} φ (x) d x - (2 ϕ (s / 2) - 1)^{2}) .

(\int_{- \infty}^{+ \infty} (ϕ (x + s) - ϕ (x - s))^{2} φ (x) d x - (2 ϕ (s / 2) - 1)^{2}) .

α (r, s) := P (∣ X_{1} - X_{2} ∣ \leq r, ∣ Y_{1} - Y_{2} ∣ \leq s) =

α (r, s) := P (∣ X_{1} - X_{2} ∣ \leq r, ∣ Y_{1} - Y_{2} ∣ \leq s) =

\iint_{R^{2}} f_{X, Y} (x_{1}, y_{1}) d x_{1} d y_{1} \int_{x_{1} - r}^{x_{1} + r} d x_{2} \int_{y_{1} - s}^{y_{1} + s} f_{X, Y} (x_{2}, y_{2}) d y_{2} =

\iint_{R^{2}} f_{X, Y} (x_{1}, y_{1}) d x_{1} d y_{1} \int_{x_{1} - r}^{x_{1} + r} d x_{2} \int_{y_{1} - s}^{y_{1} + s} f_{X, Y} (x_{2}, y_{2}) d y_{2} =

\iint_{R^{2}} P (x_{1} - r \leq X_{1} \leq x_{1} + r, y_{1} - s \leq Y_{2} \leq y_{1} + s) f_{X, Y} (x_{1}, y_{1}) d x_{1} d y_{1} =

\iint_{R^{2}} P (x_{1} - r \leq X_{1} \leq x_{1} + r, y_{1} - s \leq Y_{2} \leq y_{1} + s) f_{X, Y} (x_{1}, y_{1}) d x_{1} d y_{1} =

E (F (X + r, Y + s) - F (X + r, Y - s) - F (X - r, Y + s) + F (X - r, Y - s)) .

E (F (X + r, Y + s) - F (X + r, Y - s) - F (X - r, Y + s) + F (X - r, Y - s)) .

β (r, s) := P (∣ X_{1} - X_{2} ∣ \leq r) P (∣ Y_{1} - Y_{2} ∣ \leq s) =

β (r, s) := P (∣ X_{1} - X_{2} ∣ \leq r) P (∣ Y_{1} - Y_{2} ∣ \leq s) =

E (F_{X} (X + r) - F_{X} (X - r)) E (F_{Y} (Y + r) - F_{Y} (Y - r)) .

E (F_{X} (X + r) - F_{X} (X - r)) E (F_{Y} (Y + r) - F_{Y} (Y - r)) .

H_{0} : f_{X, Y} (x, y) = f_{X} (x) f_{Y} (y) for all (x, y)

H_{0} : f_{X, Y} (x, y) = f_{X} (x) f_{Y} (y) for all (x, y)

H_{n} : f_{X, Y} (x, y) = f_{X, Y}^{(n)} (x, y) for all (x, y)

H_{n} : f_{X, Y} (x, y) = f_{X, Y}^{(n)} (x, y) for all (x, y)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematical Dynamics and Fractals · Bayesian Methods and Mixture Models · Financial Risk and Volatility Modeling

Full text

An Independence Test Based on Recurrence Rates

Juan Kalemkerian

Universidad de la República, Facultad de Ciencias

Diego Fernández

Universidad de la República, Facultad de Ciencias Económicas

y Administración

Abstract

A new test of independence between random elements is presented in this article. The test is based on a functional of the Cramér-von Mises type, which is applied to a $U$ -process that is defined from the recurrence rates. Theorems of asymptotic distribution under $H_{0},$ and consistency under a wide class of alternatives are obtained. The results under contiguous alternatives are also shown. The test has a very good behaviour under several alternatives, which shows that in many cases there is clearly larger power when compared to other tests that are widely used in literature. In addition, the new test could be used for discrete or continuous time series.

**Keywords: ** independence tests, recurrence rates, U-process. 62H15, 62H20

1 Introduction

Let $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)$ i.i.d. sample of $\left(X,Y\right),$ $X\in S_{X}$ and $Y\in S_{Y}$ , where $S_{X}$ and $S_{Y}$ are metric spaces. When we have the following hypothesis test: $H_{0}:$ $X$ and $Y$ are independent random elements, we are under the so called independent tests. The independence tests have been developed in the first instance for the $S_{X}=S_{Y}=\mathbb{R}$ case, based on the pioneering work of Galton [10] and Pearson [23] (this is the famous correlation test, which is widely used today). The limitations of this hypothesis test are well known and they have motivated several different proposals in this topic, such as the classical rank test (e.g. Spearman,[24], Kendall, [19] or Blomqvist, [6]). Another classic and intuitive result can be found in Hoeffding [15], where the test statistic is defined by $\int\int\left(F_{X,Y}(x,y)-F_{X}(x)F_{Y}(y)\right)^{2}dF_{X,Y}(x,y)$ , although it is not widely used. Independence between random vectors is addressed for the first time in Wilks [27]. Genest and Rémillard [16] propose a test based on copulas for continuous random variables. Kojadinovic and Holmes [18], generalize this result for random vectors using a Cramér-von Mises type statistic. Bilodeau and Lafaye de Micheaux [5], propose a test of independence between random vectors, each of which has a normal marginal distribution. Continuing in some sense this work, Beran et al. [4] propose a universally consistent test for random vectors, from empirical multidimensional distributions. Gretton et al. [12] propose a universally consistent test based on Hilbert-Schmidt norms. Another consistent test is proposed by Székely et al. [25, 26], which defines the concept of distance covariance. This test has its origin in [3] and it has since become very popular. It has been used and has had a considerable impact from the moment that it was proposed. More recently, Heller et al. [13] propose a test that in many cases has much more powerfull than the distance covariance test. In his monograph, Boglioni [7] compares several alternatives of these tests by means of intense work of power calculations. Because the tests proposed in Beran et al. [4] and Heller et al. [13] have very good performance under several alternatives, in Section 4 we will compare them with the test that we propose in our work.

Starting from another point of view, Eckman et al. [9] introduce the recurrence plot (RP). This is a very important graphical tool to understand the dynamics of a time series in high dimension. Eckman et al.’s [9] generated an appreciable amount of work and is currently applied in many different areas in which mathematical models are used, whether probabilistic or deterministic. The RP is a graphical tool that shows the recurrence in a time series $\left(X\right)$ and it is constructed using the recurrence matrix $RM\left(X\right)$ as defined by $RM_{ij}\left(X\right)=\mathbf{1}_{\left\{\left\|X_{i}-X_{j}\right\|<r\right\}}$ , where $r$ is an appropriate parameter. The objective of this tool is to determine the patterns in a time series. The choice of $r$ is a key point to detect patterns and several suggestions have been made on how to appropriately find it. Marwan [21] gives a historical review of recurrence plots techniques, together with everything developed from them. However, the potential of these techniques has not yet been studied in depth from the point of view of mathematical statistics.

The main objective of this article is to propose a hypothesis test to detect dependence between two random elements, $X$ and $Y$ , based on recurrence rates by using the information of $\mathbf{1}_{\left\{d\left(X_{i},X_{j}\right)<r\right\}}$ and $\mathbf{1}_{\left\{d\left(Y_{i},Y_{j}\right)<s\right\}}$ for any values of $r$ and $s.$ One advantage of our test is that instead of choosing appropriate values of $r$ and $s$ , we use the information generated by both samples for all of the possible values of $r$ and $s$ . In our test, $X$ and $Y$ can take values in any metric space. Therefore, our test can be used to test if $X$ and $Y$ are independent in the case where $X$ and $Y$ are random variables, random vectors or time series. We can then replace the norms by distances.

The rest of this paper is organized as follows. In Section 2, we give the definitions of recurrence rates for $X,$ for $Y$ and for joint $\left(X,Y\right)$ and we propose the statistical procedure to make the decision between $H_{0}$ vs $H_{1}.$ The statistics are based on a functional of the Cramér von-Mises type applied to a $U$ -process defined from the recurrence rates of $X,$ $Y$ and $\left(X,Y\right).$ We also give the theoretical results, which are the asymptotic distribution and consistency of the test statistic (Subsection 2.1), and the behavior under contiguous alternatives (Subsection 2.2). In Section 3, we describe how the test can be implemented, including a formula to obtain the statistic for the test. In Section 4, we use simulations to show the performance of the test against others by power comparison in the cases where $X$ and $Y$ are random variables or random vectors. We also compute power in the case where $X$ and $Y$ are discrete and continuous time series. Like Heller et al.’s [13] test, our test is based on distances between the elements of the sample. Likewise, our test had very good performance under several alternatives. Our concluding remarks are given in Section 5. Appendix gives the proofs of the results that are established in Section 2.

2 Test approach and theoretical results

Given $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)$ i.i.d. sample of $\left(X,Y\right)$ where $X\in S_{X},$ $Y\in S_{Y}$ where $S_{X}$ and $S_{Y}$ are metric spaces, and given $r,s>0.$ To simplify the notation and without risk of confusion, we will use the same letter $d$ for the distance function in both metric spaces $S_{X}$ and $S_{Y}$ .

We define the recurrence rate for the sample of $X$ and $Y$ as

[TABLE]

respectively, and the joint recurrence rate for $\left(X,Y\right)$ as

[TABLE]

We define $p_{X}(r):=P\left(d\left(X_{1},X_{2}\right)<r\right)$ the probability that the distance between any two elements of the sample $X$ is less than $r.$ Similarly, we define the probability between three points as $p_{X}^{\left(3\right)}(r):=P\left(d\left(X_{1},X_{2}\right)<r,\text{ }d\left(X_{1},X_{3}\right)<r\right)$ and analogously $p_{Y}$ and $p_{Y}^{\left(3\right)}.$

We also need to define $p_{X,Y}(r,s):=P\left(d\left(X_{1},X_{2}\right)<r,\text{ }d\left(Y_{1},Y_{2}\right)<s\right).$

The strong law of large numbers for $U$ -statistics ([14]) allows us to affirm that for any $r,s>0$ ,

[TABLE]

We want to test $H_{0}:$ $X$ and $Y$ are independent, against $H_{1}:$ $H_{0}$ does not hold.

If $H_{0}$ is true, then $p_{X,Y}(r,s)=p_{X}(r)p_{Y}(s)$ for all $r,s>0$ , and we expect that if $n$ is large, $RR_{n}^{X,Y}(r,s)\cong RR_{n}^{X}(r)RR_{n}^{Y}(s)$ for any $r,s>0.$ Then, we propose to build the test statistic, to work with the process $\{E_{n}(r,s)\}_{r,s>0}$ where

[TABLE]

Therefore, it is natural to reject $H_{0}$ when $T_{n}>c$ where

[TABLE]

where $c$ is a constant and $G$ is a distribution function.

Throughout this work, we use the notation $\phi$ and $\varphi$ for distribution and density function of $N(0,1)$ random variable respectively, and for each $m$ , the set

[TABLE]

Now we will formulate the asymptotic results of our test statistic. First, we will show a result that guarantees the asymptotic distribution of $T_{n}$ under $H_{0}$ . We will also present a result that establishes a consistency of our test under a wide class of alternatives. Second, we will analyze the asymptotic bias when we consider contiguous alternatives.

2.1 Asymptotic results under $H_{0}$ and consistency

We start with the next lemma, in which we obtain the formula for the asymptotic autocovariance function of the process $\{E_{n}(r,s)\}_{r,s>0}$ under $H_{0}$ .

Lemma 1.

Given $r,r^{\prime},s,s^{\prime}>0,$ and $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)$ i.i.d. in $S_{X}\times S_{Y}$ where $X$ and $Y$ are independent, then

[TABLE]

The following lemma will be useful to reduce asymptotic convergence of the process $\{E_{n}(r,s)\}_{r,s>0}$ to the convergence of an approximate $U-$ process that we will call $\{E_{n}^{\prime}(r,s)\}_{r,s>0}$ and is defined as follows

[TABLE]

Lemma 2.

Given $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)$ i.i.d. in $S_{X}\times S_{Y}$ , then

[TABLE]

where

[TABLE]

To obtain the weak convergence of the process $\{E_{n}(r,s)-\mathbb{E}(E_{n}(r,s))\}_{r,s>0}$ to a centered Gaussian process (therefore the asymptotic distribution of the statistics $T_{n}$ defined in (3) is determined), we will use Theorem 4.10 obtained by Arcones & Giné [1]:

Let $\left(S,\mathcal{S},P\right)$ be a probability space, and for all $i\in\mathbb{N}$ , $X_{i}:S\rightarrow S$ are i.i.d. sequence with $\mathcal{L}\left(X_{i}\right)=P.$ Given $m$ , let $\mathcal{F}$ be a class of measurable functions on $S^{m},$ the $U$ -process based on $P$ and indexed by $\mathcal{F}$ is

[TABLE]

where $f\in\mathcal{F}$ .

Given $\varepsilon>0$ , assume that exists $\mathcal{L}=\left\{l_{1},l_{2},...,l_{v}\right\}$ , $\mathcal{U}=\left\{u_{1},u_{2},...,u_{\upsilon}\right\}$ such that $\mathcal{L},\mathcal{U}\subset L^{2}$ and for all

[TABLE]

**Theorem (Arcones $\&$ Giné 1993) **

[TABLE]

If

[TABLE]

then

[TABLE]

*where $G_{P}$ is the Brownian bridge associated with $P.$ *

Convergence in the space $l^{\infty}\left(\mathcal{F}\right)$ , is in the sense of Hoffmann-Jørgensen, see ([11]).

Theorem 3.

Given $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)$ i.i.d. in $S_{X}\times S_{Y}.$ If the distribution functions of $d(X_{1},X_{2})$ and $d(Y_{1},Y_{2})$ are continuous, then

[TABLE]

where $\{E(r,s)\}_{r,s>0}$ is a centered Gaussian process.

Remark 1.

Observe that our process $\{E_{n}(r,s)\}_{r,s>0}$ lies in $L^{2}(dG)$ (because $G$ is a probability measure). Therefore, our test statistic $T_{n}$ is $\left||\{E(r,s)\}_{r,s>0}\right||$ , thus, the functional is continuous.

Remark 2.

Given $r,s>0$ and $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)\in\mathbb{R}^{2}$ i.i.d. sample of $\left(X,Y\right)$ where the marginals $X,Y$ are $N\left(0,1\right)$ independent. Then

[TABLE]

where

[TABLE]

If $d(X_{1},X_{2})$ and $d(Y_{1},Y_{2})$ are not independent, then our test is consistent.

Theorem 4.

Given $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)$ i.i.d. in $S_{X}\times S_{Y}$ . If $dG(r,s)=g(r,s)drds$ , $g(r,s)>0$ for all $r,s>0,$ and $d\left(X_{1},X_{2}\right)$ , $d\left(Y_{1},Y_{2}\right)$ are continuous and not independent random variables, then $T_{n}\overset{P}{\rightarrow}+\infty$ as $n\rightarrow+\infty.$

The next corollary follows from Theorem 4.

Corollary 1.

If $\left(X,Y\right)\sim N\left(0,\Sigma\right)$ , where $X$ and $Y$ are not independent, and $dG(r,s)=g(r,s)drds$ , $g(r,s)>0$ for all $r,s>0,$ then $T_{n}\overset{P}{\rightarrow}+\infty$ as $n\rightarrow+\infty$ .

Remark 3.

Consider $\left(X_{1},Y_{1}\right)$ , $\left(X_{2},Y_{2}\right)$ in $\mathbb{R}^{2}$ i.i.d. with joint density $f_{X,Y}$ and joint distribution $F$ such that $\left|X_{1}-X_{2}\right|$ and $\left|Y_{1}-Y_{2}\right|$ are independent.

Then

[TABLE]

Similarly,

[TABLE]

Then, $\alpha\left(r,s\right)=\beta\left(r,s\right)$ for all $r,s>0.$

Of course, it could happen that condition $\alpha\left(r,s\right)=\beta\left(r,s\right)$ for all $r,s>0$ is fulfilled, and nevertheless $X$ and $Y$ are not independent. This is the restricted type of distributions that do not satisfy the conditions of our consistency theorem.

2.2 Contiguous alternatives

In this subsection we will analyze the behavior of this test under contiguous alternatives.

More explicitly, given $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)$ i.i.d. in $\mathbb{R}^{p}\times\mathbb{R}^{q}$ , consider

[TABLE]

(i.e. $X$ and $Y$ are independent), vs

[TABLE]

where $f_{X,Y}^{(n)}(x,y)=c_{n}\left(\delta\right)f_{X}(x)f_{Y}(y)\left(1+\frac{\delta}{2\sqrt{n}}k_{n}(x,y)\right)^{2},$ $\delta>0,$ $c_{n}\left(\delta\right)$ is a constant such that $f_{X,Y}^{(n)}(x,y)$ be a density, and the functions $k_{n}$ verify the conditions (i) and (ii) that are given below:

Define $L_{0}^{2}=L^{2}\left(dF_{0}\right)$ for $dF_{0}(x,y)=f_{X}(x)f_{Y}(y)dxdy$ , the distribution function of $\left(X,Y\right)$ under $H_{0},$ analogously define $L_{0}^{1}.$

(i)

Exists a function $K\in L_{0}^{1}$ such that $k_{n}\leq K$ for all $n$

(ii)

Exists $k\in L_{0}^{2}$ such that $k_{n}\overset{L_{0}^{2}}{\rightarrow}k$ , $\left\|k\right\|=1.$

It can be proven that conditions (i) and (ii) imply contiguity (Cabaña [8]).

The $\delta$ coefficient is introduced so that $\left\|k\right\|=1.$ The function $\delta k$ is called asymptotic drift.

We will show in the following lines that under $H_{n},$ the process $\left\{E_{n}(r,s)\right\}_{r,s>0}$ has the same asymptotic limit as under $H_{0}$ plus a deterministic drift.

We use the notation $\mathbb{E}^{\left(n\right)}\left(T\right)$ and $P^{\left(n\right)}\left(\left(X,Y\right)\in A\right)$ for the expectation value of $T$ , and the probability of the set $\left\{\left(X,Y\right)\in A\right\}$ under $H_{n}$ respectively. Analogously we use $\mathbb{E}^{\left(0\right)}\left(T\right)$ and $P^{\left(0\right)}\left(\left(X,Y\right)\in A\right)$ under $H_{0}.$

Proposition 1.

[TABLE]

Under $H_{n}$

[TABLE]

where $\mu(r,s)=$

[TABLE]

and $A_{r,s}:=\left\{\left(x_{1},y_{1},x_{2},y_{2}\right)\in\mathbb{R}^{2p+2q}:\text{ }d(x_{1},x_{2})<r,\text{ }d(y_{1},y_{2})<s\right\}.$

With a little more work, using the Le Cam third lemma (Le Cam & Yang, [20] and Oosterhoff & Van Zwet, [22]) it is possible to prove that under $H_{n}$ ,

[TABLE]

where $\left\{E(r,s)\right\}_{r,s>0}$ is the limit process under $H_{0}$ and $\mu\left(r,s\right)=$

[TABLE]

Therefore, under $H_{n}$

[TABLE]

3 Implementation of the test

3.1 $X$ and $Y$ are random variables

In the case where $X$ and $Y$ are continuous random variables, we observe that $X$ and $Y$ are independent; it is equivalent to say that $X^{\prime}=\phi^{-1}\left(F_{X}\left(X\right)\right)$ and $Y^{\prime}=\phi^{-1}\left(F_{Y}\left(Y\right)\right)$ are independent, where $F_{X}$ and $F_{Y}$ are the distribution functions of $X$ and $Y$ , respectively. If we apply the test procedure to $X^{\prime}$ and $Y^{\prime}$ , then we have the advantage that now the variables are on the same scale and each has a normal centered distribution that approximates to the hypotheses of Remark 2. In addition, in this case the formula (11) for $\sigma_{X^{\prime},Y^{\prime}}^{2}(r,s)$ is completely determined. Another additional advantage is that under $H_{0}$ ( $X^{\prime}$ and $Y^{\prime}$ are independent and $N\left(0,1\right)$ ), for small values of $n$ , we can calculate the critical values at $5\%$ or another level because we will know the distribution of $T_{n}$ under $H_{0}.$ Where $X$ and $Y$ are random vectors, the same transformation can be applied in each coordinate. To give an idea of the variability of the process $\{E_{n}(r,s)\}_{r,s>0}$ , in Figure 1 we show the values of $\sigma_{X^{\prime},Y^{\prime}}^{2}(r,r)$ for different values of $r.$ The maximum is $0.06409$ and is reached in $r=1.3488$ .

3.2 General case

As happens in many statistical applications, we are able to have a moderately small sample size. However, an erroneous decision can be made if the researcher uses the p-value (or the critical value) obtained through the asymptotic distribution to make the decision in the hypothesis test. Therefore, when we have a sample of size $n$ , it is preferable to estimate the p-value (or the critical value) by estimating the distribution of the $T_{n}$ for this value of $n.$ Moreover, in our test, the asymptotic distribution is difficult to obtain because we need to conduct several simulations of a centered continuous Gaussian processes indexed in $D=\left(0,+\infty\right)\times\left(0,+\infty\right)$ . We then need to calculate the integral in $D.$

To calculate the p-value or the critical value of the test for fixed $n$ we can proceed as explained in the following lines. Fixed $n$ , if $H_{0}$ is true, we do not know the distribution of $T_{n},$ but given the observed value from our sample that we call $t_{obs}$ , we could generate, by a permutation procedure, a large sample of $T_{n}$ with which we can estimate $P\left(T_{n}\geq t_{obs}\right)$ . Given $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)$ i.i.d. sample of $\left(X,Y\right).$ Observe that the distribution of $T_{n}$ depends of the joint distribution of $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right).$ If $H_{0}$ is true, and if we consider any $\sigma:\left\{1,2,3,...,n\right\}\rightarrow\left\{1,2,3,...,n\right\}$ permutation of the index set, then the joint distribution of $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)$ and the joint distribution of $\left(X_{\sigma(1)},Y_{1}\right),\left(X_{\sigma(2)},Y_{2}\right),...,\left(X_{\sigma(n)},Y_{n}\right)$ are the same. Consider $S\left(n\right)=\left\{\sigma_{1},\sigma_{2},...,\sigma_{n!}\right\}$ the set of all the permutation $\sigma:\left\{1,2,...,n\right\}\rightarrow\left\{1,2,...,n\right\}.$ Suppose that the sample $\left(X_{1},Y_{1}\right),...,\left(X_{n},Y_{n}\right)$ is fixed and consider $Z$ defined by $Z=T_{n}\left(\left(X_{\sigma_{i}(1)},Y_{1}\right),...,\left(X_{\sigma_{i}(n)},Y_{n}\right)\right)$ with probability $1/n!$ for each $i=1,2,...,n!.$ If we take $Z_{1},Z_{2},...,Z_{m}$ i.i.d. sample of $Z$ , we can estimate the value of $p_{n}=P\left(T_{n}\geq t_{obs}\right)$ simply by using $\widehat{p}_{n}^{\left(m\right)}=\frac{1}{m}\sum_{i=1}^{m}\mathbf{1}_{\left\{Z_{i}\geq t_{obs}\right\}}$ for $m$ large enough. Define the random variables $B_{i}=\sum_{j=1}^{m}\mathbf{1}_{\left\{Z_{j}=T_{n}\left(\left(X_{\sigma_{i}(1)},Y_{1}\right),...,\left(X_{\sigma_{i}(n)},Y_{n}\right)\right)\right\}}.$ for $i=1,2,,...,n!$ . Observe that $B_{i}$ is distributed as Bin $\left(m,1/n!\right)$ for each $i=1,2,...,n!.$ Then

[TABLE]

converges as $m\rightarrow+\infty$ to $\frac{1}{n!}\sum_{i=1}^{n!}\mathbf{1}_{\left\{T_{n}\left(\left(X_{\sigma_{i}(1)},Y_{1}\right),...,\left(X_{\sigma_{i}(n)},Y_{n}\right)\right)\geq t_{obs}\right\}}$ a.s. If we now consider that $\left(X_{1},Y_{1}\right),...,\left(X_{n},Y_{n}\right)$ are random elements that can take an expected value, and we obtain (using dominated convergence) $\mathbb{E}\left(\widehat{p}_{n}^{\left(m\right)}\right)\underset{m\rightarrow+\infty}{\rightarrow}p_{n}$ , then $\widehat{p}_{n}^{\left(m\right)}$ is an asymptotically unbiased estimator of $p_{n}.$

3.3 A simple method to choose the weight function

The performance of our test depends on the choice of the weight function. The weight function can be chosen by the researcher in each particular case. According to Theorem 4, we can use any function $G$ such that $dG(r,s)=g(r,s)drds$ where $g(r,s)>0$ for any $r,s>0$ . It would be interesting to study some kind of optimality in the choice of the $G$ function, under certain kind of alternatives. Consequently, we propose a simple method to chose the $G$ function. As will be seen in the next section, this simple choice of $G$ , has very good performance under the alternatives studied in this work.

Define $dG(r,s)=g_{1}(r)g_{2}(s)drds$ , where $g_{1}$ and $g_{2}$ are Gaussian densities. In the case of $g_{1}$ we can use $\mu_{1}=\mathbb{E}\left(d(X_{1},X_{2})\right)$ and $\sigma_{1}^{2}=\mathbb{V}\left(d(X_{1},X_{2})\right)$ . The values of $\mu_{1}$ and $\sigma_{1}$ can easily be estimated by the sample $d\left(X_{i},X_{j}\right)$ with $(i,j)\in I_{2}^{n}$ . We can proceed similarly with the election of $\mu_{2}$ and $\sigma_{2}$ for the density $g_{2}$ . In this way, we give more weight in the neighbourhoods of the average distance between two independent observations $X_{1}$ and $X_{2}$ for $g_{1}$ , and analogously for $g_{2}$ . Meanwhile, observe that we can avoid the problem of choosing $G$ , if we use $T^{\prime}_{n}=\sqrt{n}\sup_{r,s>0}\left|RR_{n}^{X,Y}(r,s)-RR_{n}^{X}(r)RR_{n}^{Y}(s)\right|$ to test independence because all of the theoretical results obtained in this work for $T_{n}$ are still valid for $T^{\prime}_{n}$ .

3.4 Computing the statistic

In this subsection we will see how to calculate the statistic $T_{n}$ . We will consider the case in which $dG(r,s)=g_{1}(r)g_{2}(s)drds$ where $g_{1}$ and $g_{2}$ are density functions with $G_{1}$ and $G_{2}$ their respective distribution functions.

[TABLE]

To simplify the notation and for the rest of this section, we will call $N=n(n-1).$ We will also index $d\left(X_{i},X_{j}\right)$ with $(i,j)\in I_{2}^{n}$ in the form $Z_{1},Z_{2},...,Z_{N}$ . Analogously, we use the same indexes as $Z^{\prime}s$ , $T_{1},T_{2},...,T_{N}$ to the values $d\left(Y_{i},Y_{j}\right)$ . We will also call $Z_{1}^{\ast},Z_{2}^{\ast},...,Z_{N}^{\ast}$ to the order statistics of $Z^{\prime}s$ , and analogously $T_{1}^{\ast},T_{2}^{\ast},...,T_{N}^{\ast}$ .

[TABLE]

Analogously

[TABLE]

Then

[TABLE]

Then

[TABLE]

where $A_{n}$ , $B_{n}$ and $C_{n}$ are given in the formulas (15), (14) and (16) respectively.

4 A simulation study

In this section we will compare the performance of our test with respect to other recently proposed tests that have good performance. Tables 1 to 6 show the power of our test for different functions $G$ and also for other tests, for $n=30$ , $n=50$ and $n=80$ sample sizes. All power calculations that we have considered have been calculated at the significance level of $5\%$ . The calculations were made using (17) and taking as a function of weights $dG(r,s)=g_{1}(r)g_{2}(s)drds$ where $g_{1}=g_{2}=g$ is the density function of a $N(\mu,\sigma^{2})$ random variable for some values of $\mu$ and $\sigma^{2}$ , except for the last column, where we take the functions $g_{1}$ and $g_{2}$ suggested in Subsection 3.3. We will compare the power of our test with respect to the test proposed in Heller et al. [13] (which we will call HHG), the test of covariance distance proposed in Székely et al. [25] (which we will call DCOV) and the test proposed in Gretton et al. [12] (which we will call HSIC). In Subsection 4.1 we will consider the case in which $X$ and $Y$ are random variables; that is, $(X,Y)\in\mathbb{R}^{2}$ . Meanwhile, in Subsection 4.2 we consider examples in dimensions greater than two. Lastly, in Subsection 4.3 we simulate discrete and continuous time series for certain alternatives and representspower as a function of sample size. In this case, we take the functions $g_{1}$ and $g_{2}$ suggested in Subsection 3.3.

4.1 $X$ and $Y$ are random variables

Table 3 considers Heller et al.’s [13] tests, which are called “Parabola”, “Two parabolas”, “Circle”, “Diamond”, “W-shape” and “Four independent clouds” and which are defined as follows:

Parabola: $X\sim U\left(-1,1\right),$ $Y=\left(X^{2}+U\left(0,1\right)\right)/2.$

Two parabolas: $X\sim U\left(-1,1\right),$ $Y=\left(X^{2}+U\left(0,1\right)/2\right)$ with probability $1/2$ and $Y=-\left(X^{2}+U\left(0,1\right)/2\right)$ with probability $1/2.$

Circle: $U\sim U\left(-1,1\right)$ , $X=\sin\left(\pi U\right)+N\left(0,1\right)/8$ , $Y=\cos\left(\pi U\right)+N\left(0,1\right)/8.$

Diamond: $U_{1},U_{2}\sim U\left(-1,1\right)$ independent, $X=\sin\left(\theta\right)U_{1}+\cos\left(\theta\right)U_{2},$ $Y=-\sin\left(\theta\right)U_{1}+\cos\left(\theta\right)U_{2}$ for $\theta=\pi/4.$

W-shape: $U\sim U(-1,1)$ , $U_{1},U_{2}\sim U(0,1)$ independent. $X=U+U_{1}/3$ and $Y=4\left(U^{2}-1/2\right)^{2}+U_{2}/n.$

Four independent clouds: $X=1+Z_{1}/3$ with probability $1/2$ , $X=-1+Z_{2}/3$ with probability $1/2$ and $Y=1+Z_{3}/3$ with probability $1/2$ , $Y=-1+Z_{4}/3$ with probability $1/2$ , where $Z_{1},Z_{2},Z_{3},Z_{4}\sim N(0,1)$ are independent.

Observe that in “Four independent clouds”, $H_{0}$ is true, and the power in all the cases should be around $0.05$ . In all cases, the critical values of our test were calculated through 50000 replications and the power of all of the tests considered from 10000 replications. The first three columns of Table 1 give the power of the HHG, DCV and HSIC tests. Column 4 gives the maximum power among the classic correlation test: Pearson, Spearman and Kendall, which we call PSK. Columns 5, 6 and 7 give the power of our test for different $g=g_{1}=g_{2}$ function considered in the weight function $G$ . In column 8, we use the function $g_{1}$ and $g_{2}$ proposed in Subsection 3.3, analogously in Table 2 and Table 3. Figure 2 give us $n=1000$ simulations of the alternatives considered in this subsection.

4.2 $X$ and $Y$ are random vectors

In our test, the distance considered for the calculations of recurrences measures is given for the Euclidean norm. Because the Euclidean distance increases with the dimension, the densities of $N(0,4)$ and $N(2,4)$ were aggregated in the columns 6 and 7. In this subsection, we consider the last two alternatives in Table 3, and in Table 4 of Heller et al. [13], which we will call “Logarithmic”, “Epsilon” and “Quadratic” tests and which are defined as follows:

Logarithmic: $X,Y\in\mathbb{R}^{5}$ where $X_{i}$ $\sim$ $N\left(0,1\right)$ are independent, $Y_{i}=\log\left(X_{i}^{2}\right)$ for $i=1,2,3,4,5.$

Epsilon: $X,Y,\varepsilon\in\mathbb{R}^{5}$ where $X_{i},\varepsilon_{i}$ $\sim$ $N\left(0,1\right)$ are independent, $Y_{i}=\varepsilon_{i}X_{i}$ for $i=1,2,3,4,5.$

Quadratic: $X,Y,\varepsilon\in\mathbb{R}^{5}$ where $X_{i},\varepsilon_{i}$ are independent, $X_{i}\sim N\left(0,1\right),$ $\varepsilon_{i}\sim N\left(0,3\right),$ $Y_{i}=X_{i}+4X_{i}^{2}+\varepsilon_{i}$ $i=1,2,$ $Y_{i}=\varepsilon_{i}$ for all $i=3,4,5.$

We also add the alternatives considered in Boglioni, which are called “2D-pairwise independent” and are defined as follows:

2D-pairwise independent: $X,Z_{0},Y_{1}\sim N\left(0,1\right)$ independent, $Y=\left(Y_{1},Y_{2}\right)$ where $Y_{2}=\left|Z_{0}\right|sign\left(XY_{1}\right).$

In all cases, the critical values of our test were calculated through 50000 replications and the power of all of the tests were considered from 10000 replications.

To have an idea of the size of the test for random vectors, we have simulated $X,Y\in\mathbb{R}^{5}$ using $g_{1}$ and $g_{2}$ proposed in Subsection 3.3. The power of the test were $0.051$ , $0.048$ and $0.052$ for sample sizes of $30,50$ and $80$ , respectively.

4.3 $X$ and $Y$ are time series

In this subsection, we consider the case in which $X$ and $Y$ are time series. In all cases $X$ and $Y$ are time series of length $100$ and the power (due to the computational cost) were calculated by a permutation method for $m=1.000$ replications (Table 7 and Table 8) and $m=100$ replications (Table 9). All the power were calculated using $g_{1}$ and $g_{2}$ proposed in Subsection 4.3. The power for different alternatives and sample sizes in the discrete case are given in Table 7. The AR $(0.1)$ and AR $(0.9)$ means that the time series $X$ is an AR $(1)$ with parameter $0.1$ and $0.9$ , respectively. The case called ARMA $(2,1)$ , is an ARMA $(2,1)$ model with parameters $\phi=(0.2,0.5)$ and $\theta=0.2$ . In column 4 of Table 7, $Z$ represents a white noise where $\sigma$ is the standard deviation of $\sqrt{|X|}$ . In Table 7 and Table 8, $\varepsilon$ and $\varepsilon^{\prime}$ are independent white noises with $\sigma=1$ . In Table 8 are given the power for different alternatives and sample sizes in the continuous case. In this table, $Bm$ represents that $X$ is a Brownian motion with $\sigma=1$ observed in $[0,1]$ (at times $0,1/100,2/100,...,99/100$ ) and $fBm$ is a fractional Brownian motion with Hurst parameter $H=0.7$ . Finally, Table 9 shows the power for cases in which the dependency between $X$ and $Y$ is more difficult to detect. In these cases, $Y$ is a fractional Ornstein-Uhlenbeck process driven by a $fBm$ ( $X$ ) for $H=0.5$ and $H=0.7$ , which we call $OU$ and $FOU$ , respectively. A particular linear combination of $FOU$ , which we call $FOU(2)$ , and whose definition and theoretical developed is found in [17], is a particular case of the models proposed in [2]. Table 9 considers the parameters $\sigma=1,\lambda=0.3$ (column 3) and $\sigma=1,\lambda_{1}=0.3,\lambda_{2}=0.8$ (column 4). More explicitly, $Y_{t}=\sigma\int_{-\infty}^{t}e^{-\lambda(t-s)}dX_{s}$ in column 3 (where $X=\{X_{t}\}$ is a fBm), and $Y_{t}=\dfrac{\lambda_{1}}{\lambda_{1}-\lambda_{2}}\sigma\int_{-\infty}^{t}e^{-\lambda_{1}(t-s)}dX_{s}+\dfrac{\lambda_{2}}{\lambda_{2}-\lambda_{1}}\sigma\int_{-\infty}^{t}e^{-\lambda_{2}(t-s)}dX_{s}$ in column 4 (where $X=\{X_{t}\}$ is a fBm). To give an idea of the size of the test, in column 5 $Y$ is a $Bm$ independent of $X$ .

5 Conclusions

In this work we have presented a new test of independence between two random elements lying in metric spaces. Our test is based on percentages of recurrences for which we need, for each sample, only the information obtained by the distance between points. We have obtained the asymptotic distribution of our statistic and we have shown that the limit distribution under contiguous alternatives has a bias. We have also proven the consistency of the test for a wide class of alternatives, which include the particular case in which $\left(X,Y\right)$ follows a multivariate normal distribution. The performance of the test measured through the calculation of power through several alternatives has shown very good results, clearly improving on others in many cases for different dimensions of the spaces. In future work, we think that the result can be generalized to the case in which there is some kind of dependence between the observation of the sample. In addition, the work of the simulations should be expanded and deepened.

Acknowledgments

Our gratitude to José Rafael León, Ricardo Fraiman, Ernesto Mordecki and Jorge Graneri for their comments that were very useful in the preparation of this work. Also the editor of the journal and the two anonymous referees for their enriching comments.

6 Proofs

Proof of Lemma 1.

[TABLE]

Observe that as $\left(X_{1},Y_{1}\right),\left(X_{2},Y_{2}\right),...,\left(X_{n},Y_{n}\right)$ i.i.d, then

[TABLE]

for all $i,j$ such that $i\neq j.$ Therefore

[TABLE]

Analogously, $\mathbb{E}\left(RR_{n}^{X}(r)\right)=p_{X}(r)$ and $\mathbb{E}\left(RR_{n}^{Y}(s)\right)=p_{Y}(s).$ Given that $X$ and $Y$ are independent, then

[TABLE]

Thus,

[TABLE]

Decomposing (19) in the terms in which $i,j,k,h$ are pairwise different, $\left\{i,j\right\}=\left\{h,k\right\}$ and $\left\{i,j,h,k\right\}$ has three elements, and using that the $X-$ random vectors are i.i.d, we obtain that (19) is equal to

[TABLE]

Analogously

[TABLE]

Similarly, using that the $\left(X,Y\right)-$ random vectors are i.i.d. and also that $X$ and $Y$ are independent,

[TABLE]

With the same technique as in (20) and (21), we obtain $\mathbb{E}\left[RR_{n}^{X,Y}(r,s)RR_{n}^{X}(r^{\prime})RR_{n}^{Y}(s^{\prime})\right]=$

[TABLE]

Therefore

[TABLE]

Putting (20), (21) and (22) in (18), we obtain that (18) is equal to

[TABLE]

Then

[TABLE]

∎

Proof of Lemma 2.

[TABLE]

where $H_{n}(r,s)=$

[TABLE]

Then, $H_{n}(r,s)$ is equal to

[TABLE]

Now, we decompose

[TABLE]

and substituting in (23) we obtain that (23) is equal to

[TABLE]

Observe that (24) it is bounded between [math] and

[TABLE]

∎

Proof of Theorem 3.

[TABLE]

Every continuous function $h:\mathbb{R}\rightarrow\mathbb{R}$ with finit limits as $x\rightarrow\pm\infty$ is uniformly continuous. Therefore given $\varepsilon>0$ , exist $\delta>0$ such that $|F(x)-F(y)|\leq\varepsilon^{2}/8$ and $|G(x)-G(y)|\leq\varepsilon^{2}/8$ for all $(x,y)$ such that $|x-y|<\delta$ , where $F$ and $G$ are the distribution functions of $d(X_{1},X_{2})$ and $d(Y_{1},Y_{2})$ respectively. If $H_{0}$ is true, consider for each $r,s>0$ the functions $f_{r,s}:(S_{X}\times S_{Y})^{4}\rightarrow\mathbb{R}$ defined by

[TABLE]

where $x,x^{\prime},x^{\prime\prime},x^{\prime\prime\prime}\in S_{X}$ and $y,y^{\prime},y^{\prime\prime},y^{\prime\prime\prime}\in S_{Y}.$ and consider the family $\mathcal{F}=\{f_{r,s}\}_{r,s>0}$ . To simplify the notation, we call $z=\left(x,y,x^{\prime},y^{\prime},x^{\prime\prime},y^{\prime\prime},x^{\prime\prime\prime},y^{\prime\prime\prime}\right)$ throughout the demonstration.

Observe that

[TABLE]

then the process $\left\{E_{n}^{\prime}(r,s)\right\}_{r,s>0}$ is an $U-$ process of order $4.$

To obtain the convergence, according to Arcones & Giné’s Theorem 4.10, it is enough to prove that

[TABLE]

If $\varepsilon\geq 2,$ then $-1\leq f_{r,s}(z)\leq 1$ for all $z\in(S_{X}\times S_{Y})^{4}$ and $r,s>0.$ Then $\mathcal{L}=\left\{-1\right\}$ , $\mathcal{U}=\left\{1\right\}$ satisfied (6) Thus, $N_{\left[{\ }\right]}^{\left(2\right)}\left(\varepsilon,\mathcal{F},P^{4}\right)=1$ , therefore $\int_{0}^{+\infty}\left(\log N_{\left[{\ }\right]}^{\left(2\right)}\left(\varepsilon,\mathcal{F},P^{4}\right)\right)^{1/2}d\varepsilon=\int_{0}^{2}\left(\log N_{\left[{\ }\right]}^{\left(2\right)}\left(\varepsilon,\mathcal{F},P^{4}\right)\right)^{1/2}d\varepsilon.$

If $\varepsilon<2$ , we take $T>0$ such that $\max\left\{1-F(T),1-G(T)\right\}<\varepsilon^{2}/8$ , then we partition $\left[0,+\infty\right)$ into $m+1$ subintervals of the form $\left[\frac{iT}{m},\frac{(i+1)T}{m}\right)$ such that $\frac{T}{m}<\delta,$ where $\frac{(m+1)T}{m}$ is interpreted as $+\infty.$ Define the following functions

[TABLE]

and

[TABLE]

Observe that for each $r,s>0$ there exists $i,j\in\left\{0,1,2,...,m\right\}$ such that $\frac{iT}{m}\leq r<\frac{(i+1)T}{m}$ and $\frac{jT}{m}\leq s<\frac{(j+1)T}{m}.$

Then

[TABLE]

Thus $\mathcal{L}=\left\{l_{i,j}\right\}$ and $\mathcal{U}=\left\{u_{i,j}\right\}$ where $l_{i,j}(z)=g_{i,j}(z)-h_{i+1,j+1}(z)$ and $u_{i,j}(z)=g_{i+1,j+1}(z)-h_{i,j}(z)$ for $i,j=0,1,2,...,m.$ Also

[TABLE]

Define the sets $A_{i,j}:=\left[0,\frac{(i+1)T}{m}\right)\times\left[0,\frac{(j+1)T}{m}\right)-\left[0,\frac{iT}{m}\right)\times\left[0,\frac{jT}{m}\right)$ , then

[TABLE]

Analogously,

[TABLE]

putting (27) and (26) in (25) we obtain that $\mathbb{E}\left(u_{i,j}(Z)-l_{i,j}(Z)\right)^{2}\leq\varepsilon^{2}.$

Lastly, observe that the cardinal of $\mathcal{L}$ and $\mathcal{U}$ is $\left(m+1\right)^{2}$ , then

[TABLE]

∎

Proof of Theorem 4.

[TABLE]

Define $\mu\left(r,s\right)=P\left(d(X_{1},X_{2})<r,\text{ }d(Y_{1},Y_{2})<s\right)-P\left(d(X_{1},X_{2})<r\right)P\left(d(Y_{1},Y_{2})<s\right).$ Then, $r_{0},s_{0}>0$ exist, such that $\mu^{2}\left(r_{0},s_{0}\right)>0$ , thus $\varepsilon>0$ exist and $A\subset\left[0,+\infty\right)^{2}$ such that $\left(r_{0},s_{0}\right)\in A$ and $\mu^{2}\left(r,s\right)>\varepsilon$ for all $\left(r,s\right)\in A.$ Then, as $n\rightarrow+\infty,$

[TABLE]

Now, using that $\left(a+b\right)^{2}\leq 2\left(a^{2}+b^{2}\right)$ we obtain that $n\int_{0}^{+\infty}\int_{0}^{+\infty}\mu^{2}\left(r,s\right)g(r,s)drds\leq$

[TABLE]

Thus

[TABLE]

∎

Proof of Corollary 1.

[TABLE]

Because all of the norms in $\mathbb{R}^{p}$ and $\mathbb{R}^{q}$ are equivalent, it is enough to give the proof for the Euclidean norm case. We use that if $\left(Z,T\right)$ has centered normal bivariate distribution, then $\mathbb{COV}\left(Z^{2},T^{2}\right)=2\left(\mathbb{COV}\left(Z,T\right)\right)^{2}.$

Let us call $X=\left(X_{\left(1\right)},X_{\left(2\right)},...,X_{\left(p\right)}\right)$ and $Y=\left(Y_{\left(1\right)},Y_{\left(2\right)},...,Y_{\left(q\right)}\right).$ Then

[TABLE]

If $X$ and $Y$ are not independent, then $i$ and $j$ exist such that $\mathbb{COV}\left(X_{\left(i\right)},Y_{\left(j\right)}\right)\neq 0,$ then $\mathbb{COV}\left(\left\|X\right\|^{2},\left\|Y\right\|^{2}\right)>0,$ then $\left\|X\right\|^{2}$ and $\left\|Y\right\|^{2}$ are not independent, therefore $\left\|X\right\|$ and $\left\|Y\right\|$ are not independent, and then exist $r$ and $s$ positive numbers such that $P\left(\left\|X\right\|<r,\text{ }\left\|Y\right\|<s\right)\neq P\left(\left\|X\right\|<r\right)P\left(\left\|Y\right\|<s\right).$ If we apply this argument for $X_{1}-X_{2}$ and $Y_{1}-Y_{2}$ instead $X$ and $Y$ , then we obtain that

[TABLE]

Lastly, the result follows from Theorem 2. ∎

Proof of Proposition 1.

[TABLE]

Define $A_{r,s}:=\left\{\left(x_{1},y_{1},x_{2},y_{2}\right)\in\mathbb{R}^{2p+2q}:\text{ }d(x_{1},x_{2})<r,\text{ }d(y_{1},y_{2})<s\right\},$ then (28) is equal to

[TABLE]

where $\left|\varepsilon_{n}\left(r,s\right)\right|\leq\frac{c}{\sqrt{n}}$ for all $r,s>0$ and $c$ is a constant.

[TABLE]

Therefore

[TABLE]

Then $\mathbb{E}^{\left(n\right)}\left(E_{n}(r,s)\right)\rightarrow$

[TABLE]

as $n\rightarrow+\infty.$ ∎

Bibliography27

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Arcones, M. A. and Giné, E. (1993). Limit Theorems for U-Processes. The Annals of Probability Vol 21-3, 1494-1542.
2[2] Arratia, A., Cabaña, A. & Cabaña, E., (2016). A construction of Continuous time ARMA models by iterations of Ornstein-Uhlenbeck process, SORT Vol 40 (2) 267-302.
3[3] Bakirov, N. K., Rizzo, M. L., and Székely, G. J. (2006). A multivariate non- parametric test of independence. Journal of Multivariate Analysis, 97(8):1742-1756.
4[4] Beran, R., Bilodeau, M., and Lafaye de Micheaux, P. (2007). Nonparametric tests of independence between random vectors. Journal of Multivariate Analysis. 98(9):1805–1824.
5[5] Bilodeau, M. and Lafaye de Micheaux, P. (2005). A multivariate empirical caracteristic function test of independence with normal marginals. Journal of ultivariate Analysis, 95(2):345–369.
6[6] Blomqvist, N. (1950). On a measure of dependence between two random variables. The Annals of Mathematical Statistics 593-600.
7[7] Boglioni, G. (2016). A consistent test of independence between random vectors. https://papyrus.bib.umontreal.ca/xmlui/bitstream/handle/1866/18773/Bo glioni_Beaulieu_Guillaume_ 2016_memoire.pdf?sequence=2.
8[8] Cabaña, E. M. (1997). Contiguidad, pruebas de ajuste y Procesos Empí ricos Transformados. Décima escuela venezolana de Matemáticas.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

An Independence Test Based on Recurrence Rates

Abstract

1 Introduction

2 Test approach and theoretical results

2.1 Asymptotic results under H0H_{0}H0​ and consistency

Lemma 1**.**

Lemma 2**.**

Theorem 3**.**

Remark 1**.**

Remark 2**.**

Theorem 4**.**

Corollary 1**.**

Remark 3**.**

2.2 Contiguous alternatives

Proposition 1**.**

3 Implementation of the test

3.1 XXX and YYY are random variables

3.2 General case

3.3 A simple method to choose the weight function

3.4 Computing the statistic

4 A simulation study

4.1 XXX and YYY are random variables

4.2 XXX and YYY are random vectors

4.3 XXX and YYY are time series

5 Conclusions

Acknowledgments

6 Proofs

Proof of Lemma 1.

Proof of Lemma 2.

Proof of Theorem 3.

Proof of Theorem 4.

Proof of Corollary 1.

Proof of Proposition 1.

2.1 Asymptotic results under $H_{0}$ and consistency

Lemma 1.

Lemma 2.

Theorem 3.

Remark 1.

Remark 2.

Theorem 4.

Corollary 1.

Remark 3.

Proposition 1.

3.1 $X$ and $Y$ are random variables

4.1 $X$ and $Y$ are random variables

4.2 $X$ and $Y$ are random vectors

4.3 $X$ and $Y$ are time series