Validation of Association

Ćmiel Bogdan; Ledwina Teresa

arXiv:1904.06519·stat.ME·April 16, 2019


TL;DR

This paper introduces a new quantile dependence function to measure and visualize complex dependence structures between variables, providing new tests for independence and insightful diagnostic plots.

Contribution

It develops a novel function-valued measure of dependence, new estimators, and tests for independence, enhancing interpretability and detection of dependence in joint distributions.

Findings

01

The new tests outperform existing independence tests in simulations.

02

The dependence function reveals detailed dependence structures in real data.

03

Graphical tools aid in interpreting complex dependence patterns.

Abstract

Recognizing, quantifying and visualizing associations between two variables is increasingly important. This paper investigates how a new function-valued measure of dependence, the quantile dependence function, can be used to construct tests for independence and to provide an easily interpretable diagnostic plot of existing departures from the null model. The dependence function is designed to detect general dependence structure between variables in quantiles of the joint distribution. It gives an insight into how the dependence structure changes in different parts of the joint distribution. We define new estimators of the dependence function, discuss some of their properties, and apply them to construct new tests of independence. Numerical evidence is given on the tests' benefits against three recognized independence tests introduced in recent years. In real-data analysis, we…


Equations

$$q(u,v)=q_{C}(u,v)=\frac{C(u,v)-uv}{\sqrt{uv(1-u)(1-v)}},\quad(u,v)\in(0,1)^{2}.$$

$$w(u,v)=\frac{1}{\sqrt{uv(1-u)(1-v)}}.$$

$$E_{n}(u,v)=\frac{1}{n}\sum_{i=1}^{n}{\bf 1}\Bigl(\frac{R_{i}}{n}\leq u,\frac{S_{i}}{n}\leq v\Bigr).$$

$$C_{n}(u,v)=\frac{1}{n}\sum_{i=1}^{n}{\bf 1}\Bigl(\frac{R_{i}}{n+1}\leq u,\frac{S_{i}}{n+1}\leq v\Bigr).$$

$$N(u,v)=C(u,v)-uv,$$

$$N_{n}(u,v)=C_{n}(u,v)-uv,$$

$$N(u,v)$$

$$N_{n}^{(1)}(u,v)$$

$$N_{n}^{(2)}(u,v)$$

$$N_{n}^{(3)}(u,v)$$

$$N_{n}^{(4)}(u,v)$$

$$N_{n}^{*}(u,v)$$

$$N_{n}^{*}(0,v)=N_{n}^{*}(u,0)=N_{n}^{*}(u,1)=N_{n}^{*}(1,v)=0.$$

$$Q_{n}(u,v)=w(u,v)N_{n}(u,v)\quad\mbox{and}\quad Q_{n}^{*}(u,v)=w(u,v)N_{n}^{*}(u,v),$$

$$Q_{n,s}^{*}(u,v)=\Bigl(\frac{2s}{n+1}\Bigr)^{-2}\int_{u-\frac{s}{n+1}}^{u+\frac{s}{n+1}}\int_{v-\frac{s}{n+1}}^{v+\frac{s}{n+1}}Q_{n}^{*}(x,y)\,dy\,dx.$$

$$\sqrt{n}\,Q_{n}(u,v)\stackrel{D}{\rightarrow}N(0,1),\quad\sqrt{n}\,Q_{n}^{*}(u,v)\stackrel{D}{\rightarrow}N(0,1)\quad\mbox{and}\quad\sqrt{n}\,Q_{n,s}^{*}(u,v)\stackrel{D}{\rightarrow}N(0,1),$$

$$H_{0}:\;C(u,v)=uv,\quad\mbox{for}\;(u,v)\in[0,1]^{2}$$

$$n\int_{0}^{1}\int_{0}^{1}u^{2\gamma}v^{2\delta}\{E_{n}(u,v)-uv\}^{2}\,du\,dv,$$

$$n\int_{0}^{1}\int_{0}^{1}\{g(u,v)\}^{-\gamma}\{C_{n}^{\beta}(u,v)-uv\}^{2}\,du\,dv,$$

$${\cal L}_{r,n}^{*}=\sqrt{n}\Bigl\{\int_{0}^{1}\int_{0}^{1}\big|Q_{n}^{*}(u,v)\big|^{r}\,du\,dv\Bigr\}^{1/r}.$$

$$A(\epsilon)=[0,1]^{2}\setminus\bigl\{[0,\epsilon]^{2}\cup[1-\epsilon,1]\times[0,\epsilon]\cup[1-\epsilon,1]^{2}\cup[0,\epsilon]\times[1-\epsilon,1]\bigr\}$$

$${\cal L}_{\epsilon,r,n}^{*}=\sqrt{n}\Bigl\{\int_{A(\epsilon)}|Q_{n}^{*}(u,v)|^{r}\,du\,dv\Bigr\}^{1/r}.$$

$${\cal M}_{n}^{*}=\min\bigl\{p^{(1)}_{n},p^{(2)}_{n}\bigr\}.$$

$${\cal D}_{\kappa,0,n}^{*}=\sup_{(u,v)\in[\kappa,1-\kappa]^{2}}\sqrt{n}\,\big|Q_{n}^{*}(u,v)\big|.$$

$${\cal D}_{\kappa,s,n}^{*}=\sup_{(u,v)\in[\kappa,1-\kappa]^{2}}\sqrt{n}\,\big|Q_{n,s}^{*}(u,v)\big|.$$

$$C_{n}^{(1)}(u,v)=\sqrt{n}\{C_{n}(u,v)-uv\},\quad w(u,v)=\frac{1}{\sqrt{uv(1-u)(1-v)}},$$

$$N_{n}^{(2)}(u,v)=N_{n}^{(1)}(u,v)-r_{n}^{(2)},\quad\mbox{where}\quad r_{n}^{(2)}=\frac{1}{n}\lfloor(n+1)u\rfloor-u,\quad\mbox{and}\quad|r_{n}^{(2)}|\leq\frac{1}{n}\quad\mbox{for}\;u\in[0,1],$$

$$N_{n}^{(4)}(u,v)=N_{n}^{(1)}(u,v)-r_{n}^{(4)},\quad\mbox{where}\quad r_{n}^{(4)}=\frac{1}{n}\lfloor(n+1)v\rfloor-v,\quad\mbox{and}\quad|r_{n}^{(4)}|\leq\frac{1}{n}\quad\mbox{for}\;v\in[0,1],$$

$$N_{n}^{(3)}(u,v)=-N_{n}^{(1)}(u,v)+N_{n}^{(2)}(u,v)+N_{n}^{(4)}(u,v)=N_{n}^{(1)}(u,v)-r_{n}^{(2)}-r_{n}^{(4)}.$$

$$\bigl|Q_{n}^{*}(u_{0},v_{0})-Q_{n,s}^{*}(u_{0},v_{0})\bigr|\leq\frac{1}{n}c(n_{0},u_{0},v_{0},s).$$

$$B_{n,s}(u_{0},v_{0})=\Bigl[u_{0}-\frac{s}{n+1},u_{0}+\frac{s}{n+1}\Bigr]\times\Bigl[v_{0}-\frac{s}{n+1},v_{0}+\frac{s}{n+1}\Bigr].$$

$$\sup_{(u,v)\in B_{n,s}(u_{0},v_{0})}\bigl|N_{n}^{*}(u,v)-N_{n}^{*}(u_{0},v_{0})\bigr|=\sup_{(u,v)\in B_{n,s}(u_{0},v_{0})}\bigl|C_{n}(u,v)-uv-C_{n}(u_{0},v_{0})+u_{0}v_{0}\bigr|$$


Taxonomy

Topics: Advanced Statistical Methods and Models · Financial Risk and Volatility Modeling · Statistical Methods and Inference

Full text

Validation of Association

Bogdan Ćmiel

Faculty of Applied Mathematics, AGH University of Science and Technology,

Al. Mickiewicza 30, 30-059 Cracow, Poland

e-mail: [email protected]

and

Teresa Ledwina

Institute of Mathematics, Polish Academy of Sciences

ul. Kopernika 18, 51-617 Wrocław, Poland

e-mail: [email protected]

Abstract

Recognizing, quantifying and visualizing associations between two variables is increasingly important. This paper investigates how a new function-valued measure of dependence, the quantile dependence function, can be used to construct tests for independence and to provide an easily interpretable diagnostic plot of existing departures from the null model. The dependence function is designed to detect general dependence structure between variables in quantiles of the joint distribution. It gives an insight into how the dependence structure changes in different parts of the joint distribution. We define new estimators of the dependence function, discuss some of their properties, and apply them to construct new tests of independence. Numerical evidence is given on the tests' benefits against three recognized independence tests introduced in recent years. In real-data analysis, we illustrate the use of our tests and the graphical presentation of the underlying dependence structure.

Keywords: Copula; Cross-quantilogram; Independence testing; Measure of dependence; Quantile; Weighted statistics

1 Introduction

Measuring dependence and testing for independence have been studied intensively in recent years. On the one hand, in many applied sciences a fundamental question is how to quantify the dependence between the variables under study. On the other hand, it is now well understood that classical solutions are designed to capture specific, relatively simple structures of dependence between two random variables. Thus they are not well suited to the scope of modern statistical analysis and may lead to completely misleading conclusions. Another inspiration is exploratory data analysis, where one of the goals is to investigate a large data set and search for pairs of variables which are closely associated. Obviously, the dependence structure of a random vector cannot be neglected in reliable data analysis. In particular, this problem is crucial in insurance and finance. For example, Albers, (1999) showed that substantial deviations in the fair price of stop-loss premiums may occur even when there are small departures from independence. For further evidence and related discussion see Kass, (1993), Dhaene and Goovaerts, (1997), and Dhaene et al., (2009).

These challenges have stimulated great development in the area of evaluation of relationships, measuring their strength and testing for a lack of dependence via pertinent statistics. An emphasis has been put on capturing complex dependence structures which cannot be detected by classical solutions. For an illustration of several ideas and approaches to quantifying dependence and detecting associations which have been considered over the last decade (in chronological order), see: Székely and Rizzo, (2009), Delicado and Smrekar, (2009), Reshef et al., (2011), Póczos et al., (2012), Zheng et al., (2012), Heller et al., (2013), Kwitt and Neumeyer, (2013), Reimherr and Nicolae, (2013), Berentsen and Tjøstheim, (2014), Vexler et al., (2014), Ledwina, (2015), Heller et al., (2016), Reshef et al., (2016), Bagkavos and Patil, (2017), Ding et al., (2017), Xu et al., (2017), Vexler et al., (2017), Wang et al., (2017), Reshef et al., (2018), Vexler et al., (2018), Zhang et al., (2018). These papers also discuss many earlier ideas and developments. Most of them concern bivariate vectors and independent observations. A parallel stream of articles deals with dependent data and multivariate observations. It is beyond the scope of this paper to survey these more general situations.

The present paper proposes and studies new tests of independence which are closely related to the function-valued measure of dependence $q$ proposed in Ledwina, (2014, 2015). The measure is conceptually appealing and has a straightforward interpretation facilitating a comprehensive view of the association structure. The pertaining tests are of simple form and have good power. Their extension to multivariate observations, to dependent data, and to detecting positive or negative dependence is straightforward. Additionally, their big advantage is a natural link to an easily interpretable diagnostic plot. Consequently, when the hypothesis of independence is rejected, reliable information is available on which part of the population invalidates the hypothesis, and to what extent.

In many practical situations variables are measured on different scales. Therefore, a common requirement is to consider dependence measures that are invariant to strictly increasing transformations of the marginal variables. The copula-based dependence measures fulfill this postulate and are therefore scale invariant. Several real-valued measures of this type have been studied for a long time. For a nice overview see Schweizer and Wolf, (1981), and Ding and Li, (2015). However, nowadays there is strong evidence that attempting to express a complex dependence structure via a single number is hopeless. Vexler et al., (2017) pointed out that such a conclusion was clearly spelled out as early as in the celebrated book by Kendall and Stuart, (1961).

To help understand the underlying dependence structure with the aid of a procedure invariant under strictly increasing transformations of the marginal distributions, Fisher and Switzer, (1985) proposed a rank-based graphical tool called a chi-plot. The procedure is rather complicated and not easy to interpret. A simpler solution was introduced by Genest and Boies, (2003). Their display is called a Kendall plot and refers to the idea of a Q-Q plot. It is also based on ranks and is more directly related to the underlying copula function than the chi-plot. See also Vexler et al., (2018) for further development of this idea. Ledwina, (2014, 2015) has proposed a simple dependence measure $q$ which is explicitly based on the copula function. The measure aggregates some Fourier coefficients of the copula in some special non-orthogonal basis. Expansions in the Hilbert space using systems of functions which are not mutually orthogonal appear in the literature under the label “quasiorthogonal expansions”; cf. Daubechies et al., (1986). The above mentioned Fourier coefficients can also be interpreted in terms of some local correlations; cf. Ledwina and Wyłupek, (2014), p. 39, and Ledwina and Wyłupek, (2012), p. 361. For further interpretation see Remark 1, below. Here we mention only that the measure $q$ can be seen to be naturally related to the cross-quantilogram, a notion of growing importance in econometrics.

The remainder of the article is organized as follows: Section 2 recalls the definition of the measure $q$ and its properties, shows some links of $q$ to other existing notions, and illustrates the shape of the measure in a series of interesting bivariate models considered in recent literature on independence testing. Section 3 introduces a new useful estimator of $q$ . In Section 4, we present three new test statistics pertaining to the proposed estimator of $q$ . Two test statistics are the supremum-norm and integral-norm of the estimated $q$ , respectively. The third solution exploits the minimum $p$ -value principle. We also report there basic theoretical results on the new tests. In Section 5, we provide a comparative power study of the new solutions and some competitors which have already been introduced in the literature on the subject. Section 6 contains a study of a real data example. Proofs of all results are relegated to the Supplementary Materials, which are located in the final part of the manuscript. In the Supplementary Materials we also include some comments on our implementation of the new statistics, and provide related C codes.

2 A Copula-Based Measure of Dependence: the Quantile Dependence Function

2.1 Definition and Properties

Consider a pair of random variables $X$ and $Y$ with bivariate cumulative distribution function $H$ and with continuous margins $F$ and $G$ , respectively. Then there exists a unique copula $C$ such that $C(u,v)=H(F^{-1}(u),G^{-1}(v)),\;(u,v)\in[0,1]^{2}$ . Obviously, $F^{-1}(u)$ and $G^{-1}(v)$ , appearing in this formula, are the $u$ -quantile and $v$ -quantile of the respective marginal distribution functions. Set

$$q(u,v)=q_{C}(u,v)=\frac{C(u,v)-uv}{\sqrt{uv(1-u)(1-v)}},\quad(u,v)\in(0,1)^{2}.\tag{1}$$

We shall call $q$ the quantile dependence function. By (1), the measure $q$ assigns to the copula $C$ a continuous function on $(0,1)^{2}$ . As stated and justified in Ledwina, (2014, 2015), the measure $q$ fulfills natural postulates, motivated by the axioms formulated in Schweizer and Wolf, (1981) and updated in Embrechts et al., (2002). For convenience, we recall here the basic properties of $q$ . Below, we set

$$w(u,v)=\frac{1}{\sqrt{uv(1-u)(1-v)}}.\tag{2}$$

Proposition 1.

The quantile dependence function $q$ , given by (1), has the following properties.

1. $-1\leq q(u,v)\leq 1$ for all $(u,v)\in(0,1)^{2}$ .

2. By the Fréchet-Hoeffding bounds for copulas, property 1 can be further sharpened to $B_{o}(u,v)\leq q(u,v)\leq B^{o}(u,v),\;\;(u,v)\in(0,1)^{2},$ where $B_{o}(u,v)=w(u,v)[\max\{u+v-1,0\}-uv]$ and $B^{o}(u,v)=w(u,v)[\min\{u,v\}-uv]$ .

3. $q$ is maximal (minimal) if and only if $Y=f(X)$ and $f$ is strictly increasing (decreasing) a.s. on the range of $X$ .

4. $q(u,v)\equiv 0$ if and only if $X$ and $Y$ are independent.

5. The equation $q(u,v)\equiv c$ , $c$ a constant, can hold true if and only if $c=0$ .

6. $q$ is non-negative (non-positive) if and only if $(X,Y)$ are positively (negatively) quadrant-dependent.

7. $q$ is invariant under transformations which are strictly increasing a.s. on ranges of $X$ and $Y$ , respectively.

8. If $X$ and $Y$ are transformed by strictly decreasing a.s. functions, then $q(u,v)$ is transformed to $q(1-u,1-v)$ .

9. If $f$ and $g$ are strictly decreasing a.s. on ranges of $X$ and $Y$ , respectively, then $q$ ’s for the pairs $(f(X),Y)$ and $(X,g(Y))$ take the forms $-q(1-u,v)$ and $-q(u,1-v)$ , accordingly.

10. $q$ respects concordance ordering, i.e. for cdf’s $H_{1}$ and $H_{2}$ with the same marginals and corresponding copulas $C_{1}$ and $C_{2}$ , $H_{1}(x,y)\leq H_{2}(x,y)$ for all $(x,y)\in\mathbb{R}^{2}$ implies $q_{C_{1}}(u,v)\leq q_{C_{2}}(u,v)$ for all $(u,v)\in(0,1)^{2}$ .

11. If $(X,Y)$ and $(X_{n},Y_{n}),\;n=1,2,\ldots,$ are pairs of random variables with joint cdf’s $H$ and ${\check{H}}_{n}$ , and the corresponding copulas $C$ and ${\check{C}}_{n}$ , respectively, then weak convergence of $\{\check{H}_{n}\}$ to $H$ implies $q_{\check{C}_{n}}(u,v)\to q_{C}(u,v)$ for each $(u,v)\in(0,1)^{2}$ .
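As a quick numerical illustration of definition (1) and of properties 1, 2 and 4, the following sketch evaluates $q$ for three textbook copulas. It assumes the weight $w(u,v)=1/\sqrt{uv(1-u)(1-v)}$; the function names are ours and are not taken from the authors' C codes.

```python
import math

def w(u, v):
    """Weight 1 / sqrt(u v (1-u) (1-v)) on the open unit square (assumed form)."""
    return 1.0 / math.sqrt(u * v * (1.0 - u) * (1.0 - v))

def q(copula, u, v):
    """Quantile dependence function q_C(u, v) = w(u, v) * (C(u, v) - u v)."""
    return w(u, v) * (copula(u, v) - u * v)

# Three reference copulas (standard definitions).
def independence(u, v):
    return u * v

def upper_bound(u, v):
    # Frechet-Hoeffding upper bound: Y is an increasing function of X
    return min(u, v)

def lower_bound(u, v):
    # Frechet-Hoeffding lower bound: Y is a decreasing function of X
    return max(u + v - 1.0, 0.0)

if __name__ == "__main__":
    for u, v in [(0.3, 0.3), (0.2, 0.7), (0.5, 0.5)]:
        print(u, v,
              q(lower_bound, u, v),    # B_o(u, v), the smallest possible value
              q(independence, u, v),   # identically zero
              q(upper_bound, u, v))    # B^o(u, v), the largest possible value
```

On the diagonal the upper bound equals 1 and on the anti-diagonal the lower bound equals $-1$, so the range $[-1,1]$ in property 1 is attained only there.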

Remark 1.

The measure $q(u,v)$ can be explicitly related to some tail-dependence indices: lower tail-dependence coefficient $\lambda_{L}=\lim_{u\searrow 0}\frac{C(u,u)}{u}=\lim_{u\searrow 0}q(u,u)$ and the coefficient of upper tail dependence $\lambda_{U}=\lim_{u\nearrow 1}\frac{1-2u+C(u,u)}{1-u}$ $=\lim_{u\nearrow 1}q(u,u)$ , which are defined in the situations when the respective limits exist; coefficients of tail dependence: $\tau^{UU}(u)=P(U>u|V>u),\;$ $\tau^{LL}(u)=P(U<1-u|V<1-u),$ $\tau^{LU}(u)=P(U<1-u|V>u)$ and $\tau^{UL}(u)=P(U>u|V<1-u)$ . In particular, on the diagonal, $q(u,u)=\tau^{UU}(u)+\tau^{LL}(1-u)-1$ while on the anti-diagonal $q(u,1-u)=1-\tau^{UL}(u)-\tau^{LU}(1-u)$ . See Sibuya, (1960), and Chicheportiche, (2013), respectively.

The measure $q$ is weak-equitable and weakly-robust-equitable in the sense of Definitions 2 and 4 in Ding et al., (2017), accordingly.

For radially symmetric copulas $(C(u,v)=\underline{C}(1-u,1-v);$ where $\underline{C}(u,v)=P(U>u,V>v))$ it holds that $q(u,u)=\beta((u,u),(1,1))$ , where $\beta((u_{1},u_{2}),(v_{1},v_{2}))$ stands for generalized Blomqvist’s measure of concordance considered in Schmid and Schmidt, (2007).

Finally note that $q$ coincides with the cross-quantilogram applied to $(X,Y)$ . Therefore, using precise terminology, we can say that $q(u,v)$ is the cross-correlation of the respective quantile hits. The cross-quantilogram for time series was, somewhat in passing, introduced on p. 261 in Linton and Whang, (2007) in the context of predictability studies. In Han et al., (2016) the idea has received considerable attention and development.
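The diagonal identity and the lower-tail limit in Remark 1 can be checked numerically. The sketch below uses the Clayton copula in its standard parametrization $C(u,v)=(u^{-\theta}+v^{-\theta}-1)^{-1/\theta}$ with $\theta=0.5$, for which $\lambda_{L}=2^{-1/\theta}=0.25$; whether this matches the parametrization behind model BM7 is an assumption, and all function names are illustrative.

```python
def clayton(u, v, theta=0.5):
    """Clayton copula, standard parametrization with theta > 0."""
    return (u ** -theta + v ** -theta - 1.0) ** (-1.0 / theta)

def q_diag(u, theta=0.5):
    """q(u, u); on the diagonal sqrt(u * u * (1-u) * (1-u)) reduces to u (1-u)."""
    return (clayton(u, u, theta) - u * u) / (u * (1.0 - u))

def tau_uu(u, theta=0.5):
    """P(U > u | V > u) = (1 - 2u + C(u, u)) / (1 - u)."""
    return (1.0 - 2.0 * u + clayton(u, u, theta)) / (1.0 - u)

def tau_ll(u, theta=0.5):
    """P(U < 1-u | V < 1-u) = C(1-u, 1-u) / (1-u)."""
    t = 1.0 - u
    return clayton(t, t, theta) / t

if __name__ == "__main__":
    for u in (0.1, 0.3, 0.5, 0.9):
        lhs = q_diag(u)
        rhs = tau_uu(u) + tau_ll(1.0 - u) - 1.0
        print(u, lhs, rhs)             # the two columns agree up to rounding
    print(q_diag(1e-6))                # close to lambda_L = 0.25
```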

2.2 Graphical illustration

Given a formula for $C$ , the quantile dependence function $q$ can be graphically presented in many convenient ways. However, in the next section we introduce a new estimate $Q_{n}^{*}$ of $q$ and here we simply display the corresponding (averaged) values of the estimate for large $n$ , and for several interesting forms of dependence of $X$ and $Y$ . To be specific, in Figures 1 and 2 we took $n=1000$ and calculated the averages over 10 000 MC runs. This gives quite precise information on the shape of $q$ .

To see how the measure $q$ reflects different forms of dependence and to study empirical powers of new tests proposed in Section 4, we have considered a wide spectrum of models. Some of them are classical ones, a considerable portion has been recently introduced in different simulation experiments in some related papers, a few of the models have been defined just for this study. Below we only show the sources of some recently introduced models. Classical ones were used in numerous earlier simulation studies. Throughout the paper ${\bf 1}(A)$ stands for the indicator of the set $A$ . The list of models is as follows.

Simple Regression:

SR1:

linear $\;\;Y=2+X+\epsilon,\ \;X\sim U[0,1],\ \;\epsilon\sim N(0,1)$ ; Vexler et al., (2014);

SR2:

root $\;\;Y=X^{1/4}+\epsilon,\ \;X\sim U[0,1],\ \;\epsilon\sim N(0,0.25)$ ; Simon and Tibshirani, (2012), Reshef et al., (2018);

SR3:

step $\;\;Y={\bf 1}(X\leq 0.5)+\epsilon,\;X\sim U[0,1],\ \epsilon\sim N(0,2)$ ; Simon and Tibshirani, (2012), Reshef et al., (2018);

SR4:

logarithmic $\;\;Y=\log(1+|X|)+\epsilon,\;X\sim N(0,1);\epsilon\sim N(0,1)$ ; Vexler et al., (2014);

SR5:

W $\;\;Y=4[(2X-1)^{2}-0.5]^{2}+\epsilon,\;X\sim U[0,1],\;\epsilon\sim N(0,0.5)$ ; Ding and Li, (2015).

Heteroscedastic Regression:

HR1:

reciprocal $\;\;Y=\sigma(X)\epsilon,\;$ $X$ has exponential distribution with $\lambda=0.1$ , $\epsilon\sim N(0,1),\;\sigma(X)=\sqrt{1+1/X^{2}}$ ;

HR2:

linear $\;\;Y=\sigma(X)\epsilon,\;$ $X\sim U[1,16],\;$ $\epsilon\sim N(0,1),\;\sigma(X)=\sqrt{X}$ .

Random-Effect-Type Models:

RE1:

linear $\;\;Y=2+X+\epsilon_{M}X+\epsilon_{A},\ \;X\sim U[0,1],\ \;\epsilon_{M}\sim N(0,4),\ \;\epsilon_{A}\sim N(0,1)$ , Vexler et al., (2014);

RE2:

quadratic $\;\;Y=\epsilon_{M}(2+X+X^{2})+\epsilon_{A},\;\epsilon_{M}\sim N(0,1),\;\epsilon_{A}\sim N(0,1)$ , Vexler et al., (2017);

RE3:

reciprocal $\;\;Y=\epsilon_{M}X^{-1}+\epsilon_{A},\;\epsilon_{M}\sim N(0,1),\;\epsilon_{A}\sim N(0,1)$ , Vexler et al., (2017);

RE4:

heavy tailed $\;(X_{0},Y_{0})$ is bivariate Cauchy. Define $X=X_{0},Y=\epsilon_{M}Y_{0}+\epsilon_{A}$ , $\epsilon_{M}\sim N(0,1),\;\epsilon_{A}\sim N(0,1)$ , Supplemental Material for Vexler et al., (2017).

Bivariate Models:

BM1:

Gaussian bivariate normal distribution with $\rho=0.3$ ;

BM2:

mixture I the mixture (0.1)(standard bivariate Gaussian) + (0.9)(bivariate Gaussian with mean 0, variances 6 and the covariance 5);

BM3:

mixture II the mixture (0.3)(bivariate Cauchy) + (0.7)(standard bivariate Gaussian);

BM4:

switched regression $Y=\mu(X)+\epsilon,\;\epsilon\sim N(0,1),\;\mu(X)=0$ for $|X|\leq 1.96$ and $\mu(X)=-X$ otherwise;

BM5:

Mardia Mardia family of copulas with $\theta=-0.55$ ;

BM6:

Gumbel Gumbel bivariate distribution with $\theta=0.5$ ;

BM7:

Clayton Clayton model with $\theta=0.5$ ;

BM8:

Cauchy bivariate Cauchy distribution;

BM9:

Student symmetric symmetric Student’s distribution with 2 degrees of freedom;

BM10:

Student skew skew bivariate Student’s distribution with 5 degrees of freedom and parameters $(0.3,0.7,-0.7)$ ;

BM11:

sub-Gaussian bivariate sub-Gaussian distribution with parameters $(0.1,1.5)$ , Kallenberg and Ledwina, (1999).

Each panel in Figures 1 and 2 only has a short label related to the above detailed description. Moreover, to increase readability, each label also contains two numbers $M$ and $m$ which are the maximal and minimal values, respectively, of the corresponding $q(u,v)$ for $(u,v)\in(0,1)^{2}$ .

3 Symmetrized estimate of $q$

3.1 Motivation

Let $(X_{1},Y_{1}),...,(X_{n},Y_{n})$ be independent and identically distributed random vectors drawn from a bivariate distribution function $H$ with continuous marginals $F$ and $G$ . Furthermore, let $R_{i}$ denote the rank of $X_{i},\;i=1,...,n,$ in the sample $X_{1},...,X_{n}$ , while $S_{i}$ stands for the rank of $Y_{i},\;i=1,...,n,$ in the sample $Y_{1},...,Y_{n}$ .

A natural way of estimating $q$ , given by (1), is the plug-in method. There are several estimators of $C$ available. The first proposal presumably goes back to Ruymgaart, (1973), see pp. 6 and 12, and has the form

$$E_{n}(u,v)=\frac{1}{n}\sum_{i=1}^{n}{\bf 1}\Bigl(\frac{R_{i}}{n}\leq u,\frac{S_{i}}{n}\leq v\Bigr).$$

Monte Carlo simulations show that it is better to use

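$$C_{n}(u,v)=\frac{1}{n}\sum_{i=1}^{n}{\bf 1}\Bigl(\frac{R_{i}}{n+1}\leq u,\frac{S_{i}}{n+1}\leq v\Bigr).$$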

Under the above assumptions, $C_{n}$ and $E_{n}$ differ by $O(1/n)$ almost surely. Alternatively, one can include the continuity correction in $E_{n}(u,v)$ . Another option is to use kernel-based smooth versions of $E_{n}(u,v)$ or $C_{n}(u,v)$ ; cf. Fermanian et al., (2004), Omelka et al., (2009) and the references therein. A different smoothed variant has been proposed by Sancetta and Satchell, (2004). They introduced the Bernstein copula estimator, which has been further studied by Janssen et al., (2012), and Segers et al., (2017), among others.
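The two rank-based estimators differ only in whether the ranks are divided by $n$ or by $n+1$. A minimal sketch, assuming no ties (as for continuous margins) and using illustrative function names:

```python
def ranks(xs):
    """Ranks 1..n of the entries of xs; ties are assumed not to occur."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    r = [0] * len(xs)
    for position, i in enumerate(order, start=1):
        r[i] = position
    return r

def empirical_copula(x, y, u, v, shift=1):
    """shift=0 gives E_n (ranks / n); shift=1 gives C_n (ranks / (n + 1))."""
    n = len(x)
    d = n + shift
    hits = sum(1 for r, s in zip(ranks(x), ranks(y))
               if r / d <= u and s / d <= v)
    return hits / n

if __name__ == "__main__":
    x = [0.1, 0.4, 0.2, 0.9]
    y = [1.0, 3.0, 2.0, 4.0]
    print(empirical_copula(x, y, 0.9, 0.9, shift=0))  # E_n(0.9, 0.9)
    print(empirical_copula(x, y, 0.9, 0.9, shift=1))  # C_n(0.9, 0.9)
```

For this toy sample with $n=4$ the two values differ by exactly $1/n$, in line with the $O(1/n)$ statement above.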

It should be emphasized, however, that our ultimate goal is not the estimation of the copula itself, but a construction of some tests of independence based on the corresponding estimator of $q$ . In such an application, excessive smoothing of the underlying parameter, i.e. the copula function, is often not profitable to the power of the resulting test. On the other hand, we shall consider test statistics as functionals of the process $w(u,v)\{\check{C}_{n}(u,v)-uv\}$ , where $\check{C}_{n}$ is an estimate of the copula, while the weight is given by (2). Therefore, we have to be very careful about the behavior of the quantity $\{\check{C}_{n}(u,v)-uv\}$ near the edges of $[0,1]^{2}$ . In this sense, $C_{n}$ is not convenient for our purposes. It can be seen that the expression $Q_{n}(u,v)=w(u,v)\{C_{n}(u,v)-uv\}$ takes on very large (absolute) values near the points (0,1), (1,0) and (1,1). In contrast, empirical behavior of $Q_{n}(u,v)$ near (0,0) is satisfactory. Therefore, to estimate the numerator of $q(u,v)$ , i.e. the function

$$N(u,v)=C(u,v)-uv,$$

say, we propose and we shall apply a symmetrized variant of the random function

$$N_{n}(u,v)=C_{n}(u,v)-uv,\tag{3}$$

which exploits the useful behavior of $Q_{n}(u,v)$ in the first quadrant.
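The edge behavior of $Q_{n}(u,v)=w(u,v)\{C_{n}(u,v)-uv\}$ described above can be seen directly, because outside $[1/(n+1),\,n/(n+1)]^{2}$ the value of $C_{n}$ does not depend on the data. The sketch below assumes $w(u,v)=1/\sqrt{uv(1-u)(1-v)}$ and works with ranks only; the sample size, the seed and the evaluation points are arbitrary choices made for illustration.

```python
import math
import random

def q_n(R, S, u, v):
    """Initial estimator Q_n(u, v) = w(u, v) * (C_n(u, v) - u v), computed from ranks."""
    n = len(R)
    c_n = sum(1 for r, s in zip(R, S)
              if r / (n + 1) <= u and s / (n + 1) <= v) / n
    return (c_n - u * v) / math.sqrt(u * v * (1.0 - u) * (1.0 - v))

if __name__ == "__main__":
    n = 50
    R = list(range(1, n + 1))
    S = R[:]
    random.Random(0).shuffle(S)       # any pairing of ranks gives the same corner values
    d = 0.001                         # smaller than 1 / (n + 1)
    print(q_n(R, S, d, d))            # near (0,0): about -d / (1 - d), close to 0
    print(q_n(R, S, d, 1 - d))        # near (0,1): equal to -1
    print(q_n(R, S, 1 - d, d))        # near (1,0): equal to -1
    print(q_n(R, S, 1 - d, 1 - d))    # near (1,1): about 2
    print(q_n(R, S, 1 - d, 1 - d * d))  # off the diagonal near (1,1): much larger
```

Since $Q_{n}$ at an interior point is of order $1/\sqrt{n}$ under independence, values of size 1 or more near three of the four vertices dominate any functional of $\sqrt{n}\,Q_{n}$.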

3.2 Symmetrization

To define the symmetrization, note that the numerator $N(u,v)$ of $q(u,v)$ , for every $(u,v)\in[0,1]^{2}$ , can be rewritten in the following four forms

[TABLE]

Therefore, similarly as in (3), we can consider the following variants of estimators of $N(u,v)$ .

[TABLE]

This leads us to the symmetrized estimator of $N(u,v),\;(u,v)\in[0,1]^{2},$ given by

[TABLE]

Note that for any $(u,v)\in[0,1]^{2}$ it holds that

$$N_{n}^{*}(0,v)=N_{n}^{*}(u,0)=N_{n}^{*}(u,1)=N_{n}^{*}(1,v)=0.$$

We shall call $\{\sqrt{n}N_{n}^{*}(u,v),\;(u,v)\in[0,1]^{2}\}$ the symmetrized version of the process $\{\sqrt{n}[C_{n}(u,v)-uv],\;(u,v)\in[0,1]^{2}\}$ . Finally, the initial and symmetrized estimators of $q(u,v)$ are given by

$$Q_{n}(u,v)=w(u,v)N_{n}(u,v)\quad\mbox{and}\quad Q_{n}^{*}(u,v)=w(u,v)N_{n}^{*}(u,v),\tag{6}$$

respectively. Note that outside $[1/{(n+1)},1-1/{(n+1)}]^{2}$ the estimator $Q_{n}^{*}(u,v)$ is deterministic, bounded, and it tends to 0 when its arguments approach the edges of $[0,1]^{2}$ . This makes a great difference in finite sample behavior in comparison with $Q_{n}(u,v)$ , given in (6) as well.

We shall also consider a smoothed variant of $Q_{n}^{*}$ . To be specific, for some $s\geq 0$ we shall use

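$$Q_{n,s}^{*}(u,v)=\Bigl(\frac{2s}{n+1}\Bigr)^{-2}\int_{u-\frac{s}{n+1}}^{u+\frac{s}{n+1}}\int_{v-\frac{s}{n+1}}^{v+\frac{s}{n+1}}Q_{n}^{*}(x,y)\,dy\,dx.$$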

The three above defined estimators of $q$ have the following useful property:

Proposition 2.

Let $(X_{1},Y_{1}),...,(X_{n},Y_{n})$ be independent and identically distributed random vectors drawn from a bivariate population $(X,Y)$ obeying a joint distribution function $H$ with continuous marginals $F$ and $G$ . Under the independence of $X$ and $Y$ , given $(u,v)\in(0,1)^{2}$ , it holds that

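$$\sqrt{n}\,Q_{n}(u,v)\stackrel{{\scriptstyle D}}{{\rightarrow}}N(0,1),\quad\sqrt{n}\,Q_{n}^{*}(u,v)\stackrel{{\scriptstyle D}}{{\rightarrow}}N(0,1)\quad\mbox{and}\quad\sqrt{n}\,Q_{n,s}^{*}(u,v)\stackrel{{\scriptstyle D}}{{\rightarrow}}N(0,1),$$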

where $\stackrel{{\scriptstyle D}}{{\rightarrow}}$ stands for convergence in distribution.

This, in particular, makes it possible to immediately see on the graphs of the estimates of $q(u,v)$ some evidence indicating in which part of the population and to which extent (at least roughly) the independence is invalidated. Formal tests, based on some functionals of these estimators, are defined in Section 4.
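The pointwise standardization in Proposition 2 can be illustrated by simulation. The sketch below uses the initial estimator $Q_{n}$, the only one fully specified by $C_{n}$ and $w$ alone, with $w(u,v)=1/\sqrt{uv(1-u)(1-v)}$ assumed. Under independence with continuous margins the pairing of ranks is a uniformly random permutation, so no particular marginal distribution has to be simulated. The values of `n`, `reps` and `seed` are arbitrary.

```python
import math
import random

def sqrt_n_q_n(R, S, u, v):
    """sqrt(n) * Q_n(u, v), computed from the two rank vectors."""
    n = len(R)
    c_n = sum(1 for r, s in zip(R, S)
              if r / (n + 1) <= u and s / (n + 1) <= v) / n
    return math.sqrt(n) * (c_n - u * v) / math.sqrt(u * v * (1.0 - u) * (1.0 - v))

def simulate_null(n, u, v, reps, seed=0):
    """Draws of sqrt(n) * Q_n(u, v) under independence."""
    rng = random.Random(seed)
    R = list(range(1, n + 1))
    values = []
    for _ in range(reps):
        S = R[:]
        rng.shuffle(S)                 # random pairing of the ranks
        values.append(sqrt_n_q_n(R, S, u, v))
    return values

if __name__ == "__main__":
    vals = simulate_null(n=100, u=0.5, v=0.5, reps=2000)
    mean = sum(vals) / len(vals)
    var = sum((x - mean) ** 2 for x in vals) / (len(vals) - 1)
    print(mean, var)                   # roughly 0 and 1
```

On a plot of the estimate, points where $\sqrt{n}$ times the estimate exceeds, say, 2 in absolute value are therefore rough pointwise indications of departure from independence; this reading is informal and ignores multiplicity across $(u,v)$.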

4 New weighted test statistics and their properties

For continuous random variables, testing for independence is equivalent to verifying whether the true copula is equal to the independence copula. Therefore, we consider the null hypothesis of the form

$${\bf H_{0}}:\;C(u,v)=uv,\quad\mbox{for}\;(u,v)\in[0,1]^{2}$$

and study test statistics defined as some functionals of the estimated difference $C(u,v)-uv$ .

4.1 Integral statistics

Weighted integral copula-based statistics of the form

$$n\int_{0}^{1}\int_{0}^{1}u^{2\gamma}v^{2\delta}\{E_{n}(u,v)-uv\}^{2}\,du\,dv,$$

where $\gamma>-1/2$ and $\delta>-1/2$ , were studied in Deheuvels et al., (2006). Recently, Berghaus and Segers, (2018) have introduced the independence statistic which in the two dimensional case reads as

$$n\int_{0}^{1}\int_{0}^{1}\{g(u,v)\}^{-\gamma}\{C_{n}^{\beta}(u,v)-uv\}^{2}\,du\,dv,$$

where $C_{n}^{\beta}$ is the empirical beta copula - a particular case of the empirical Bernstein copula, $\gamma\in[0,2)$ , and $g(u,v)=\min\{u,v,1-\min(u,v)\}$ .

We shall consider the (standardized) $L_{r}$ -norm of $Q_{n}^{*}(u,v)$

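$${\cal L}_{r,n}^{*}=\sqrt{n}\Bigl\{\int_{0}^{1}\int_{0}^{1}\big|Q_{n}^{*}(u,v)\big|^{r}\,du\,dv\Bigr\}^{1/r}.\tag{9}$$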

In view of the form of $Q_{n}^{*}$ , cf. (6), ${\cal L}_{r,n}^{*}$ is a weighted integral-type statistic for the symmetrized version of the process $\{\sqrt{n}[C_{n}(u,v)-uv]\}$ , where the weight is given by (2). For relatively small $r$ one could find a closed expression for ${\cal L}_{r,n}^{*}$ . In general, for reasonably large values of $n$ , it suffices to approximate the value of ${\cal L}_{r,n}^{*}$ by $\frac{\sqrt{n}}{(n+1)^{2}}\bigl{\{}\sum_{i=1}^{n}\sum_{j=1}^{n}\big{|}Q_{n}^{*}(\frac{i+0.5}{n+1},\frac{j+0.5}{n+1})\big{|}^{r}\bigr{\}}^{1/r}$ .
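A grid approximation of an $L_{r}$-type statistic can be sketched generically. The function below takes any callable `Q` standing in for an estimator of $q$ and applies a plain midpoint rule on an `m` by `m` grid; the grid size `m` and the function name are our own choices, and the grid is not claimed to coincide with the one used in the authors' implementation.

```python
import math

def l_r_statistic(Q, n, r, m=200):
    """sqrt(n) * (integral over [0,1]^2 of |Q|^r)^(1/r), by an m x m midpoint rule."""
    total = 0.0
    for i in range(m):
        u = (i + 0.5) / m
        for j in range(m):
            v = (j + 0.5) / m
            total += abs(Q(u, v)) ** r
    # total / m^2 approximates the double integral of |Q|^r
    return math.sqrt(n) * (total / (m * m)) ** (1.0 / r)

if __name__ == "__main__":
    # Sanity check on a function with a known integral: Q(u, v) = u, r = 2, n = 1
    print(l_r_statistic(lambda u, v: u, n=1, r=2), math.sqrt(1.0 / 3.0))
```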

To achieve consistency of statistics like (9), using some existing results on the classical empirical copula process, we have to slightly modify the integration area in (9) around the vertices of $[0,1]^{2}$ . For an illustration, given $\epsilon>0$ , we shall consider

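$$A(\epsilon)=[0,1]^{2}\setminus\bigl\{[0,\epsilon]^{2}\cup[1-\epsilon,1]\times[0,\epsilon]\cup[1-\epsilon,1]^{2}\cup[0,\epsilon]\times[1-\epsilon,1]\bigr\}$$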

and the modified variant of ${\cal L}_{r,n}^{*}$ given by

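$${\cal L}_{\epsilon,r,n}^{*}=\sqrt{n}\Bigl\{\int_{A(\epsilon)}|Q_{n}^{*}(u,v)|^{r}\,du\,dv\Bigr\}^{1/r}.$$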
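The modified integration area removes a square of side $\epsilon$ at each vertex of $[0,1]^{2}$. A minimal sketch of the membership test and of the corresponding trimmed midpoint-rule integral, assuming $\epsilon<1/2$; `Q` is any callable estimate of $q$, and `m` and the function names are illustrative.

```python
import math

def in_A(u, v, eps):
    """True if (u, v) lies in [0,1]^2 with the four corner squares of side eps removed."""
    if not (0.0 <= u <= 1.0 and 0.0 <= v <= 1.0):
        return False
    u_near_edge = u <= eps or u >= 1.0 - eps
    v_near_edge = v <= eps or v >= 1.0 - eps
    # A point is in a corner square exactly when both coordinates are near an edge.
    return not (u_near_edge and v_near_edge)

def l_eps_r_statistic(Q, n, r, eps, m=200):
    """sqrt(n) * (integral over A(eps) of |Q|^r)^(1/r), by an m x m midpoint rule."""
    total = 0.0
    for i in range(m):
        u = (i + 0.5) / m
        for j in range(m):
            v = (j + 0.5) / m
            if in_A(u, v, eps):
                total += abs(Q(u, v)) ** r
    return math.sqrt(n) * (total / (m * m)) ** (1.0 / r)

if __name__ == "__main__":
    # With Q = 1 and r = 2 the result is the square root of the area of A(eps).
    print(l_eps_r_statistic(lambda u, v: 1.0, n=1, r=2, eps=0.1), math.sqrt(1 - 4 * 0.1 ** 2))
```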

To derive the consistency of ${\cal L}_{\epsilon,r,n}^{*}$ , some smoothness assumptions have to be imposed on the underlying copula $C$ . The following non-restrictive requirements have been formulated in Segers, (2012) and turn out to be sufficient in many practically important situations. For more details, see Segers, (2012), and Berghaus and Segers, (2018).

Assumption 0.

1. $\frac{\partial}{\partial u}C(u,v)$ and $\frac{\partial}{\partial v}C(u,v)$ exist and are continuous on $R_{1}=(0,1)\times[0,1]$ and $R_{2}=[0,1]\times(0,1)$ , respectively;

2. $\frac{{\partial}^{2}}{\partial u\partial v}C(u,v)$ exists and is continuous on $R_{1}\cap R_{2}$ ;

3. There exists a constant $K>0$ such that $\;\;\Bigl{|}\frac{{\partial}^{2}}{\partial u\partial v}C(u,v)\Bigr{|}\leq K\min\Bigl{\{}\frac{1}{u(1-u)},\frac{1}{v(1-v)}\Bigr{\}},\;$ for $(u,v)\in R_{1}\cap R_{2}$ .

Proposition 3.

Assume that the underlying copula $C$ satisfies Assumption 0. Consider any $r>2$ and a positive $\epsilon=\epsilon_{n}$ . Suppose that $\epsilon_{n}\to 0$ in such a way that $\epsilon_{n}n^{1/(r+2)}\to\infty$ , as $n\to\infty$ . Then, under the alternative corresponding to $C$ , the test rejecting ${\bf H_{0}}$ for large values of ${\cal L}_{\epsilon,r,n}^{*}$ is consistent.

4.2 Minimum $p$ -value statistics

A test rejecting ${\bf H_{0}}$ for large values of the distance ${\cal L}_{\epsilon,r,n}^{*}$ is expected to be especially sensitive to alternatives shifting the probability mass towards the edges of $[0,1]^{2}$ . In contrast, the statistic proposed in Heller et al., (2013) proves to be very efficient in detecting some noisy functional relationships. Therefore, it seems useful to propose a procedure which combines advantages of both solutions. Our approach to this question is via combining $p$ -values. For this purpose denote by ${\cal T}_{n}$ the rank variant of the statistic, based on pairwise distances, introduced in Heller et al., (2013). Given the sample $(X_{1},Y_{1}),...,(X_{n},Y_{n})$ , let $p^{(1)}_{n}$ denote $p$ -value of ${\cal T}_{n}$ and let $p^{(2)}_{n}$ be respective $p$ -value of ${\cal L}_{\epsilon,r,n}^{*}$ .

We propose to reject ${\bf H_{0}}$ for small values of

$${\cal M}_{n}^{*}=\min\bigl\{p^{(1)}_{n},\,p^{(2)}_{n}\bigr\}.$$

The idea of a minimum $p$-value statistic goes back to Tippett (1931).

4.3 Supremum-type solutions

Another classical approach to measuring departures from ${\bf H_{0}}$ is by taking a supremum norm of $Q_{n}^{*}$ . More precisely, given $\kappa=\kappa_{n}\in(0,1/2)$ one can consider

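$${\cal D}_{\kappa,0,n}^{*}=\sup_{(u,v)\in[\kappa,1-\kappa]^{2}}\bigl|\sqrt{n}\,Q_{n}^{*}(u,v)\bigr|.\qquad(12)$$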

Analogously to ${\cal L}_{r,n}^{*}$ , ${\cal D}_{\kappa,0,n}^{*}$ can be interpreted as weighted sup-type statistic. However, extensive simulations, which shall be partially reported in Section 5, have shown that it is not necessarily a very powerful solution and some smoothing of $Q_{n}^{*}$ improves the finite sample behavior of such constructions. Therefore, we shall take $Q_{n,s}^{*}(u,v)$ into account and introduce the corresponding class of supremum type statistics

$${\cal D}_{\kappa,s,n}^{*}=\sup_{(u,v)\in[\kappa,1-\kappa]^{2}}\bigl|\sqrt{n}\,Q_{n,s}^{*}(u,v)\bigr|.$$

Obviously, ${\cal D}_{\kappa,s,n}^{*}$ with $s=0$ coincides with the solution (12). Supremum type statistics are particularly convenient for interpretation of obtained values of the selected estimator of the measure $q$ and at least rough, but practically immediate, evaluation of the extent to which ${\bf H_{0}}$ is possibly invalidated.

Proposition 4.

Suppose that requirement 1 of Assumption 0 holds for $C$. Consider positive $\kappa=\kappa_{n}$ such that $\kappa_{n}\to 0$ and $\sqrt{n}\kappa_{n}\to\infty$, as $n\to\infty$. Then, tests rejecting ${\bf H_{0}}$ for large values of ${\cal D}_{\kappa,0,n}^{*}$ and ${\cal D}_{\kappa,s,n}^{*}$, respectively, are consistent under the alternative $C$.
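To see what the condition on $\kappa_{n}$ in Proposition 4 permits, consider the illustrative family $\kappa_{n}=n^{-a}$ with $0<a<1/2$ (our own example, not a choice made in the paper):

$$\kappa_{n}=n^{-a}\to 0,\qquad \sqrt{n}\,\kappa_{n}=n^{1/2-a}\to\infty .$$

With $a=1/4$, say, $\sqrt{n}\,\kappa_{n}=n^{1/4}$. In contrast, $\kappa_{n}=1/\sqrt{n}$ shrinks too fast, since then $\sqrt{n}\,\kappa_{n}=1$ for all $n$.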

5 Simulated powers

We shall study the empirical behavior of the following statistics:

• the statistic rank-dCov of Székely and Rizzo (2009), Section 4.3; we shall concisely denote this variant by $dCov$, as also done in Heller et al. (2016), p. 17;

• the rank-based variant of the statistic introduced in Heller et al. (2013), denoted by HHG, similarly as proposed in Heller et al. (2016), p. 17;

• the empirical likelihood ratio test $VT_{n}$, defined on p. 160 of Vexler et al. (2014);

• ${\cal L}_{\epsilon,r,n}^{*}$ for two choices of $r$, $r=2$ and $r=6$, with $\epsilon=0.01$;

• ${\cal D}_{\kappa,s,n}^{*}$ for two values of $s$, $s=0$ and $s=4$, with $\kappa=0.025$; as usual in this area, the supremum in the Kolmogorov-Smirnov statistic was replaced by a maximum over a grid of $(n+1)\times(n+1)$ points;

• ${\cal M}_{n}^{*}$ using HHG and ${\cal L}_{\epsilon,6,n}^{*}$ with $\epsilon=0.01$.

The outcomes of our Monte Carlo experiments, done for $n=100$ and the significance level $\alpha=0.05$ , are collected in Table 1. Table 1 presents empirical powers under the 22 models introduced in Section 2.

The simulation results show that ${\cal D}_{\kappa,4,n}^{*}$ with $\kappa=0.025$ outperforms (on average) the rank distance covariance and also, slightly, the empirical likelihood ratio test. The rank-based statistic HHG is slightly more powerful (on average) than ${\cal D}_{\kappa,4,n}^{*}$, while our second solution ${\cal L}_{\epsilon,6,n}^{*}$ with $\epsilon=0.01$ provides some improvement over the two last mentioned statistics. The most powerful solution turns out to be our third, ${\cal M}_{n}^{*}$, which combines the advantages of HHG (high power for regression models) with the benefits of ${\cal L}_{\epsilon,6,n}^{*}$ (sensitivity to heavy tails). It is also worth noting that ${\cal D}_{\kappa,4,n}^{*}$ turns out to be much less sensitive to the choice of $\kappa$ than ${\cal D}_{\kappa,0,n}^{*}$. In particular, under our simulation scheme, for $\kappa<0.025$ the statistic ${\cal D}_{\kappa,0,n}^{*}$ works much worse than its smoothed version ${\cal D}_{\kappa,4,n}^{*}$.

In view of the simulation results it appears that the class of statistics ${\cal L}_{\epsilon,r,n}^{*}$ shows considerable potential for further use. On the one hand, the integration process naturally smooths $Q_{n}^{*}$, and extra smoothing seems to be unnecessary. We have also investigated some smoother integrands in $L_{r}$-type norm statistics, but they have not resulted in more stable powers, and no average gain of power has been noticed. So, this is one advantage of such a solution over the supremum-type ones. On the other hand, it is well known that the $L_{r}$-norm, as $r\to\infty$, approximates the supremum norm. Therefore, the class is flexible and rich enough. Needless to say, such smooth functionals of weighted empirical processes are also easier to analyze than weighted supremum-type statistics. Obviously, several questions arise in this context. The first one is the choice of $r$. In the course of the present simulation study we have simply inspected $r=2,...,8$ and decided on $r=6$. However, it is possible to study the problem more deeply and carefully, for example by calculating the asymptotic relative efficiency of ${\cal L}_{\epsilon,r,n}^{*}$ with respect to some standard, and investigating the classes of alternatives for which particular recommendations about $r$ are optimal. A recent paper by Inglot et al. (2018) provides tools that make it possible to answer such a question. Obviously, solving such a problem is a fairly non-trivial task.
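The remark that the $L_{r}$-norm approaches the supremum norm as $r$ grows can be checked numerically. The sketch below is only an illustration on a handful of made-up values with uniform weights; `lr_norm` and `grid_values` are hypothetical names and the numbers are not taken from the paper.

```python
def lr_norm(values, r):
    """Discrete L_r norm with uniform weights: (mean of |x|^r)^(1/r)."""
    n = len(values)
    return (sum(abs(x) ** r for x in values) / n) ** (1.0 / r)


# Made-up values of a bounded function on a small grid (illustration only).
grid_values = [0.2, -1.5, 0.7, 3.0, -0.4]

# Norms for increasing r, and the supremum (here: maximum) norm.
norms = {r: lr_norm(grid_values, r) for r in (2, 6, 20, 100)}
sup_norm = max(abs(x) for x in grid_values)

for r, value in norms.items():
    print(f"r = {r:3d}: L_r norm = {value:.4f}")
print(f"sup norm  = {sup_norm:.4f}")
```

With uniform weights the values increase with $r$ and stay below the maximum, which is the sense in which moderate $r$ interpolates between an averaging functional and a supremum-type one.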

Finally, note that the test statistic ${\cal M}_{n}^{*}$ performs very well. It has a relatively simple structure which nicely reflects the advantages of its ingredients. The solution is similar in spirit to a much more complex one proposed in Heller et al. (2016). The latter combines $p$-values of chi-square-type tests over increasingly fine data-dependent sample space partitions.

6 Application

We demonstrate our testing procedures on a data set of $n=230$ aircraft span (X) and speed (Y) data, on log scales, from the years 1956-1984, collected by Saviotti, and reported and studied in Bowman and Azzalini (1997). Standard empirical correlation measures, such as Pearson’s, Spearman’s, and Blomqvist’s rank statistics, applied to these data, do not invalidate independence, as their $p$-values are well above 0.7. This suggests that any dependence structure, if present at all, is likely to be nonlinear. Székely and Rizzo (2009) and Heller et al. (2013) have applied their tests to these data and have obtained very small $p$-values (less than 0.00001). In conclusion, the above evidence implies a strong nonlinear dependence structure.

These data were also analyzed by Jones and Koch (2003) and Berentsen and Tjøstheim (2014). The two papers contain a presentation of some empirical dependence measures, defined on $\mathbb{R}^{2}$. Jones and Koch (2003) started with a local dependence function of Holland and Wang (1987) and proposed a so-called dependence map. The result of this approach suggests a rather complicated picture of the joint behavior of $X$ and $Y$; cf. their Figure 2. In turn, Berentsen and Tjøstheim (2014) applied another local correlation concept, namely the local Gaussian correlation introduced in Tjøstheim and Hufthammer (2013). The approaches by Jones and Koch (2003) and Berentsen and Tjøstheim (2014) are related to estimation and modeling of bivariate densities on $\mathbb{R}^{2}$, respectively. The second solution is especially technically involved. The overall picture resulting from both implementations is very similar; cf. Figure 4 in Berentsen and Tjøstheim (2014): on the plane there are three separated regions in which local dependence is positive and one region in which it is negative. There are also areas on the plane with no local dependence between log(span) and log(speed).

In Figure 3 we show our two estimators $Q_{n}^{*}$ and $Q_{n,4}^{*}$ for these data. The quantile dependence function $q$, relying on the copula, separates the dependence structure from any marginal effects. At first glance it can be noticed that the dependence pattern revealed by the estimated $q$ is much simpler than the above mentioned pictures suggest. Both our displays show two separated regions of strong dependence (positive and negative) of spans and speeds. Spans in the range from the second to the ninth decile are positively correlated with speeds lying below their median. For speeds above the median and below the ninth decile, a strong negative trend is exhibited. There is also a large region of $u$-quantiles and $v$-quantiles in which the variables span and speed seem to be unrelated.

It is visible at first glance that in some regions of points $(u,v)$ of the right-hand panel in Figure 3 the absolute value of $\sqrt{n}Q^{*}_{n,4}(u,v)$ exceeds values which, in view of Proposition 2, are highly improbable under the null hypothesis. For instance, $\sqrt{n}Q^{*}_{n,4}(0.106,0.742)\approx-5.9$. Arguing formally, all $p$-values of our new statistics ${\cal D}_{\kappa,s,n}^{*},\;s=0,4,\;\kappa=0.025$, ${\cal L}_{\epsilon,r,n}^{*},\;r=2,6,\;\epsilon=0.01$, and the related ${\cal M}_{n}^{*}$ are practically 0.
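To put the value $-5.9$ in perspective, the following sketch computes the two-sided tail probability of a standard normal variable at that point, using the pointwise $N(0,1)$ limit under ${\bf H_{0}}$ from Proposition 2. This is only a rough pointwise check: it ignores that the point was picked after looking at the whole plot, so it is not a substitute for the formal tests. The function name is hypothetical.

```python
import math


def two_sided_normal_tail(z):
    """P(|Z| >= |z|) for a standard normal variable Z."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# Value of sqrt(n) * Q*_{n,4}(0.106, 0.742) reported in the text.
observed = -5.9
p_pointwise = two_sided_normal_tail(observed)
print(f"pointwise two-sided tail probability: {p_pointwise:.2e}")
```

The resulting probability is of order $10^{-9}$, which is in line with the statement that such values are highly improbable under the null hypothesis.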

7 Conclusion

In this article, we proposed a framework for measuring and visualizing the dependence structure, and constructing formal tests of independence of two random variables. Our approach exploits the quantile dependence function $q$ , a recently introduced local dependence measure. The function $q$ gives a detailed picture of the underlying dependence structure. It provides a means to carefully examine local association structure at different quantile levels. The measure distinguishes between negative and positive quadrant dependence and can be immediately generalized to the multivariate case. We have proposed three new tests based on simple and useful nonparametric estimators of the measure. Estimating the measure naturally leads to some weighted copula processes. Our test statistics are based on classical supremum-type and integral-type functionals of the related processes. Both the measure and the corresponding test statistics are invariant under strictly increasing transformations of observations. These statistics are easy and fast to calculate. A more refined solution has been proposed as well. Some consistency results are stated and proved, and extensive evidence on stability of empirical powers of new solutions is provided. Finally, we have applied our approach to analyze Saviotti aircraft data. This application shows the usefulness of the proposed inference tools in two respects: as a simple and reliable graphical device allowing for visual inspection of regions of evident departures from independence and formal tests verifying validity of the independence structure.

8 Supplementary Materials

**Appendix A: Proofs**

**A.1 Proof of Proposition 2**

The first part of (8) follows immediately from Theorem 3 in Fermanian et al. (2004) and the forms of the limiting process and pertaining covariance.

To prove the second statement of (8), let us introduce an auxiliary rank process and recall the abbreviated notation for our weight function

[TABLE]

where $C_{n}(u,v)$ is given by (3). With this notation, for each fixed $(u,v)\in(0,1)^{2}$ it holds that $w(u,v)\mathbb{C}_{n}^{(1)}(u,v)=\sqrt{n}Q_{n}(u,v)=\sqrt{n}w(u,v)N_{n}^{(1)}(u,v)$. Hence $\sqrt{n}w(u,v)N_{n}^{(1)}(u,v)\stackrel{D}{\rightarrow}N(0,1)$, provided that ${\bf H_{0}}$ is true. The last statement is the key observation for completing the proof of the second part of (8). Indeed, consider the succeeding expression $N_{n}^{(2)}(u,v)$ appearing in $N_{n}^{*}(u,v)$, which in turn defines $Q_{n}^{*}(u,v)$. We have

[TABLE]

where $\lfloor\bullet\rfloor$ denotes the integer part of the number $\bullet$. Analogously,

[TABLE]

and

[TABLE]

Obviously, the relations (A.2)-(A.4) hold irrespective of whether ${\bf H_{0}}$ is true or not. Under ${\bf H_{0}}$, the asymptotic normality of $\sqrt{n}w(u,v)N_{n}^{(1)}(u,v)$ and the above yield the required statement for $\sqrt{n}Q_{n}^{*}(u,v)$.

Now we shall show that $Q_{n}^{*}$ and $Q_{n,s}^{*}$ are close enough to infer the last statement in (8) from the middle one. For this purpose take any $(u_{0},v_{0})\in(0,1)^{2}$ . We claim that for $(u_{0},v_{0})$ it holds that: there exists $n_{0}\in\mathbb{N}$ and a constant $c(n_{0},u_{0},v_{0},s)$ such that for all $n\geq n_{0}$

[TABLE]

Without loss of generality assume that $(u_{0},v_{0})\in(0,1/2)^{2}$ . Set

[TABLE]

Then, there exists $n_{0}\in\mathbb{N}$ such that $B_{n,s}(u_{0},v_{0})\subset(0,1/2)^{2}.$ Moreover, for any $n\in\mathbb{N}$ it holds that $B_{n+1,s}(u_{0},v_{0})\subset B_{n,s}(u_{0},v_{0})$ . Hence, for any $n\geq n_{0}$

[TABLE]

Since $w(u,v)$ is continuous and bounded on $B_{n_{0},s}(u_{0},v_{0})$, there exists $\bar{c}(n_{0},u_{0},v_{0},s)$ such that for all $n\geq n_{0}$

[TABLE]

By (A.6) and (A.7),

[TABLE]

This proves (A.5) and finally yields the last conclusion in (8). $\Box$

In view of (5) and analogous relations for $N_{n}(u,v)$, the zero set of the denominator in (6) is included in the zero set of the numerators in (6). Hence, we can additionally define $Q_{n}$ and $Q_{n}^{*}$ on this set to be 0. With this convention, the sample paths of $Q_{n}$ and $Q_{n}^{*}$ are bounded on $[0,1]^{2}$. This enables us to treat $Q_{n}$ and $Q_{n}^{*}$ as random elements with values in ${\ell}^{\infty}([0,1]^{2})$. This observation helps us to use below some ready results on the weighted copula process.

**A.2 Proof of Proposition 3**

We shall consider the integral ${\cal L}_{\epsilon,r,n}^{*}$ on four subsets of $A({\epsilon})$ in (10), separately. On the set $A^{(1)}(\epsilon)=[0,1/2]^{2}\setminus[0,\epsilon]^{2}$ it holds that $\sqrt{n}Q_{n}^{*}(u,v)=w(u,v)\mathbb{C}_{n}^{(1)}(u,v)$ , where $\mathbb{C}_{n}^{(1)}(u,v)$ is defined in (A.1). On the set $A^{(3)}(\epsilon)=[1/2,1]^{2}\setminus[1-\epsilon,1]^{2}$ we have

[TABLE]

Moreover, the process $\sqrt{n}\Bigl\{\frac{1}{n}\sum_{i=1}^{n}{\bf 1}\Bigl(\frac{R_{i}}{n+1}>1-s,\frac{S_{i}}{n+1}>1-t\Bigr)-st\Bigr\}$ has the same distribution as the process $\mathbb{C}_{n}^{(3)}(s-,t-)$, where $\mathbb{C}_{n}^{(3)}$ is the rank process for the sample $(f(X_{i}),g(Y_{i})),\;i=1,...,n$, and the functions $f$ and $g$ are strictly decreasing. This is so because the ranks of the transformed observations, $R_{i}^{\prime}$ and $S_{i}^{\prime}$, say, are related to $R_{i}$ and $S_{i}$ as follows: $R_{i}^{\prime}=n+1-R_{i},\;S_{i}^{\prime}=n+1-S_{i}$. For the two remaining subsets of the integration area $A({\epsilon})$ a similar argument applies. It shows that the asymptotic behavior of ${\cal L}_{\epsilon,r,n}^{*}$ is determined via the pertaining asymptotics of the variables

[TABLE]

Moreover,

[TABLE]

On the other hand, under the Assumption 0, by Theorem 2.2 of Berghaus et al. (2017), the empirical copula process

[TABLE]

for $(u,v)\in J_{n}$ is well approximated, in some weighted supremum norm, by pertaining bivariate empirical process $\bar{\mathbb{C}}_{n}(u,v)$ . To define $\bar{\mathbb{C}}_{n}(u,v)$ set $U_{i}=F(X_{i}),V_{i}=G(Y_{i}),i=1,...,n$ , and

[TABLE]

Next, introduce the (unobservable) empirical process $\alpha_{n}$ , based on $(U_{1},V_{1}),...,(U_{n},V_{n})$ ,

[TABLE]

and finally define

[TABLE]

where $\dot{C}_{1}(u,v)=\frac{\partial}{\partial u}C(u,v)$ and $\dot{C}_{2}(u,v)=\frac{\partial}{\partial v}C(u,v)$ . With this notation,

[TABLE]

where $g_{\omega}(u,v)=[\min\{u,v,1-u,1-v\}]^{\omega}$ and $\omega\in(0,1/2)$. Moreover, it holds that ${\bar{\mathbb{C}}_{n}(u,v)}/{g_{\omega}(u,v)}$ converges weakly in $\ell^{\infty}([0,1]^{2})$, equipped with the supremum norm, to a centered Gaussian process.

The above implies that asymptotic behavior of the linear functional ${\cal L}_{\epsilon,r,n}^{*}$ is determined by respective asymptotics of the weighted bivariate empirical process and its pertaining variants, provided that we are well controlling the magnitude of $I_{r}(\epsilon)=\int_{A({\epsilon})}[w(u,v)g_{\omega}(u,v)]^{r}dudv$ . Note also that asymptotic behavior of

[TABLE]

is decisive for the asymptotic power of ${\cal L}_{\epsilon,r,n}^{*}$ under the alternative pertaining to $C$. By symmetries of $w$ and $g_{\omega}$, we have

[TABLE]

In the last integral the behavior of $\int_{\epsilon}^{1/2}\int_{0}^{v}[u^{\omega}/\sqrt{uv}]^{r}dudv$ is crucial. Hence, the requirement $\omega>1/2-1/r$ follows. This implies our assumption $r>2.$

The above yields

[TABLE]

where $c_{1}$ and $c_{2}$ are absolute constants. Hence, under our assumptions on ${\epsilon_{n}}$ , $I_{r}(\epsilon_{n})=o(\sqrt{n})$ . The above implies that asymptotics of ${\cal L}_{r,n}^{*}(\epsilon_{n})$ , under $C$ , is determined by the two terms: a random component, which is at most $o_{P}(\sqrt{n})$ and an asymptotic shift, which is at least $O(\sqrt{n})$ . This ensures the consistency. $\Box$

**A.3 Proof of Proposition 4**

Due to requirement 1 of Assumption 0, by Proposition 3.1 of Segers (2012), the process $\hat{\mathbb{C}}_{n}$, given in equation (A.8), converges weakly in $\ell^{\infty}([0,1]^{2})$ to the Gaussian process $\mathbb{C}$, defined by (3.1) in Segers (2012). This implies that for any $\kappa=\kappa_{n}$, $\kappa_{n}\to 0$ as $n\to\infty$,

[TABLE]

On the other hand, by (6),

[TABLE]

Under the alternative $C$ , the second term in (A.13) is, in $[\kappa_{n},1-\kappa_{n}]^{2}$ , at least $O({\sqrt{n}})$ . Therefore, the assertion (A.12) along with the assumptions on $\kappa_{n}$ yield the consistency of $\sup_{(u,v)\in[\kappa,1-\kappa]^{2}}|\sqrt{n}Q_{n}(u,v)|$ .

The case of $\sqrt{n}Q^{*}_{n}(u,v)$ can be treated similarly since, by (A.2)-(A.4), an analogue of (A.13) contains only an immaterial extra term, which is at most $O(1/(\sqrt{n}\kappa_{n}))$. Hence, for the $\kappa_{n}$ under consideration, the consistency of ${\cal D}^{*}_{\kappa,0,n}$ follows.

To prove consistency of the test rejecting ${\bf H_{0}}$ for large values of ${\cal D}^{*}_{\kappa,s,n}$, observe that it always holds that ${\cal D}^{*}_{\kappa,s,n}\leq{\cal D}^{*}_{\kappa,0,n}$. Let $c_{\alpha,n}$ be a critical value of the $\alpha$-level test based on ${\cal D}^{*}_{\kappa,0,n}$. By the above, $c_{\alpha,n}=O(1/\kappa_{n})$. On the other hand, by (A.5), under any alternative defined by the underlying $C$,

[TABLE]

In view of ${\cal D}^{*}_{\kappa,s,n}\leq{\cal D}^{*}_{\kappa,0,n}$ the proof of the consistency of the smoothed variant is concluded. $\Box$

References

(1) Berghaus, B., Bücher, A., Volgushev, S. (2017). Weak convergence of the empirical copula process with respect to weighted metrics. Bernoulli 23, 743-772.
(2) Fermanian, J.-D., Radulović, D., Wegkamp, M. (2004). Weak convergence of empirical copula processes. Bernoulli 10, 847-860.
(3) Segers, J. (2012). Asymptotics of empirical copula processes under non-restrictive smoothness assumptions. Bernoulli 18, 764-782.

**Appendix B: Data set referenced in the article**

Aircraft span and speed data, from the third period, 1956-1984, are available in electronic form from A.W. Bowman and A. Azzalini (2007). R package ‘sm’: Nonparametric smoothing methods (version 2.2); https://cran.r-project.org/web/packages/sm/sm.pdf

**Appendix C: Description and Codes**

In this Section we provide some C code for the computation of ${\cal Q}_{n}^{*}$, ${\cal Q}_{n,s}^{*}$, ${\cal L}^{*}_{\epsilon,r,n},\;{\cal D}^{*}_{\kappa,0,n},\;{\cal D}^{*}_{\kappa,s,n}$, and comment on the calculation of ${\cal M}_{n}^{*}$. We start with some preliminary information.

**C.1. Preliminaries**

**Calculation of $Q^{*}_{n}$**

Let us denote by $R=(R_{1},...,R_{n})$ and $S=(S_{1},...,S_{n})$ the vectors of ranks related to $(X_{1},...,X_{n})$ and $(Y_{1},...,Y_{n})$ , respectively. Set $R^{\prime}_{i}=n+1-R_{i}$ and $S^{\prime}_{i}=n+1-S_{i}$ , $i=1,...,n$ . Next, introduce the notation $R^{\prime}=(R^{\prime}_{1},...,R^{\prime}_{n})$ and $S^{\prime}=(S^{\prime}_{1},...,S^{\prime}_{n})$ for vectors of transformed ranks. Additionally define $u^{\prime}=1-u$ and $v^{\prime}=1-v$ . Recall also that, using $R$ and $S$ , we consider empirical copula of the form

[TABLE]

With this notation, the functions $N_{n}^{(k)}(u,v),\;k=1,...,4$, appearing in the definition of $N_{n}^{*}(u,v)$, and hence in the formula (6) for $Q_{n}^{*}(u,v)$, have the following alternative forms

[TABLE]

for any $(u,v)\notin\{1/(n+1),...,n/(n+1)\}^{2}$. That is why our computer program calculates the transformed ranks and evaluates the empirical copula for both the ranks and the transformed ranks. Tables "Ctab", "Ctabs12", "Ctabs22", "Ctabs21" contain values of the empirical copulas $C_{n}(u,v;R,S)$, $C_{n}(u,v^{\prime};R,S^{\prime})$, $C_{n}(u^{\prime},v^{\prime};R^{\prime},S^{\prime})$, $C_{n}(u^{\prime},v;R^{\prime},S)$, corresponding to a grid of points $(u_{i},v_{j})$, where

[TABLE]
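The rank transformation used above can be sketched as follows. The snippet is a minimal illustration, assuming a sample without ties; the function `ranks` and the toy sample `x` are hypothetical and are not part of the authors' C program. It also checks the fact, used in the proof of Proposition 3, that applying a strictly decreasing function to the observations turns each rank $R_{i}$ into $n+1-R_{i}$.

```python
def ranks(values):
    """Ranks 1..n of the observations (assumes there are no ties)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for position, index in enumerate(order, start=1):
        result[index] = position
    return result


# Toy sample (illustration only).
x = [2.3, -0.7, 5.1, 0.4]
n = len(x)

R = ranks(x)
# Transformed ranks R'_i = n + 1 - R_i.
R_prime = [n + 1 - r for r in R]

# Ranks after a strictly decreasing transformation, here x -> -x.
R_of_negated = ranks([-value for value in x])
print(R, R_prime, R_of_negated)
```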

Notice that calculating $C_{n}(u,v)$ on the grid $(u_{i},v_{j})$, $i=0,...,n$, $j=0,...,n$, directly from the initial formula (C.1) is very inefficient, since for every point of the grid we have to calculate a sum of $n$ indicators. The computational complexity of that approach is of order $n^{3}$. A much better method is the following recursion. If we sort the vectors $(R_{1},S_{1}),...,(R_{n},S_{n})$ in ascending order according to the first coordinate, we obtain the vectors $(1,S_{[1]}),...,(n,S_{[n]})$, where $S_{[k]}$ is the rank in the vector $S$ corresponding to the rank $k$ in the vector $R$. Observe that for all $j=0,1,...,n$ we have $C_{n}(u_{0},v_{j};R,S)=0$. Moreover, for all $i=1,...,n$, and for $j=0,...,n$ it holds that

[TABLE]

Let us explain this recursion using the following example, in which we took $n=10$ . The resulting $S_{[k]}$ ’s were as follows: 3, 6, 2, 9, 4, 1, 7, 5, 8, 10. The entries of the presented $11\times 11$ table are equal to respective values of $C_{n}(u_{i},v_{j};R,S)$ for $i=0,...,n$ , $j=0,...,n$ . It is useful to imagine that succeeding vertical and horizontal lines of the table are located at points $l/(n+1),\;l=0,...,n+1.$

All entries in the first column are equal to $0$. In the next columns the values on a white background are equal to the first value on the left, and the values on a green background are equal to the first value on the left plus $1/n$. When looking from left to right, the black dots in the table are the sorted pseudo-observations $(k/(n+1),S_{[k]}/(n+1)),\;k=1,...,n$. The computational complexity of this approach is of order $n^{2}$. Tables "T", "Ts12", "Ts22", "Ts21" contain the second coordinates of the sorted vectors $((R_{1},S_{1}),...,(R_{n},S_{n}))$, $((R_{1},S^{\prime}_{1}),...,(R_{n},S^{\prime}_{n}))$, $((R^{\prime}_{1},S^{\prime}_{1}),...,(R^{\prime}_{n},S^{\prime}_{n}))$, $((R^{\prime}_{1},S_{1}),...,(R^{\prime}_{n},S_{n}))$, according to the first coordinate, respectively. Using the above recursion, we use them to calculate $C_{n}(u,v;R,S)$, $C_{n}(u,v^{\prime};R,S^{\prime})$, $C_{n}(u^{\prime},v^{\prime};R^{\prime},S^{\prime})$, $C_{n}(u^{\prime},v;R^{\prime},S)$ on the grid $(u_{i},v_{j})$, $i=0,...,n$, $j=0,...,n$. In view of (C.2), this allows us to calculate $Q^{*}_{n}(u,v)$ on the grid. The pertaining values are collected by our program in a table "KS".
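The recursion described above can be sketched in a few lines. This is an illustration in Python rather than the authors' C program, and it rests on two assumptions that are not spelled out in the surviving text: that $C_{n}(u,v;R,S)=\frac{1}{n}\sum_{i}{\bf 1}(R_{i}/(n+1)\leq u,\,S_{i}/(n+1)\leq v)$, and that the grid point $u_{i}$ (respectively $v_{j}$) lies strictly between $i/(n+1)$ and $(i+1)/(n+1)$, so that $n\,C_{n}(u_{i},v_{j})$ counts the indices $k\leq i$ with $S_{[k]}\leq j$. All function names are hypothetical. The input is the sequence $S_{[k]}$ from the example with $n=10$.

```python
def copula_counts_recursive(s_sorted):
    """Table of n * C_n(u_i, v_j), i, j = 0..n, via the recursion (O(n^2))."""
    n = len(s_sorted)
    table = [[0] * (n + 1) for _ in range(n + 1)]  # row i = 0 stays zero
    for i in range(1, n + 1):
        s = s_sorted[i - 1]  # S_[i], the S-rank paired with R-rank i
        for j in range(n + 1):
            # Same as the entry for i - 1, plus one if S_[i] <= j.
            table[i][j] = table[i - 1][j] + (1 if s <= j else 0)
    return table


def copula_counts_bruteforce(s_sorted):
    """Same table by direct counting at every grid point (O(n^3))."""
    n = len(s_sorted)
    return [[sum(1 for k in range(i) if s_sorted[k] <= j)
             for j in range(n + 1)]
            for i in range(n + 1)]


# S_[k] values from the example in the text (n = 10).
S_SORTED = [3, 6, 2, 9, 4, 1, 7, 5, 8, 10]
counts = copula_counts_recursive(S_SORTED)
# Divide the integer counts by n to get empirical copula values.
copula_values = [[c / len(S_SORTED) for c in row] for row in counts]
```

Keeping integer counts and dividing by $n$ only at the end avoids accumulating floating-point error in the repeated additions of $1/n$.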

**Calculation of $Q^{*}_{n}$ and $Q^{*}_{n,s}$**

We calculate

[TABLE]

on the grid $(u_{i},v_{j})$ , $i=0,...,n$ , $j=0,...,n$ numerically, i.e. we apply the approximation

[TABLE]

The values $Q_{n,s}^{*}(u_{i},v_{j})$ for $i=0,...,n$, $j=0,...,n$ are collected in a table "KWyg".

Notice also that $Q_{n}^{*}(u_{i},v_{j})=Q_{n,0}^{*}(u_{i},v_{j})$ . So, in this way, both estimators of the quantile dependence function $q$ can be calculated.

**Calculation of ${\cal L}^{*}_{\epsilon,r,n}$**

Using the definition of $A(\epsilon)$ , given by the formula (10) of the paper, set

[TABLE]

We calculate ${\cal L}^{*}_{\epsilon,r,n}$ numerically, i.e. we approximate

[TABLE]

In the computer code the approximate value of ${\cal L}^{*}_{\epsilon,r,n}$ is denoted by "I".

**Calculation of ${\cal D}^{*}_{\kappa,0,n}$ and ${\cal D}^{*}_{\kappa,s,n}$**

Let us denote

[TABLE]

We calculate ${\cal D}^{*}_{\kappa,0,n}$ and ${\cal D}^{*}_{\kappa,s,n}$ numerically, i.e.

[TABLE]

In the computer code the approximate values of ${\cal D}^{*}_{\kappa,0,n}$ and ${\cal D}^{*}_{\kappa,s,n}$ are denoted by "D0" and "Ds", respectively.
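Replacing the supremum by a maximum over grid points can be sketched as below. This is a hedged illustration, not the authors' code: the function `sup_statistic`, the arguments `q_table` and `grid`, and the toy numbers are all hypothetical, and the grid coordinates are passed in by the caller because the grid definition is not reproduced in this text. Feeding in the table of $Q_{n}^{*}$ values would give an approximation of ${\cal D}^{*}_{\kappa,0,n}$, and the table of $Q_{n,s}^{*}$ values one of ${\cal D}^{*}_{\kappa,s,n}$.

```python
import math


def sup_statistic(q_table, grid, n, kappa):
    """Max of |sqrt(n) * Q(u_i, v_j)| over grid points in [kappa, 1-kappa]^2.

    q_table[i][j] holds the estimator at (grid[i], grid[j]).
    """
    inside = [k for k, t in enumerate(grid) if kappa <= t <= 1.0 - kappa]
    if not inside:
        raise ValueError("no grid point falls in [kappa, 1 - kappa]")
    root_n = math.sqrt(n)
    return max(abs(root_n * q_table[i][j]) for i in inside for j in inside)


# Toy 3 x 3 table (illustration only); large corner values are excluded
# once kappa removes the boundary grid points.
toy_grid = [0.1, 0.5, 0.9]
toy_table = [[5.0, 0.1, -0.2],
             [0.3, -0.4, 0.1],
             [0.0, 0.2, 9.0]]

d_trimmed = sup_statistic(toy_table, toy_grid, n=4, kappa=0.2)
d_full = sup_statistic(toy_table, toy_grid, n=4, kappa=0.05)
```

The toy example shows the role of $\kappa$: trimming the boundary removes the extreme corner entries from the maximum.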

**Calculation of ${\cal M}^{*}_{n}$**

The scheme of our program in this case was as follows. Given observations $(X_{1},Y_{1}),...,(X_{n},Y_{n})$, we calculated the pertaining values of the statistics, L = ${\cal L}^{*}_{\epsilon,r,n}$ and H = HHG, say, where HHG denotes the rank-based variant of the test introduced in Heller et al. (2013). Note that the basic part of a C code for the statistic HHG is given in Section 2 of the Supplementary Material for this paper.

To estimate the $p$-values of both ingredients of ${\cal M}^{*}_{n}$ we have applied the Monte Carlo method. Namely, we have generated MC = 100 000 auxiliary samples of size $n$ from the uniform distribution on $(0,1)^{2}$ and calculated for them the respective values of both statistics. Next, we sorted the $MC$ values of ${\cal L}^{*}_{\epsilon,r,n}$ in ascending order, obtaining $L_{(1)},...,L_{(MC)}$. Similarly, we obtained the sorted values $H_{(1)},...,H_{(MC)}$ of HHG. Empirical $p$-values for the obtained values $L$ and $H$ were estimated as follows

[TABLE]

Finally, we calculated ${\cal M}^{*}_{n}=\min\{p_{1},p_{2}\}$ .
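The Monte Carlo scheme above can be sketched as follows. The exact $p$-value formula used by the authors is not reproduced in this text, so the sketch assumes one common convention: the empirical $p$-value is the share of simulated null values that are at least as large as the observed statistic (large values of both statistics speak against ${\bf H_{0}}$). The function names and the toy numbers are hypothetical.

```python
import bisect


def mc_p_value(sorted_null, observed):
    """Share of simulated null values >= observed (assumed convention).

    sorted_null must be sorted in ascending order, as L_(1), ..., L_(MC).
    """
    first_ge = bisect.bisect_left(sorted_null, observed)
    return (len(sorted_null) - first_ge) / len(sorted_null)


def min_p_statistic(p1, p2):
    """Minimum p-value statistic: min{p1, p2}."""
    return min(p1, p2)


# Toy null samples standing in for the MC simulated values (illustration).
L_sorted = [0.5, 1.0, 1.5, 2.0, 2.5]
H_sorted = [10.0, 20.0, 30.0, 40.0, 50.0]

p2 = mc_p_value(L_sorted, observed=2.0)   # p-value of the L statistic
p1 = mc_p_value(H_sorted, observed=60.0)  # p-value of the HHG statistic
m_star = min_p_statistic(p1, p2)
```

Because the simulated values are already sorted, a binary search gives each $p$-value in $O(\log MC)$ time.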

References

(1) Heller, R., Heller, Y., Gorfine, M. (2013). A consistent multivariate test of association based on ranks of distances. Biometrika 100, 503-510.

**C.2. Codes**

Below we give C code for the computation of ${\cal Q}_{n}^{*}$, ${\cal Q}_{n,s}^{*}$, ${\cal L}^{*}_{\epsilon,r,n},\;{\cal D}^{*}_{\kappa,0,n},\;{\cal D}^{*}_{\kappa,s,n}$.

[Code listing: program.pdf]

Bibliography


Albers, W. (1999). Stop-loss premiums under dependence. Insurance: Mathematics and Economics, 24:173–185.
Bagkavos, D. and Patil, P. N. (2017). A new test of independence for bivariate observations. Journal of Multivariate Analysis, 160:117–133.
Berentsen, D. and Tjøstheim, D. (2014). Recognizing and visualizing departures from independence in bivariate data using Gaussian correlation. Statistics and Computing, 24:785–801.
Berghaus, B. and Segers, J. (2018). Weak convergence of the weighted empirical beta copula process. Journal of Multivariate Analysis, 166:266–281.
Bowman, A. and Azzalini, A. (1997). Applied Smoothing Techniques for Data Analysis: The Kernel Approach with S-Plus Illustration. Oxford Science Publications, Clarendon Press, Oxford.
Chicheportiche, R. (2013). Non-linear dependence in finance. These, Ecole Centrale des Arts et Manufactures. arXiv:1309.5073.
Daubechies, I., Grossman, A., and Meyer, Y. (1986). Painless nonorthogonal expansions. Journal of Mathematical Physics, 27:1271–1283.
Deheuvels, P., Peccati, G., and Yor, M. (2006). On quadratic functionals of the Brownian sheet and related processes. Stochastic Processes and their Applications, 116:493–538.
