Exact Detection Thresholds and Minimax Optimality of Chatterjee's Correlation Coefficient
Arnab Auddy, Nabarun Deb, and Sagnik Nandy

TL;DR
This paper provides a detailed theoretical analysis of Chatterjee's correlation coefficient, establishing exact detection thresholds and demonstrating its minimax optimality for testing dependence and independence between variables.
Contribution
It advances understanding of Chatterjee's coefficient by deriving its asymptotic distribution, detection thresholds, and proving its minimax optimality in dependence testing.
Findings
Establishes a $n^{-1/4}$ detection boundary for independence testing.
Proves minimax optimality of Chatterjee's coefficient for dependence detection with a $n^{-1/2}$ boundary.
Provides explicit limiting local power calculations for certain alternatives.
Abstract
Recently, Chatterjee (2021) introduced a new rank-based correlation coefficient which can be used to measure the strength of dependence between two random variables. This coefficient has already attracted much attention as it converges to the Dette-Siburg-Stoimenov measure (see Dette et al. (2013)), which equals if and only if the variables are independent and if and only if one variable is a function of the other. Further, Chatterjee's coefficient is computable in (near) linear time, which makes it appropriate for large-scale applications. In this paper, we expand the theoretical understanding of Chatterjee's coefficient in two directions: (a) First we consider the problem of testing for independence using Chatterjee's correlation. We obtain its asymptotic distribution under any changing sequence of alternatives converging to the null hypothesis (of independence). We further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbability and Risk Models · Statistical Methods and Inference
