A new measure of dependence: Integrated $R^2$
Mona Azadkia, Pouya Roudaki

TL;DR
This paper introduces a new dependence measure called integrated R^2 that quantifies the relationship between variables, extending to conditional dependence, with a simple estimator and a variable selection method that outperforms existing techniques.
Contribution
The paper proposes a novel, interpretable dependence measure with a computationally efficient estimator and a consistent, tuning-free variable selection procedure.
Findings
The measure equals zero for independence and one when Y is a function of X.
The estimator has complexity comparable to classical correlation coefficients.
The method shows strong empirical performance and outperforms existing approaches.
Abstract
We introduce a novel measure of dependence that captures the extent to which a random variable is determined by a random vector . The measure equals zero precisely when and are independent, and it attains one exactly when is almost surely a measurable function of . We further extend this framework to define a measure of conditional dependence between and given . We propose a simple and interpretable estimator with computational complexity comparable to classical correlation coefficients, including those of Pearson, Spearman, and Chatterjee. Leveraging this dependence measure, we develop a tuning-free, model-agnostic variable selection procedure and establish its consistency under appropriate sparsity conditions. Extensive experiments on synthetic and real datasets highlight the strong empirical performance of our methodology and demonstrate substantial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
