A direct extension of Azadkia & Chatterjee's rank correlation to multi-response vectors
Jonathan Ansari, Sebastian Fuchs

TL;DR
This paper extends Azadkia & Chatterjee's rank correlation to multi-response vectors, providing a new measure T for assessing dependence, with theoretical properties, a consistent estimator, and applications in feature selection.
Contribution
It introduces a novel dependence measure T for multi-response vectors, along with a non-parametric estimator and theoretical analysis, filling a gap in multivariate dependence assessment.
Findings
T characterizes independence and perfect dependence accurately.
The estimator for T is strongly consistent and asymptotically normal.
Applications include feature ranking and selection for multi-outcome data.
Abstract
Recently, Chatterjee (2023) recognized the lack of a direct generalization of his rank correlation in Azadkia and Chatterjee (2021) to a multi-dimensional response vector. As a natural solution to this problem, we here propose an extension of that is applicable to a set of response variables, where our approach builds upon converting the original vector-valued problem into a univariate problem and then applying the rank correlation to it. Our novel measure quantifies the scale-invariant extent of functional dependence of a response vector on predictor variables , characterizes independence of and as well as perfect dependence of on and hence fulfills all the characteristics of a measure of predictability. Aiming at maximum interpretability,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Bayesian Modeling and Causal Inference
