A simple measure of conditional dependence
Mona Azadkia, Sourav Chatterjee

TL;DR
This paper introduces a new, model-free coefficient for measuring conditional dependence between variables, which converges without distributional assumptions and underpins a consistent variable selection algorithm called FOCI.
Contribution
The paper proposes a novel conditional dependence coefficient with desirable properties and develops FOCI, a new variable selection method based on this coefficient, with proven consistency.
Findings
Coefficient converges to a limit in [0,1] without distributional assumptions.
FOCI algorithm is model-free and tuning-parameter free.
Applications demonstrate effectiveness on synthetic and real data.
Abstract
We propose a coefficient of conditional dependence between two random variables and given a set of other variables , based on an i.i.d. sample. The coefficient has a long list of desirable properties, the most important of which is that under absolutely no distributional assumptions, it converges to a limit in , where the limit is if and only if and are conditionally independent given , and is if and only if is equal to a measurable function of given . Moreover, it has a natural interpretation as a nonlinear generalization of the familiar partial statistic for measuring conditional dependence by regression. Using this statistic, we devise a new variable selection algorithm, called Feature Ordering by Conditional Independence (FOCI), which is model-free, has no tuning parameters, and is provably…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
