TL;DR
This paper demonstrates the inherent difficulty of conditional independence testing, proves the non-existence of a universally powerful test, and proposes a practical, regression-based method called the Generalised Covariance Measure (GCM) for univariate and high-dimensional data.
Contribution
It introduces the GCM method for conditional independence testing, showing its validity relies on slow-rate regression estimates and extending it to high-dimensional settings.
Findings
GCM is competitive with state-of-the-art tests in simulations.
Validity depends mainly on regression estimation quality.
Theoretical guarantees are provided for kernel ridge regression.
Abstract
It is a common saying that testing for conditional independence, i.e., testing whether whether two random vectors and are independent, given , is a hard statistical problem if is a continuous random variable (or vector). In this paper, we prove that conditional independence is indeed a particularly difficult hypothesis to test for. Valid statistical tests are required to have a size that is smaller than a predefined significance level, and different tests usually have power against a different class of alternatives. We prove that a valid test for conditional independence does not have power against any alternative. Given the non-existence of a uniformly valid conditional independence test, we argue that tests must be designed so their suitability for a particular problem may be judged easily. To address this need, we propose in the case where and are univariate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
