Conditional independence testing based on a nearest-neighbor estimator of conditional mutual information
Jakob Runge

TL;DR
This paper introduces a non-parametric, nearest-neighbor based conditional independence test for continuous data that effectively handles nonlinear and high-dimensional dependencies, outperforming kernel-based methods in calibration and power.
Contribution
It presents a novel conditional independence test combining nearest-neighbor estimation with a local permutation scheme, improving calibration and power over existing kernel-based methods.
Findings
Reliable null distribution simulation for small samples and high dimensions
Better calibration than kernel-based tests, especially for non-smooth densities
Comparable or higher power levels than existing methods
Abstract
Conditional independence testing is a fundamental problem underlying causal discovery and a particularly challenging task in the presence of nonlinear and high-dimensional dependencies. Here a fully non-parametric test for continuous data based on conditional mutual information combined with a local permutation scheme is presented. Through a nearest neighbor approach, the test efficiently adapts also to non-smooth distributions due to strongly nonlinear dependencies. Numerical experiments demonstrate that the test reliably simulates the null distribution even for small sample sizes and with high-dimensional conditioning sets. The test is better calibrated than kernel-based tests utilizing an analytical approximation of the null distribution, especially for non-smooth densities, and reaches the same or higher power levels. Combining the local permutation scheme with the kernel tests…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Statistical Methods and Inference · Bayesian Methods and Mixture Models
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
