A Scalable Conditional Independence Test for Nonlinear, Non-Gaussian Data
Joseph D. Ramsey

TL;DR
This paper introduces a scalable conditional independence test for nonlinear, non-Gaussian data that is computationally efficient and accurate, enabling analysis of large high-dimensional datasets in scientific research.
Contribution
It presents a new class of O(N^2) tests based on conditional correlation independence, improving computational efficiency over existing kernel-based methods while maintaining accuracy.
Findings
The proposed tests are significantly faster, requiring seconds instead of hours.
They achieve similar accuracy to existing methods on complex nonlinear, non-Gaussian data.
The method is practical for large-scale high-dimensional data analysis.
Abstract
Many relations of scientific interest are nonlinear, and even in linear systems distributions are often non-Gaussian, for example in fMRI BOLD data. A class of search procedures for causal relations in high dimensional data relies on sample derived conditional independence decisions. The most common applications rely on Gaussian tests that can be systematically erroneous in nonlinear non-Gaussian cases. Recent work (Gretton et al. (2009), Tillman et al. (2009), Zhang et al. (2011)) has proposed conditional independence tests using Reproducing Kernel Hilbert Spaces (RKHS). Among these, perhaps the most efficient has been KCI (Kernel Conditional Independence, Zhang et al. (2011)), with computational requirements that grow effectively at least as O(N3), placing it out of range of large sample size analysis, and restricting its applicability to high dimensional data sets. We propose a class…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Spectroscopy and Chemometric Analyses · Fault Detection and Control Systems
