Conditional Diffusion Models Based Conditional Independence Testing
Yanfeng Yang, Shuai Li, Yingjie Zhang, Zhuoran Sun, Hai Shu, Ziqi, Chen, Renming Zhang

TL;DR
This paper introduces a new conditional independence testing method using conditional diffusion models to better approximate the conditional distribution, improving test accuracy especially in high-dimensional settings.
Contribution
It proposes using conditional diffusion models for more accurate approximation of $X|Z$ in CRT, outperforming GANs and handling complex, mixed-type data without distributional assumptions.
Findings
CDMs closely approximate the true conditional distribution.
The proposed test controls type I error effectively.
The method performs well in high-dimensional synthetic data experiments.
Abstract
Conditional independence (CI) testing is a fundamental task in modern statistics and machine learning. The conditional randomization test (CRT) was recently introduced to test whether two random variables, and , are conditionally independent given a potentially high-dimensional set of random variables, . The CRT operates exceptionally well under the assumption that the conditional distribution is known. However, since this distribution is typically unknown in practice, accurately approximating it becomes crucial. In this paper, we propose using conditional diffusion models (CDMs) to learn the distribution of . Theoretically and empirically, it is shown that CDMs closely approximate the true conditional distribution. Furthermore, CDMs offer a more accurate approximation of compared to GANs, potentially leading to a CRT that performs better than those based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsFault Detection and Control Systems
MethodsSparse Evolutionary Training · Diffusion
