Conditional Diffusion Models Based Conditional Independence Testing

Yanfeng Yang; Shuai Li; Yingjie Zhang; Zhuoran Sun; Hai Shu; Ziqi Chen; Renming Zhang

arXiv:2412.11744 · stat.ML · December 19, 2024

TL;DR

This paper introduces a new conditional independence testing method using conditional diffusion models to better approximate the conditional distribution, improving test accuracy especially in high-dimensional settings.

Contribution

The paper proposes using conditional diffusion models (CDMs) to approximate $X \mid Z$ more accurately within the conditional randomization test (CRT). The approach outperforms GAN-based approximations and handles complex, mixed-type data without distributional assumptions.

Findings

1. CDMs closely approximate the true conditional distribution.

2. The proposed test controls type I error effectively.

3. The method performs well in high-dimensional synthetic data experiments.

Abstract

Conditional independence (CI) testing is a fundamental task in modern statistics and machine learning. The conditional randomization test (CRT) was recently introduced to test whether two random variables, $X$ and $Y$, are conditionally independent given a potentially high-dimensional set of random variables, $Z$. The CRT operates exceptionally well under the assumption that the conditional distribution $X \mid Z$ is known. However, since this distribution is typically unknown in practice, accurately approximating it becomes crucial. In this paper, we propose using conditional diffusion models (CDMs) to learn the distribution of $X \mid Z$. Theoretically and empirically, it is shown that CDMs closely approximate the true conditional distribution. Furthermore, CDMs offer a more accurate approximation of $X \mid Z$ compared to GANs, potentially leading to a CRT that performs better than those based on…
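As a rough illustration of the CRT idea summarized in the abstract, the Python sketch below resamples $X$ from a sampler for $X \mid Z$, recomputes a test statistic on each resample, and compares the results with the observed statistic. The names `crt_pvalue`, `abs_corr`, and `gaussian_sampler`, the toy linear-Gaussian data, and the choice of absolute correlation as the statistic are illustrative assumptions, not taken from the paper or its repository. In the paper's setting, the known Gaussian sampler used here would be replaced by a trained conditional diffusion model.

```python
import numpy as np

def crt_pvalue(x, y, z, sample_x_given_z, statistic, n_draws=200, rng=None):
    """Generic CRT p-value: compare the observed statistic with statistics
    computed on copies of X resampled from (an approximation of) X | Z."""
    rng = np.random.default_rng(rng)
    t_obs = statistic(x, y, z)
    t_null = np.array(
        [statistic(sample_x_given_z(z, rng), y, z) for _ in range(n_draws)]
    )
    # The +1 terms count the observed statistic among the draws,
    # so the p-value is never exactly zero.
    return (1 + np.sum(t_null >= t_obs)) / (n_draws + 1)

def abs_corr(x, y, z):
    """Toy statistic (an assumption): absolute Pearson correlation of X and Y."""
    return abs(np.corrcoef(x, y)[0, 1])

# Toy linear-Gaussian data (an assumption, for illustration only).
data_rng = np.random.default_rng(0)
n, d = 500, 5
beta = np.ones(d) / np.sqrt(d)
z = data_rng.normal(size=(n, d))
x = z @ beta + data_rng.normal(size=n)

def gaussian_sampler(z, rng):
    """Stand-in for a learned sampler: here X | Z is known to be N(z @ beta, 1)."""
    return z @ beta + rng.normal(size=z.shape[0])

y_null = z @ beta + data_rng.normal(size=n)  # Y depends on Z only: CI holds
y_alt = y_null + x                           # Y also depends on X: CI fails

p_null = crt_pvalue(x, y_null, z, gaussian_sampler, abs_corr, rng=1)
p_alt = crt_pvalue(x, y_alt, z, gaussian_sampler, abs_corr, rng=1)
print(f"p-value when CI holds: {p_null:.3f}")
print(f"p-value when CI fails: {p_alt:.3f}")
```

Because the sampler is passed in as a function, the same skeleton works with any generative model of $X \mid Z$; the validity of the resulting p-value depends on how closely that model matches the true conditional distribution, which is the gap the paper aims to close.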

Code & Models

Repositories

yanfeng-yang-0316/cdcit (PyTorch, official)

Videos

Conditional Diffusion Models Based Conditional Independence Testing · Underline

Taxonomy

Topics: Fault Detection and Control Systems

Methods: Sparse Evolutionary Training · Diffusion