Toward Scalable and Valid Conditional Independence Testing with Spectral Representations

Alek Frohlich; Vladimir Kostic; Karim Lounici; Daniel Perazzo; Massimiliano Pontil

arXiv:2512.19510·cs.LG·December 23, 2025

Toward Scalable and Valid Conditional Independence Testing with Spectral Representations

Alek Frohlich, Vladimir Kostic, Karim Lounici, Daniel Perazzo, Massimiliano Pontil

PDF

Open Access

TL;DR

This paper proposes a scalable, representation learning-based method for conditional independence testing that improves on existing kernel methods in validity, adaptivity, and scalability, with theoretical guarantees and promising preliminary results.

Contribution

It introduces a novel spectral representation approach for CI testing, combining SVD-based features with a contrastive learning algorithm, and provides theoretical analysis of its validity and power.

Findings

01

The method is theoretically valid and asymptotically powerful.

02

Preliminary experiments show improved scalability and performance.

03

The approach bridges kernel methods with modern representation learning.

Abstract

Conditional independence (CI) is central to causal inference, feature selection, and graphical modeling, yet it is untestable in many settings without additional assumptions. Existing CI tests often rely on restrictive structural conditions, limiting their validity on real-world data. Kernel methods using the partial covariance operator offer a more principled approach but suffer from limited adaptivity, slow convergence, and poor scalability. In this work, we explore whether representation learning can help address these limitations. Specifically, we focus on representations derived from the singular value decomposition of the partial covariance operator and use them to construct a simple test statistic, reminiscent of the Hilbert-Schmidt Independence Criterion (HSIC). We also introduce a practical bi-level contrastive algorithm to learn these representations. Our theory links…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis