Fourier Feature Methods for Nonlinear Causal Discovery: FFML Scoring, TRFF Scoring, and FFCI Testing in Mixed Data
Joseph D. Ramsey

TL;DR
This paper introduces three RFF-based methods—FFML, TRFF, and FFCI—for scalable nonlinear causal discovery, balancing computational efficiency with statistical robustness across mixed data types.
Contribution
The paper develops practical RFF-based tools for score-based, constraint-based, and hybrid causal discovery, extending Gaussian process and kernel methods to large-scale mixed data.
Findings
TRFF achieves higher precision than FFML.
FFML provides better recall and lower SHD.
BOSS+FFML outperforms others on nonlinear data.
Abstract
Gaussian process (GP) marginal likelihood scores and kernel conditional independence tests are theoretically appealing for nonlinear causal discovery but computationally prohibitive at scale. We present three complementary RFF-based methods forming a practical toolkit for score-based, constraint-based, and hybrid causal discovery. The Fourier Feature Marginal Likelihood (FFML) score approximates the exact GP marginal likelihood by replacing the kernel Gram matrix with a finite-dimensional feature representation, reducing cost to while retaining the probabilistic interpretation and automatic complexity penalty of the exact score. FFML extends to mixed (continuous and discrete) parent sets via a product-kernel construction, with a Kronecker path for small discrete parent sets and a Hadamard-product path otherwise. The Tetrad Random Fourier Feature (TRFF) score…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
