Prior-Data Fitted Networks for Causal Inference: a Simulation Study with Real-World Scenarios
Francisco Mourao (1, 2), David Hajage (1, 3), Daria Bystrova (1), Bertrand Bouvarel (1, 2), Nathana\"el Lapidus (1, 2), Fabrice Carrat (1, 2), Benjamin Glemain (1, 2) ((1) Sorbonne Universit\'e, Inserm, Institut Pierre-Louis d'\'epid\'emiologie et de sant\'e publique, Paris

TL;DR
This paper explores the use of Prior-Data Fitted Networks (PFNs) for causal inference in tabular data, evaluating their performance in simulated real-world clinical scenarios.
Contribution
It introduces PFNs as a new paradigm for causal inference, assesses two variants for estimating treatment effects, and discusses their advantages and limitations.
Findings
TabPFN has high computational costs for routine causal inference.
g-computation with TabPFN yields biased estimates, improved by T-learner approach.
CausalPFN is computationally efficient but shows poor coverage of credible intervals.
Abstract
Prior-Data Fitted Networks (PFNs) represent a paradigm shift in tabular data prediction. We present the principles of this new paradigm and evaluate two PFNs for estimating the average treatment effect (ATE) of a binary treatment on a binary outcome, using simulated clinical scenarios based on real-world data. We assessed TabPFN combined with causal inference procedures (g-computation and inverse probability of treatment weighting), and CausalPFN, a PFN that directly provides an ATE estimate with a credible interval. Confidence intervals for the TabPFN-based methods were derived using bootstrap resampling. We found that computation times for TabPFN were prohibitive for routine causal inference, particularly because of the need for bootstrapping to yield confidence intervals. Moreover, g-computation with TabPFN produced a highly biased estimator, partially corrected by fitting separate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
