Prior-Data Fitted Networks for Causal Inference: a Simulation Study with Real-World Scenarios

Francisco Mourao (1, 2), David Hajage (1, 3), Daria Bystrova (1), Bertrand Bouvarel (1, 2), Nathanaël Lapidus (1, 2), Fabrice Carrat (1, 2), Benjamin Glemain (1, 2) ((1) Sorbonne Université, Inserm, Institut Pierre-Louis d'épidémiologie et de santé publique, Paris, France; (2) Département de santé publique, Hôpital Saint-Antoine, AP-HP. Sorbonne Université, Paris, France; (3) Hôpital Pitié-Salpêtrière, Département de Santé Publique, Centre de Pharmacoépidémiologie, Sorbonne Université, Paris, France.)

arXiv:2603.15928 · stat.AP · April 14, 2026

TL;DR

This paper explores the use of Prior-Data Fitted Networks (PFNs) for causal inference in tabular data, evaluating their performance in simulated real-world clinical scenarios.

Contribution

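The paper presents the principles of PFNs, evaluates two PFN-based approaches for estimating the average treatment effect (TabPFN combined with g-computation or inverse probability of treatment weighting, and CausalPFN), and discusses their advantages and limitations.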

Findings

01

TabPFN's computation times are prohibitive for routine causal inference, particularly because bootstrapping is needed to obtain confidence intervals.

02

G-computation with TabPFN yields a highly biased estimator; a T-learner approach partially corrects the bias.

03

CausalPFN is computationally efficient, but its credible intervals show poor coverage.

Abstract

Prior-Data Fitted Networks (PFNs) represent a paradigm shift in tabular data prediction. We present the principles of this new paradigm and evaluate two PFNs for estimating the average treatment effect (ATE) of a binary treatment on a binary outcome, using simulated clinical scenarios based on real-world data. We assessed TabPFN combined with causal inference procedures (g-computation and inverse probability of treatment weighting), and CausalPFN, a PFN that directly provides an ATE estimate with a credible interval. Confidence intervals for the TabPFN-based methods were derived using bootstrap resampling. We found that computation times for TabPFN were prohibitive for routine causal inference, particularly because of the need for bootstrapping to yield confidence intervals. Moreover, g-computation with TabPFN produced a highly biased estimator, partially corrected by fitting separate…
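The estimators named in the abstract can be illustrated with a minimal sketch. It is not the authors' code: a plain logistic regression fitted by Newton's method stands in for TabPFN, the simulated data are arbitrary, and all function names (`fit_logistic`, `ate_gcomp`, `ate_tlearner`, `ate_iptw`, `bootstrap_ci`, `simulate`) are hypothetical. The percentile bootstrap is also an assumption, since the abstract says only that bootstrap resampling was used. The sketch shows g-computation with a single outcome model, a T-learner with one outcome model per treatment arm, and inverse probability of treatment weighting (IPTW), each returning an ATE on the risk-difference scale.

```python
import numpy as np


def _sigmoid(z):
    # Clip the linear predictor to avoid overflow in exp.
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30.0, 30.0)))


def fit_logistic(X, y, n_iter=25, ridge=1e-6):
    """Logistic regression via Newton's method (stand-in for a PFN classifier)."""
    Xb = np.column_stack([np.ones(len(X)), X])
    beta = np.zeros(Xb.shape[1])
    for _ in range(n_iter):
        p = _sigmoid(Xb @ beta)
        grad = Xb.T @ (y - p) - ridge * beta
        hess = (Xb * (p * (1.0 - p))[:, None]).T @ Xb + ridge * np.eye(Xb.shape[1])
        beta = beta + np.linalg.solve(hess, grad)
    return beta


def predict_proba(beta, X):
    return _sigmoid(np.column_stack([np.ones(len(X)), X]) @ beta)


def ate_gcomp(X, t, y):
    """G-computation: one outcome model with treatment as a feature."""
    beta = fit_logistic(np.column_stack([t, X]), y)
    p1 = predict_proba(beta, np.column_stack([np.ones_like(t), X]))
    p0 = predict_proba(beta, np.column_stack([np.zeros_like(t), X]))
    return float(np.mean(p1 - p0))


def ate_tlearner(X, t, y):
    """T-learner: separate outcome models for treated and untreated."""
    b1 = fit_logistic(X[t == 1], y[t == 1])
    b0 = fit_logistic(X[t == 0], y[t == 0])
    return float(np.mean(predict_proba(b1, X) - predict_proba(b0, X)))


def ate_iptw(X, t, y):
    """IPTW with normalized weights; propensity scores clipped (assumed bounds)."""
    ps = np.clip(predict_proba(fit_logistic(X, t), X), 0.01, 0.99)
    w1, w0 = t / ps, (1.0 - t) / (1.0 - ps)
    return float(np.sum(w1 * y) / np.sum(w1) - np.sum(w0 * y) / np.sum(w0))


def bootstrap_ci(estimator, X, t, y, n_boot=200, alpha=0.05, seed=0):
    """Percentile bootstrap: refit the estimator on each resample."""
    rng = np.random.default_rng(seed)
    n = len(y)
    stats = []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)
        stats.append(estimator(X[idx], t[idx], y[idx]))
    lo, hi = np.quantile(stats, [alpha / 2.0, 1.0 - alpha / 2.0])
    return float(lo), float(hi)


def simulate(n=2000, seed=1):
    """Toy confounded data with a binary treatment and a binary outcome."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n, 2))
    t = rng.binomial(1, _sigmoid(0.8 * X[:, 0] - 0.5 * X[:, 1])).astype(float)
    base = -0.5 + 0.7 * X[:, 0] + 0.3 * X[:, 1]
    y = rng.binomial(1, _sigmoid(base + 1.0 * t)).astype(float)
    # Sample ATE implied by the known outcome model.
    true_ate = float(np.mean(_sigmoid(base + 1.0) - _sigmoid(base)))
    return X, t, y, true_ate


if __name__ == "__main__":
    X, t, y, true_ate = simulate()
    print("true ATE:", round(true_ate, 3))
    for name, est in [("g-comp", ate_gcomp), ("T-learner", ate_tlearner), ("IPTW", ate_iptw)]:
        print(name, round(est(X, t, y), 3), bootstrap_ci(est, X, t, y))
```

Every bootstrap replicate refits the model from scratch. With a cheap logistic regression this is negligible, but the same loop around a large pretrained network multiplies the inference cost by `n_boot`, which is the cost the abstract describes as prohibitive.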
