Causal Foundation Models with Continuous Treatments

Christopher Stith; Medha Barath; Vahid Balazadeh; Jesse C. Cresswell; Rahul G. Krishnan

arXiv:2605.15133·cs.LG·May 15, 2026

Causal Foundation Models with Continuous Treatments

Christopher Stith, Medha Barath, Vahid Balazadeh, Jesse C. Cresswell, Rahul G. Krishnan

PDF

TL;DR

This paper introduces the first causal foundation model for continuous treatments, capable of predicting causal effects across various tasks without additional training, using a transformer trained on a rich causal corpus.

Contribution

It presents a novel prior for data generation and a transformer-based model that reconstructs treatment-response curves from observational data, enabling zero-shot causal inference.

Findings

01

Achieves state-of-the-art performance in treatment-response curve reconstruction.

02

Can generalize to unseen tasks without additional training.

03

Uses in-context learning to efficiently infer causal effects.

Abstract

Causal inference, estimating causal effects from observational data, is a fundamental tool in many disciplines. Of particular importance across a variety of domains is the continuous treatment setting, where the variable of intervention has a continuous range. This setting is far less explored and represents a substantial shift from the binary treatment setting, with models needing to represent effects across a continuum of treatment values. In this paper, we present the first causal foundation model for the continuous treatment setting. Our model meta-learns the ability to predict causal effects across a wide variety of unseen tasks without additional training or fine-tuning. First, we design a novel prior over data-generating processes with continuous treatment variables in order to generate a rich causal training corpus. We then train a transformer to reconstruct individual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.