Technical Report: Facilitating the Adoption of Causal Inference Methods Through LLM-Empowered Co-Pilot

Jeroen Berrevoets; Julianna Piskorz; Robert Davis; Harry Amad; Jim Weatherall; Mihaela van der Schaar

arXiv:2508.10581·cs.LG·August 15, 2025

Technical Report: Facilitating the Adoption of Causal Inference Methods Through LLM-Empowered Co-Pilot

Jeroen Berrevoets, Julianna Piskorz, Robert Davis, Harry Amad, Jim Weatherall, Mihaela van der Schaar

PDF

TL;DR

This paper presents CATE-B, an LLM-powered co-pilot system that guides users through causal inference steps, making treatment effect estimation more accessible and reproducible across various domains.

Contribution

Introducing CATE-B, an open-source system that integrates LLMs for causal discovery, adjustment set identification, and method selection, enhancing usability in causal inference.

Findings

01

CATE-B effectively guides users through causal modeling tasks.

02

The system improves accuracy in identifying adjustment sets.

03

Benchmark suite enables reproducibility and evaluation.

Abstract

Estimating treatment effects (TE) from observational data is a critical yet complex task in many fields, from healthcare and economics to public policy. While recent advances in machine learning and causal inference have produced powerful estimation techniques, their adoption remains limited due to the need for deep expertise in causal assumptions, adjustment strategies, and model selection. In this paper, we introduce CATE-B, an open-source co-pilot system that uses large language models (LLMs) within an agentic framework to guide users through the end-to-end process of treatment effect estimation. CATE-B assists in (i) constructing a structural causal model via causal discovery and LLM-based edge orientation, (ii) identifying robust adjustment sets through a novel Minimal Uncertainty Adjustment Set criterion, and (iii) selecting appropriate regression methods tailored to the causal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.