Causal-ICM: A Data Fusion Framework For Heterogeneous Treatment Effect Estimation With Multi-Task Gaussian Processes
Evangelos Dimitriou, Edwin Fong, Jens Magelund Tarp, Karla Diaz-Ordaz, Brieuc Lehmann

TL;DR
Causal-ICM is a Bayesian nonparametric framework that fuses RCT and observational data using multi-task Gaussian processes to improve heterogeneous treatment effect estimation with reliable uncertainty quantification.
Contribution
It introduces a novel data fusion method with a parameter controlling dataset influence, and a data-adaptive procedure for optimal parameter selection.
Findings
Causal-ICM outperforms existing data fusion methods in point estimation.
It provides principled uncertainty quantification for treatment effects.
Demonstrated robust performance in simulations and real-world data.
Abstract
Bridging the gap between internal and external validity is crucial for heterogeneous treatment effect estimation. Randomised controlled trials (RCTs), favoured for their internal validity due to randomisation, often encounter challenges in generalising findings due to strict eligibility criteria. Observational studies, on the other hand, may provide stronger external validity through larger and more representative samples but can suffer from compromised internal validity due to unmeasured confounding. Motivated by these complementary characteristics, we propose a novel Bayesian nonparametric approach, Causal-ICM, leveraging multi-task Gaussian processes to integrate data from both RCTs and observational studies. In particular, we introduce a parameter that controls the degree of borrowing between the datasets and prevents the observational dataset from dominating the estimation. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
