Generalizing treatment effects with incomplete covariates: identifying assumptions and multiple imputation algorithms
Imke Mayer, Julie Josse, Traumabase Group

TL;DR
This paper addresses the challenge of generalizing causal effects from RCTs to broader populations when covariates have missing data, proposing multiple imputation methods and a direct estimation approach, with extensive simulations and real data application.
Contribution
It introduces three novel multiple imputation strategies and a direct estimation method tailored for incomplete covariates in multi-source data for causal effect generalization.
Findings
Imputation methods improve effect estimation accuracy.
Assumption sets significantly influence method performance.
Application to trauma data demonstrates practical impact.
Abstract
We focus on the problem of generalizing a causal effect estimated on a randomized controlled trial (RCT) to a target population described by a set of covariates from observational data. Available methods such as inverse propensity sampling weighting are not designed to handle missing values, which are however common in both data sources. In addition to coupling the assumptions for causal effect identifiability and for the missing values mechanism and to defining appropriate estimation strategies, one difficulty is to consider the specific structure of the data with two sources and treatment and outcome only available in the RCT. We propose three multiple imputation strategies to handle missing values when generalizing treatment effects, each handling the multi-source structure of the problem differently (separate imputation, joint imputation with fixed effect, joint imputation ignoring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods and Bayesian Inference · Survey Methodology and Nonresponse
