Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation
R\'emi Khellaf, Aur\'elien Bellet, Julie Josse

TL;DR
This paper introduces a federated learning approach for causal inference that estimates the average treatment effect from decentralized data without sharing individual records, using aggregate statistics and novel propensity score methods.
Contribution
It proposes a new federated propensity score estimation technique with Membership Weights, enabling causal inference across multiple sites while addressing heterogeneity and privacy constraints.
Findings
Federated methods outperform meta-analysis in heterogeneous settings.
The approach effectively estimates ATE with decentralized data.
Experimental results validate theoretical advantages over existing methods.
Abstract
Causal inference typically assumes centralized access to individual-level data. Yet, in practice, data are often decentralized across multiple sites, making centralization infeasible due to privacy, logistical, or legal constraints. We address this problem by estimating the Average Treatment Effect (ATE) from decentralized observational data via a Federated Learning (FL) approach, allowing inference through the exchange of aggregate statistics rather than individual-level data. We propose a novel method to estimate propensity scores via a federated weighted average of local scores using Membership Weights (MW), defined as probabilities of site membership conditional on covariates. MW can be flexibly estimated with parametric or non-parametric classification models using standard FL algorithms. The resulting propensity scores are used to construct Federated Inverse Propensity Weighting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
