Robust Direct Learning for Causal Data Fusion

Xinyu Li; Yilin Li; Qing Cui; Longfei Li; Jun Zhou

arXiv:2211.00249·stat.ML·November 2, 2022

Robust Direct Learning for Causal Data Fusion

Xinyu Li, Yilin Li, Qing Cui, Longfei Li, Jun Zhou

PDF

Open Access

TL;DR

This paper introduces a robust direct learning framework for causal data fusion from multiple sources, effectively handling heterogeneity and source-specific covariates to improve causal inference accuracy.

Contribution

It proposes a novel weighted multi-source direct learner with double robustness and interpretability, advancing causal data fusion methods under complex data settings.

Findings

01

Effective in both homogeneous and heterogeneous scenarios

02

Achieves double robustness against model misspecification

03

Demonstrates improved estimation stability and accuracy

Abstract

In the era of big data, the explosive growth of multi-source heterogeneous data offers many exciting challenges and opportunities for improving the inference of conditional average treatment effects. In this paper, we investigate homogeneous and heterogeneous causal data fusion problems under a general setting that allows for the presence of source-specific covariates. We provide a direct learning framework for integrating multi-source data that separates the treatment effect from other nuisance functions, and achieves double robustness against certain misspecification. To improve estimation precision and stability, we propose a causal information-aware weighting function motivated by theoretical insights from the semiparametric efficiency theory; it assigns larger weights to samples containing more causal information with high interpretability. We introduce a two-step algorithm, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Bayesian Modeling and Causal Inference · Machine Learning and Algorithms