Data-Driven Confounder Selection via Markov and Bayesian Networks

Jenny Häggström

arXiv:1604.07212 · stat.ME · March 20, 2017

Open Access

TL;DR

This paper introduces a data-driven method for selecting confounders in causal inference using probabilistic graphical models, effectively identifying relevant covariate subsets to improve causal effect estimation.

Contribution

It proposes a novel approach combining Markov and Bayesian networks to select confounders without prior causal structure knowledge, validated through simulation.

Findings

01 Outperforms random forests and LASSO in confounder selection accuracy

02 Achieves lower mean squared error in causal effect estimation

03 Effective in high-dimensional data settings

Abstract

To estimate a causal effect on an outcome without bias, unconfoundedness is often assumed. If there is sufficient knowledge of the underlying causal structure, existing confounder selection criteria can be used to select subsets of the observed pretreatment covariates, $X$, that are sufficient for unconfoundedness, if such subsets exist. Here, estimation of these target subsets is considered when the underlying causal structure is unknown. The proposed method is to model the causal structure by a probabilistic graphical model, e.g., a Markov or Bayesian network, estimate this graph from observed data, and select the target subsets given the estimated graph. The approach is evaluated by simulation both in a high-dimensional setting where unconfoundedness holds given $X$ and in a setting where unconfoundedness only holds given subsets of $X$. Several common target subsets are investigated and the…
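The last step described in the abstract, selecting covariate subsets once a graph has been estimated, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the edge list is a made-up undirected (Markov network) estimate, the node names `T`, `Y`, and `X1` to `X5` are hypothetical, and "covariates adjacent to the treatment or the outcome" is only one simple way to read subsets off a graph. It is not presented as the paper's definition of its target subsets, and the graph-estimation step itself is not shown.

```python
# Hypothetical estimated Markov network (undirected), given as an edge list.
# "T" is the treatment, "Y" the outcome, "X1".."X5" pretreatment covariates.
# These edges are invented for illustration; they do not come from the paper.
EDGES = [
    ("X1", "T"), ("X1", "Y"),  # X1 is linked to both treatment and outcome
    ("X2", "T"),               # X2 is linked to the treatment only
    ("X3", "Y"),               # X3 is linked to the outcome only
    ("X4", "X5"),              # X4 and X5 are linked to neither T nor Y
    ("T", "Y"),
]


def neighbours(edges, node):
    """Return the set of nodes that share an edge with `node`."""
    out = set()
    for a, b in edges:
        if a == node:
            out.add(b)
        elif b == node:
            out.add(a)
    return out


def covariate_neighbours(edges, node, exclude=("T", "Y")):
    """Covariates adjacent to `node`, leaving out treatment and outcome."""
    return neighbours(edges, node) - set(exclude)


near_treatment = covariate_neighbours(EDGES, "T")  # covariates adjacent to T
near_outcome = covariate_neighbours(EDGES, "Y")    # covariates adjacent to Y
near_both = near_treatment & near_outcome          # adjacent to both T and Y

print(sorted(near_treatment), sorted(near_outcome), sorted(near_both))
```

In this toy graph, `X4` and `X5` are dropped from every subset because they are adjacent to neither `T` nor `Y`. Whether a subset chosen this way is sufficient for unconfoundedness depends on assumptions the sketch does not check.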

Taxonomy

Topics: Bayesian Modeling and Causal Inference · Advanced Causal Inference Techniques · Statistical Methods and Inference