Bayesian variable selection in hierarchical difference-in-differences models
James Normington, Eric F. Lock, Thomas A. Murray, Caroline S. Carlin

TL;DR
This paper introduces a Bayesian hierarchical difference-in-differences model that accounts for unobservable individual differences and confounding variables, improving causal effect estimation in complex observational data.
Contribution
It extends classical DiD models to hierarchical settings with latent variables and proposes structured Bayesian variable selection methods for unbiased treatment effect estimation.
Findings
The HDiD model reduces bias from confounders.
Bayesian variable selection improves covariate adjustment.
Application to Minnesota diabetes data demonstrates practical utility.
Abstract
A popular method for estimating a causal treatment effect with observational data is the difference-in-differences (DiD) model. In this work, we consider an extension of the classical DiD setting to the hierarchical context in which data cannot be matched at the most granular level (e.g., individual-level differences are unobservable). We propose a Bayesian hierarchical difference-in-differences (HDiD) model which estimates the treatment effect by regressing the treatment on a latent variable representing the mean change in group-level outcome. We present theoretical and empirical results showing that an HDiD model that fails to adjust for a particular class of confounding variables, or confounding with the baseline (pre-treatment) outcomes, biases the treatment effect estimate. We propose and implement various approaches to perform variable selection using a structured Bayesian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
