Simultaneous Record Linkage and Causal Inference with Propensity Score Subclassification
Joan Heck Wortman, Jerome P. Reiter

TL;DR
This paper introduces a methodology for causal inference using propensity score subclassification in linked observational data with linkage errors, improving treatment effect estimates.
Contribution
It develops new algorithms for record linkage with errors and evaluates their impact on causal effect estimation accuracy.
Findings
Some algorithms improve treatment effect estimate accuracy.
Linkage errors can bias causal estimates if unaddressed.
Simulation shows improved performance over traditional methods.
Abstract
We develop methodology for causal inference in observational studies when using propensity score subclassification on data constructed with probabilistic record linkage techniques. We focus on scenarios where covariates and binary treatment assignments are in one file and outcomes are in another file, and the goal is to estimate an additive treatment effect by merging the files. We assume that the files can be linked using variables common to both files, e.g., names or birth dates, but that links are subject to errors, e.g., due to reporting errors in the linking variables. We develop methodology for cases where such reporting errors are independent of the other variables on the files. We describe conceptually how linkage errors can affect causal estimates in subclassification contexts. We also present and evaluate several algorithms for deciding which record pairs to use in estimation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
