How to apply multiple imputation in propensity score matching with partially observed confounders: a simulation study and practical recommendations
Albee Y. Ling, Maria E. Montez-Rath, Maya B. Mathur, Kris Kapphahn,, Manisha Desai

TL;DR
This study evaluates how multiple imputation can be effectively integrated with propensity score matching in the presence of missing confounder data, providing practical recommendations based on simulation results.
Contribution
It offers novel insights and guidelines on applying multiple imputation strategies in propensity score matching to improve bias reduction and efficiency in observational studies.
Findings
MI-derPassive improves bias in PSM with missing data.
Within-imputation and across-imputation methods vary in performance.
Including auxiliary variables enhances imputation accuracy.
Abstract
Propensity score matching (PSM) has been widely used to mitigate confounding in observational studies, although complications arise when the covariates used to estimate the PS are only partially observed. Multiple imputation (MI) is a potential solution for handling missing covariates in the estimation of the PS. Unfortunately, it is not clear how to best apply MI strategies in the context of PSM. We conducted a simulation study to compare the performances of popular non-MI missing data methods and various MI-based strategies under different missing data mechanisms (MDMs). We found that commonly applied missing data methods resulted in biased and inefficient estimates, and we observed large variation in performance across MI-based strategies. Based on our findings, we recommend 1) deriving the PS after applying MI (referred to as MI-derPassive); 2) conducting PSM within each imputed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
