"Robust-squared" Imputation Models Using BART
Yaoyuan V. Tan, Carol A.C. Flannagan, Michael R. Elliott

TL;DR
This paper introduces 'robust-squared' imputation models using Bayesian Additive Regression Trees (BART) to enhance the robustness of doubly-robust estimators for missing data, demonstrating improved performance in simulations and real datasets.
Contribution
The paper develops BART-based extensions of AIPWT and PSPP models, significantly improving robustness against model misspecification in missing data imputation.
Findings
BART-based models outperform traditional methods in simulations.
BART improves robustness and efficiency in real data applications.
Combined BART-PSPP yields the most efficient estimates.
Abstract
Examples of "doubly robust" estimator for missing data include augmented inverse probability weighting (AIPWT) models (Robins et al., 1994) and penalized splines of propensity prediction (PSPP) models (Zhang and Little, 2009). Doubly-robust estimators have the property that, if either the response propensity or the mean is modeled correctly, a consistent estimator of the population mean is obtained. However, doubly-robust estimators can perform poorly when modest misspecification is present in both models (Kang and Schafer, 2007). Here we consider extensions of the AIPWT and PSPP models that use Bayesian Additive Regression Trees (BART; Chipman et al., 2010) to provide highly robust propensity and mean model estimation. We term these "robust-squared" in the sense that the propensity score, the means, or both can be estimated with minimal model misspecification, and applied to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
