TL;DR
This paper evaluates bootstrap methods for multiple imputation when models are uncongenial or misspecified, recommending a specific approach that provides valid inference in such challenging scenarios.
Contribution
It identifies the limitations of traditional bootstrap methods under uncongeniality and proposes a validated, efficient bootstrap-then-impute procedure for reliable inference.
Findings
Bootstrapping followed by imputation yields valid variance estimates.
Imputation followed by bootstrapping often produces invalid results.
Recommended method is computationally efficient and robust under model misspecification.
Abstract
Multiple imputation has become one of the most popular approaches for handling missing data in statistical analyses. Part of this success is due to Rubin's simple combination rules. These give frequentist valid inferences when the imputation and analysis procedures are so called congenial and the complete data analysis is valid, but otherwise may not. Roughly speaking, congeniality corresponds to whether the imputation and analysis models make different assumptions about the data. In practice imputation and analysis procedures are often not congenial, such that tests may not have the correct size and confidence interval coverage deviates from the advertised level. We examine a number of recent proposals which combine bootstrapping with multiple imputation, and determine which are valid under uncongeniality and model misspecification. Imputation followed by bootstrapping generally does…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
