Instability of inverse probability weighting methods and a remedy for non-ignorable missing data
Pengfei Li, Jing Qin, Yukun Liu

TL;DR
This paper identifies the instability issues of inverse probability weighting methods in non-ignorable missing data analysis and proposes a semiparametric modeling approach with a maximum likelihood estimation to improve stability and accuracy.
Contribution
The paper introduces a novel semiparametric method that avoids the instability of IPW by modeling outcome distribution and missingness jointly with a maximum likelihood approach.
Findings
Proposed method outperforms existing methods in simulations
The approach is robust to high missingness probabilities
Real data examples demonstrate practical advantages
Abstract
Inverse probability weighting (IPW) methods are commonly used to analyze non-ignorable missing data under the assumption of a logistic model for the missingness probability. However, solving IPW equations numerically may involve non-convergence problems when the sample size is moderate and the missingness probability is high. Moreover, those equations often have multiple roots, and identifying the best root is challenging. Therefore, IPW methods may have low efficiency or even produce biased results. We identify the pitfall in these methods pathologically: they involve the estimation of a moment-generating function, and such functions are notoriously unstable in general. As a remedy, we model the outcome distribution given the covariates of the completely observed individuals semiparametrically. After forming an induced logistic regression model for the missingness status of the outcome…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
