An integrated approach to test for missing not at random
Jack Noonan, Adetola Adedamola Adediran, Robin Mitra, Stefanie, Biedermann

TL;DR
This paper develops an integrated framework for testing missing data mechanisms, especially MNAR, using follow-up samples to improve hypothesis testing accuracy and efficiency.
Contribution
It introduces a comprehensive approach combining testing for MNAR with follow-up sample design, optimizing for error control and power.
Findings
Conditions identified for follow-up samples enabling valid MNAR tests
Method achieves known Type I error rates in simulations
Real data application demonstrates practical effectiveness
Abstract
Missing data can lead to inefficiencies and biases in analyses, in particular when data are missing not at random (MNAR). It is thus vital to understand and correctly identify the missing data mechanism. Recovering missing values through a follow up sample allows researchers to conduct hypothesis tests for MNAR, which are not possible when using only the original incomplete data. Investigating how properties of these tests are affected by the follow up sample design is little explored in the literature. Our results provide comprehensive insight into the properties of one such test, based on the commonly used selection model framework. We determine conditions for recovery samples that allow the test to be applied appropriately and effectively, i.e. with known Type I error rates and optimized with respect to power. We thus provide an integrated framework for testing for the presence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Modeling and Causal Inference · Statistical Methods and Bayesian Inference
