Missing at Random or Not: A Semiparametric Testing Approach
Rui Duan, C. Jason Liang, Pamela Shaw, Cheng Yong Tang, Yong Chen

TL;DR
This paper introduces a new nonparametric hypothesis testing method to distinguish between missing at random and missing not at random data mechanisms, using discrepancy measures and instrumental variables.
Contribution
It develops a broadly applicable, nonparametric test for missing data mechanisms that does not require explicit model specification, improving decision accuracy.
Findings
The test effectively differentiates missing at random from not at random.
The method is validated through theoretical analysis, simulations, and real data.
It offers a data-driven approach for missing data mechanism selection.
Abstract
Practical problems with missing data are common, and statistical methods have been developed concerning the validity and/or efficiency of statistical procedures. On a central focus, there have been longstanding interests on the mechanism governing data missingness, and correctly deciding the appropriate mechanism is crucially relevant for conducting proper practical investigations. The conventional notions include the three common potential classes -- missing completely at random, missing at random, and missing not at random. In this paper, we present a new hypothesis testing approach for deciding between missing at random and missing not at random. Since the potential alternatives of missing at random are broad, we focus our investigation on a general class of models with instrumental variables for data missing not at random. Our setting is broadly applicable, thanks to that the model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Inference · Statistical Methods in Clinical Trials
