An integrated approach to test for missing not at random

Jack Noonan; Adetola Adedamola Adediran; Robin Mitra; Stefanie; Biedermann

arXiv:2208.07813·stat.ME·December 8, 2022

An integrated approach to test for missing not at random

Jack Noonan, Adetola Adedamola Adediran, Robin Mitra, Stefanie, Biedermann

PDF

Open Access

TL;DR

This paper develops an integrated framework for testing missing data mechanisms, especially MNAR, using follow-up samples to improve hypothesis testing accuracy and efficiency.

Contribution

It introduces a comprehensive approach combining testing for MNAR with follow-up sample design, optimizing for error control and power.

Findings

01

Conditions identified for follow-up samples enabling valid MNAR tests

02

Method achieves known Type I error rates in simulations

03

Real data application demonstrates practical effectiveness

Abstract

Missing data can lead to inefficiencies and biases in analyses, in particular when data are missing not at random (MNAR). It is thus vital to understand and correctly identify the missing data mechanism. Recovering missing values through a follow up sample allows researchers to conduct hypothesis tests for MNAR, which are not possible when using only the original incomplete data. Investigating how properties of these tests are affected by the follow up sample design is little explored in the literature. Our results provide comprehensive insight into the properties of one such test, based on the commonly used selection model framework. We determine conditions for recovery samples that allow the test to be applied appropriately and effectively, i.e. with known Type I error rates and optimized with respect to power. We thus provide an integrated framework for testing for the presence of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Bayesian Modeling and Causal Inference · Statistical Methods and Bayesian Inference