What Is Meant by "Missing at Random"?
Shaun Seaman, John Galati, Dan Jackson, John Carlin

TL;DR
This paper clarifies the precise definitions of 'missing at random' and 'missing completely at random' to unify theory and clarify when valid inference can ignore missing data mechanisms.
Contribution
It provides standardized, precise definitions of missing data concepts and clarifies the conditions for valid inference across different inferential paradigms.
Findings
Standardized definitions of missing at random and missing completely at random.
Clarification of conditions for valid inference ignoring missingness.
Unification of theoretical understanding of missing data mechanisms.
Abstract
The concept of missing at random is central in the literature on statistical analysis with missing data. In general, inference using incomplete data should be based not only on observed data values but should also take account of the pattern of missing values. However, it is often said that if data are missing at random, valid inference using likelihood approaches (including Bayesian) can be obtained ignoring the missingness mechanism. Unfortunately, the term "missing at random" has been used inconsistently and not always clearly; there has also been a lack of clarity around the meaning of "valid inference using likelihood". These issues have created potential for confusion about the exact conditions under which the missingness mechanism can be ignored, and perhaps fed confusion around the meaning of "analysis ignoring the missingness mechanism". Here we provide standardised precise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
