Ignorability in Statistical and Probabilistic Inference
M. Jaeger

TL;DR
This paper analyzes the conditions under which missing data assumptions like MAR and CAR can be justified, providing criteria and characterizations to evaluate their reasonableness in statistical and probabilistic inference.
Contribution
It offers a comprehensive overview of different MAR and CAR assumptions, introduces criteria for their validity, and extends existing theoretical results on their support and procedural conditions.
Findings
Distributional MAR/CAR assumptions are less restrictive and more applicable.
Provides an equivalence characterization of CAR in terms of support structure.
Argues that the coarsened completely at random (CCAR) condition is the most reasonable assumption.
Abstract
When dealing with incomplete data in statistical learning, or incomplete observations in probabilistic inference, one needs to distinguish the fact that a certain event is observed from the fact that the observed event has happened. Since the modeling and computational complexities entailed by maintaining this proper distinction are often prohibitive, one asks for conditions under which it can be safely ignored. Such conditions are given by the missing at random (mar) and coarsened at random (car) assumptions. In this paper we provide an in-depth analysis of several questions relating to mar/car assumptions. Main purpose of our study is to provide criteria by which one may evaluate whether a car assumption is reasonable for a particular data collecting or observational process. This question is complicated by the fact that several distinct versions of mar/car assumptions exist. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
