The Effect of Sample Size and Missingness on Inference with Missing Data
Julian Morimoto

TL;DR
This paper develops a new asymptotic theory demonstrating that inference from partial data with missingness converges to that from complete data as sample size grows and missingness decreases, especially under Missing at Random conditions.
Contribution
It introduces a more general asymptotic framework for inference with missing data, showing that missing data mechanisms become asymptotically irrelevant under certain conditions.
Findings
Loglikelihood from partial data converges to complete data likelihood
Missing at Random data mechanisms have asymptotically no effect on inference
Inference methods remain consistent as sample size increases
Abstract
When are inferences (whether Direct-Likelihood, Bayesian, or Frequentist) obtained from partial data valid? This paper answers this question by offering a new asymptotic theory about inference with missing data that is more general than existing theories. It proves that as the sample size increases and the extent of missingness decreases, the average-loglikelihood function generated by partial data and that ignores the missingness mechanism will converge in probability to that which would have been generated by complete data; and if the data are Missing at Random, this convergence depends only on sample size. Thus, inferences from partial data, such as posterior modes, confidence intervals, likelihood ratios, test statistics, and indeed, all quantities or features derived from the partial-data loglikelihood function, will be consistently estimated. Additionally, the missing data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Inference · Statistical Methods in Clinical Trials
