Zero Inflation as a Missing Data Problem: a Proxy-based Approach
Trung Phung, Jaron J.R. Lee, Opeyemi Oladapo-Shittu, Eili Y. Klein,, Ayse Pinar Gurses, Susan M. Hannum, Kimberly Weems, Jill A. Marsteller, Sara, E. Cosgrove, Sara C. Keller, Ilya Shpitser

TL;DR
This paper treats zero-inflated data as a missing data problem and proposes a proxy-based approach for identification and estimation, addressing issues of zero inflation due to data recording conventions or equipment, with applications and sensitivity analysis.
Contribution
It introduces a proxy-based method for zero-inflated data, enabling identification and estimation under certain assumptions, and provides bounds when assumptions are unknown.
Findings
The method achieves identification with a known proxy relationship.
Partial identification and sensitivity analysis are feasible when the proxy relationship is unknown.
Simulation and real data demonstrate the approach's effectiveness.
Abstract
A common type of zero-inflated data has certain true values incorrectly replaced by zeros due to data recording conventions (rare outcomes assumed to be absent) or details of data recording equipment (e.g. artificial zeros in gene expression data). Existing methods for zero-inflated data either fit the observed data likelihood via parametric mixture models that explicitly represent excess zeros, or aim to replace excess zeros by imputed values. If the goal of the analysis relies on knowing true data realizations, a particular challenge with zero-inflated data is identifiability, since it is difficult to correctly determine which observed zeros are real and which are inflated. This paper views zero-inflated data as a general type of missing data problem, where the observability indicator for a potentially censored variable is itself unobserved whenever a zero is recorded. We show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMonetary Policy and Economic Impact · Economic theories and models
