Estimation and imputation of missing data in longitudinal models with Zero-Inflated Poisson response variable
D. S. Martinez-Lobo, O.O. Melo, N.A. Cruz

TL;DR
This paper presents a new EM-based methodology for estimating and imputing missing data in longitudinal Zero-Inflated Poisson models, demonstrating effectiveness through simulations and real corn growth data.
Contribution
It introduces a maximum likelihood approach using EM for zero-inflated Poisson data with missing values, applicable in longitudinal studies.
Findings
Method performs well in various missing data scenarios
Outperforms simple imputation methods in simulations
Successfully applied to real corn growth data
Abstract
This research deals with the estimation and imputation of missing data in longitudinal models with a Poisson response variable inflated with zeros. A methodology is proposed that is based on the use of maximum likelihood, assuming that data is missing at random and that there is a correlation between the response variables. In each of the times, the expectation maximization (EM) algorithm is used: in step E, a weighted regression is carried out, conditioned on the previous times that are taken as covariates. In step M, the estimation and imputation of the missing data are performed. The good performance of the methodology in different loss scenarios is demonstrated in a simulation study comparing the model only with complete data, and estimating missing data using the mode of the data of each individual. Furthermore, in a study related to the growth of corn, it is tested on real data to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference
