Logistic regression with missing responses and predictors: a review of existing approaches and a case study
Susana Rafaela Martins, Jacobo de U\~na-\'Alvarez, Mar\'ia del, Carmen Iglesias-P\'erez

TL;DR
This paper reviews methods for logistic regression with missing data, compares their performance through simulations, and applies the best methods to a real case study on childhood obesity, providing practical guidance.
Contribution
It offers a comprehensive review of existing approaches for logistic regression with missing data and evaluates their performance, highlighting maximum likelihood as the most effective method.
Findings
Maximum likelihood yields the least bias and variance.
Multiple imputation with five datasets performs nearly as well.
All methods agree on the importance of past obesity and physical scores.
Abstract
In this work logistic regression when both the response and the predictor variables may be missing is considered. Several existing approaches are reviewed, including complete case analysis, inverse probability weighting, multiple imputation and maximum likelihood. The methods are compared in a simulation study, which serves to evaluate the bias, the variance and the mean squared error of the estimators for the regression coefficients. In the simulations, the maximum likelihood methodology is the one that presents the best results, followed by multiple imputation with five imputations, which is the second best. The methods are applied to a case study on the obesity for schoolchildren in the municipality of Viana do Castelo, North Portugal, where a logistic regression model is used to predict the International Obesity Task Force (IOTF) indicator from physical examinations and the past…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Applications · Multi-Criteria Decision Making
