Zero-Inflated Logistic Regression Models with Shared Design: Identifiability, Existence of Estimates, and a Relabeling Rule
Yui Tomo, Shinto Eguchi, Daisuke Yoneoka

TL;DR
This paper analyzes the theoretical properties of zero-inflated logistic regression models with shared design matrices, focusing on identifiability, estimation, and a relabeling rule, supported by simulations and real data application.
Contribution
It establishes conditions for model identifiability, existence of estimates, and introduces a relabeling rule for parameter selection in shared-design zero-inflated logistic models.
Findings
Model is identifiable up to exchange symmetry of parameters.
Sufficient conditions for maximum likelihood estimate existence are provided.
A simple relabeling rule effectively selects ordered parameters.
Abstract
The zero-inflated logistic regression model accommodates binary responses with excess zeros, which often arise from a latent mixture of susceptible and insusceptible subpopulations or asymmetric misclassification of the response. The model has two components: regression for the binary response and a latent binary indicator for the zero-inflation state. In applied settings, it is common to use the same design matrix for both components if there is no prior knowledge. However, this shared-design specification lacks guaranteed identifiability of the regression parameters, as established in prior works. This paper investigates the theoretical properties of the zero-inflated logistic regression model under the shared-design setting and computational methods for applications. First, to motivate the use of the zero-inflated model, we prove that ignoring the zero-inflation mechanism can lead to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
