An Extension of Generalized Linear Models to Finite Mixture Outcome Distributions
Andrew M. Raim, Nagaraj K. Neerchal, Jorge G. Morel

TL;DR
This paper extends generalized linear models to incorporate finite mixture distributions for outcomes, enabling better modeling of heterogeneity and variability in categorical and count data.
Contribution
It introduces a likelihood-based extension of GLMs that models the response as a finite mixture, linking the mean to the regression, and handles complex mean structures.
Findings
Improved residual plots with extra variation support.
Wider prediction intervals achieved.
Application to success/failure and count data datasets.
Abstract
Finite mixture distributions arise in sampling a heterogeneous population. Data drawn from such a population will exhibit extra variability relative to any single subpopulation. Statistical models based on finite mixtures can assist in the analysis of categorical and count outcomes when standard generalized linear models (GLMs) cannot adequately account for variability observed in the data. We propose an extension of GLM where the response is assumed to follow a finite mixture distribution, while the regression of interest is linked to the mixture's mean. This approach may be preferred over a finite mixture of regressions when the population mean is the quantity of interest; here, only a single regression function must be specified and interpreted in the analysis. A technical challenge is that the mean of a finite mixture is a composite parameter which does not appear explicitly in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Statistical Methods and Inference
