Generally-Altered, -Inflated, -Truncated and -Deflated Regression, With Application to Heaped and Seeped Data
Thomas W. Yee, Chenchen Ma

TL;DR
This paper introduces a comprehensive regression model that unifies various count data modifications like inflation, truncation, and deflation, effectively handling complex data features such as heaping and seepage.
Contribution
The paper proposes a super model for count data that unifies multiple data alterations and provides flexible handling of dispersion and data irregularities, implemented in the VGAM R package.
Findings
Model accommodates up to seven modes.
Handles under-, equi-, and over-dispersion.
Effectively manages heaped and seeped data.
Abstract
Models such as the zero-inflated and zero-altered Poisson and zero-truncated binomial are well-established in modern regression analysis. We propose a super model that jointly and maximally unifies alteration, inflation, truncation and deflation for counts, given a 1- or 2-parameter parent (base) distribution. Seven disjoint sets of special value types are accommodated because all but truncation have parametric and nonparametric variants. Some highlights include: (i) the mixture distribution is exceeding flexible, e.g., up to seven modes; (ii) under-, equi- and over-dispersion can be handled using a negative binomial (NB) parent, with underdispersion handled by a novel Generally-Truncated-Expansion method; (iii) overdispersion can be studied holistically in terms of the four operators; (iv) an important application: heaped and seeped data from retrospective self-reported surveys are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Survey Sampling and Estimation Techniques
