Double Probability Integral Transform Residuals for Regression Models with Discrete Outcomes
Lu Yang

TL;DR
This paper introduces a novel residual diagnostic method for regression models with discrete outcomes, enabling more accurate model assessment and identification of misspecification issues.
Contribution
The paper develops a new class of residuals based on double probability integral transforms for discrete outcomes, with theoretical justification and practical tools for model diagnostics.
Findings
Proposed residuals follow a uniform distribution under correct models.
Residuals enable straightforward visual model checks like QQ plots.
Simulation shows the new residuals outperform traditional methods.
Abstract
The assessment of regression models with discrete outcomes is challenging and has many fundamental issues. With discrete outcomes, standard regression model assessment tools such as Pearson and deviance residuals do not follow the conventional reference distribution (normal) under the true model, calling into question the legitimacy of model assessment based on these tools. To fill this gap, we construct a new type of residuals for general discrete outcomes, including ordinal and count outcomes. The proposed residuals are based on two layers of probability integral transformation. When at least one continuous covariate is available, the proposed residuals closely follow a uniform distribution (or a normal distribution after transformation) under the correctly specified model. One can construct visualizations such as QQ plots to check the overall fit of a model straightforwardly, and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
