A weakly informative default prior distribution for logistic and other regression models
Andrew Gelman, Aleks Jakulin, Maria Grazia Pittau, Yu-Sung Su

TL;DR
This paper introduces a new default prior distribution for logistic regression models, using Student-$t$ distributions, which improves robustness and performance, especially in cases of complete separation and complex interactions.
Contribution
The paper proposes a Cauchy prior as a default for logistic regression, with an implementation in R using an EM algorithm, enhancing stability and automatic shrinkage.
Findings
Outperforms Gaussian and Laplace priors in cross-validation
Provides stable estimates even with complete separation
Automatically shrinks higher-order interactions
Abstract
We propose a new prior distribution for classical (nonhierarchical) logistic regression models, constructed by first scaling all nonbinary variables to have mean 0 and standard deviation 0.5, and then placing independent Student- prior distributions on the coefficients. As a default choice, we recommend the Cauchy distribution with center 0 and scale 2.5, which in the simplest setting is a longer-tailed version of the distribution attained by assuming one-half additional success and one-half additional failure in a logistic regression. Cross-validation on a corpus of datasets shows the Cauchy class of prior distributions to outperform existing implementations of Gaussian and Laplace priors. We recommend this prior distribution as a default choice for routine applied use. It has the advantage of always giving answers, even when there is complete separation in logistic regression (a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
