A weakly informative default prior distribution for logistic and other   regression models

Andrew Gelman; Aleks Jakulin; Maria Grazia Pittau; Yu-Sung Su

arXiv:0901.4011·stat.AP·January 27, 2009

A weakly informative default prior distribution for logistic and other regression models

Andrew Gelman, Aleks Jakulin, Maria Grazia Pittau, Yu-Sung Su

PDF

TL;DR

This paper introduces a new default prior distribution for logistic regression models, using Student-$t$ distributions, which improves robustness and performance, especially in cases of complete separation and complex interactions.

Contribution

The paper proposes a Cauchy prior as a default for logistic regression, with an implementation in R using an EM algorithm, enhancing stability and automatic shrinkage.

Findings

01

Outperforms Gaussian and Laplace priors in cross-validation

02

Provides stable estimates even with complete separation

03

Automatically shrinks higher-order interactions

Abstract

We propose a new prior distribution for classical (nonhierarchical) logistic regression models, constructed by first scaling all nonbinary variables to have mean 0 and standard deviation 0.5, and then placing independent Student- $t$ prior distributions on the coefficients. As a default choice, we recommend the Cauchy distribution with center 0 and scale 2.5, which in the simplest setting is a longer-tailed version of the distribution attained by assuming one-half additional success and one-half additional failure in a logistic regression. Cross-validation on a corpus of datasets shows the Cauchy class of prior distributions to outperform existing implementations of Gaussian and Laplace priors. We recommend this prior distribution as a default choice for routine applied use. It has the advantage of always giving answers, even when there is complete separation in logistic regression (a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.