On the Need for Spatial Random Effects in Bayesian Regression Models for Multilevel Areal Data
Shuqi Lin, Joshua L. Warren

TL;DR
This paper provides a formal threshold to determine when spatial random effects are necessary in Bayesian multilevel areal data models, based on sample size and data characteristics.
Contribution
It derives a sample size threshold $m^*$ that guides practitioners on when spatial modeling impacts inference, aiding model selection.
Findings
The threshold $m^*$ depends on spatial correlation, variance ratios, and covariate alignment.
Posterior variance differences diminish at rate $O(m^{-1})$ as sample size increases.
Simulation studies validate the accuracy of the threshold $m^*$ across various scenarios.
Abstract
Although spatial models for areal data are widely used in multilevel settings, the conditions under which spatial and nonspatial random effects yield equivalent posterior inference for regression coefficients have never been formally characterized. We address this question within a hierarchical Bayesian framework for Gaussian outcomes, using the Leroux conditional autoregressive (CAR) prior distribution as a representative specification. We derive a closed-form sample size threshold, , below which spatial modeling materially affects inference on regression coefficients and above which a simpler nonspatial model yields effectively equivalent results, and show that the absolute relative difference in posterior variances converges to zero at rate . The threshold depends on three interpretable quantities: the spatial correlation parameter, the ratio of between-area to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
