Estimating Cross-validatory Predictive P-values with Integrated Importance Sampling for Disease Mapping Models
Longhai Li, Cindy X. Feng, Shi Qiu

TL;DR
This paper introduces integrated importance sampling (iIS), a novel efficient method for estimating leave-one-out cross-validatory predictive p-values in disease mapping models, avoiding extensive reruns of MCMC analyses.
Contribution
The paper proposes iIS, a new importance sampling technique that accurately estimates LOOCV predictive p-values using only full data posterior samples, reducing computational cost.
Findings
iIS provides estimates nearly identical to actual LOOCV.
iIS outperforms existing methods like posterior predictive checking, importance sampling, and ghosting.
Empirical tests on disease datasets validate the effectiveness of iIS.
Abstract
An important statistical task in disease mapping problems is to identify divergent regions with unusually high or low risk of disease. Leave-one-out cross-validatory (LOOCV) model assessment is the gold standard for estimating predictive p-values that can flag such divergent regions. However, actual LOOCV is time-consuming because one needs to rerun a Markov chain Monte Carlo analysis for each posterior distribution in which an observation is held out as a test case. This paper introduces a new method, called integrated importance sampling (iIS), for estimating LOOCV predictive p-values with only Markov chain samples drawn from the posterior based on a full data set. The key step in iIS is that we integrate away the latent variables associated the test observation with respect to their conditional distribution \textit{without} reference to the actual observation. By following the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
