The 2008 election: A preregistered replication analysis
Rayleigh Lei, Andrew Gelman, and Yair Ghitza

TL;DR
This paper conducts a series of preregistered replications of a 2013 analysis of 2008 US election polls, revealing potential issues with previous models and demonstrating the value of preregistration in historical and exploratory research.
Contribution
It introduces a rigorous preregistration approach to replicating complex election analysis, highlighting potential model limitations and emphasizing transparency in historical data analysis.
Findings
Potential model inadequately accounts for nonsampling error
Some patterns in earlier analysis may reflect noise
Demographic and geographic insights are validated
Abstract
We present an increasingly stringent set of replications of Ghitza & Gelman (2013), a multilevel regression and poststratification analysis of polls from the 2008 U.S. presidential election campaign, focusing on a set of plots showing the estimated Republican vote share for whites and for all voters, as a function of income level in each of the states. We start with a nearly-exact duplication that uses the posted code and changes only the model-fitting algorithm; we then replicate using already-analyzed data from 2004; and finally we set up preregistered replications using two surveys from 2008 that we had not previously looked at. We have already learned from our preliminary, non-preregistered replication, which has revealed a potential problem with the published analysis of Ghitza & Gelman (2013); it appears that our model may not sufficiently account for nonsampling error, and that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectoral Systems and Political Participation · Statistical Methods and Inference · Data Analysis with R
