Mixed Effects Models are Sometimes Terrible
Christopher Eager, Joseph Roy

TL;DR
This paper investigates the frequent non-convergence issues of mixed-effects models in linguistics, demonstrating that non-convergence often indicates model misspecification rather than over-simplification, and proposes Bayesian models as a solution.
Contribution
The study challenges the parsimonious convergence hypothesis and introduces Bayesian modeling with rstan to improve convergence in mixed-effects models.
Findings
lme4 models have high non-convergence rates with known effect structures
non-convergence is not necessarily due to over-specified models
Bayesian models significantly reduce convergence problems
Abstract
Mixed-effects models have emerged as the gold standard of statistical analysis in different sub-fields of linguistics (Baayen, Davidson & Bates, 2008; Johnson, 2009; Barr, et al, 2013; Gries, 2015). One problematic feature of these models is their failure to converge under maximal (or even near-maximal) random effects structures. The lack of convergence is relatively unaddressed in linguistics and when it is addressed has resulted in statistical practices (e.g. Jaeger, 2009; Gries, 2015; Bates, et al, 2015b) that are premised on the idea that non-convergence is an indication that a random effects structure is over-specified (or not parsimonious), the parsimonious convergence hypothesis (PCH). We test the PCH by running simulations in lme4 under two sets of assumptions for both a linear dependent variable and a binary dependent variable in order to assess the rate of non-convergence for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Spatial and Panel Data Analysis · Psychometric Methodologies and Testing
