Mixed Effects Models are Sometimes Terrible

Christopher Eager; Joseph Roy

arXiv:1701.04858·stat.AP·January 19, 2017·87 cites

Mixed Effects Models are Sometimes Terrible

Christopher Eager, Joseph Roy

PDF

Open Access

TL;DR

This paper investigates the frequent non-convergence issues of mixed-effects models in linguistics, demonstrating that non-convergence often indicates model misspecification rather than over-simplification, and proposes Bayesian models as a solution.

Contribution

The study challenges the parsimonious convergence hypothesis and introduces Bayesian modeling with rstan to improve convergence in mixed-effects models.

Findings

01

lme4 models have high non-convergence rates with known effect structures

02

non-convergence is not necessarily due to over-specified models

03

Bayesian models significantly reduce convergence problems

Abstract

Mixed-effects models have emerged as the gold standard of statistical analysis in different sub-fields of linguistics (Baayen, Davidson & Bates, 2008; Johnson, 2009; Barr, et al, 2013; Gries, 2015). One problematic feature of these models is their failure to converge under maximal (or even near-maximal) random effects structures. The lack of convergence is relatively unaddressed in linguistics and when it is addressed has resulted in statistical practices (e.g. Jaeger, 2009; Gries, 2015; Bates, et al, 2015b) that are premised on the idea that non-convergence is an indication that a random effects structure is over-specified (or not parsimonious), the parsimonious convergence hypothesis (PCH). We test the PCH by running simulations in lme4 under two sets of assumptions for both a linear dependent variable and a binary dependent variable in order to assess the rate of non-convergence for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Bayesian Inference · Spatial and Panel Data Analysis · Psychometric Methodologies and Testing