Estimation and Inference for Very Large Linear Mixed Effects Models
K. Gao, A. B. Owen

TL;DR
This paper introduces a computationally efficient method of moments approach for estimating large linear mixed effects models with crossed random effects, enabling scalable inference with guarantees on consistency and asymptotic normality.
Contribution
It proposes an O(N) cost method of moments technique that accounts for correlation structures in large crossed random effects models, improving scalability and inference accuracy.
Findings
Method achieves O(N) computational complexity.
Provides conditions for consistency and asymptotic normality.
Demonstrated on real data from Stitch Fix.
Abstract
Linear mixed models with large imbalanced crossed random effects structures pose severe computational problems for maximum likelihood estimation and for Bayesian analysis. The costs can grow as fast as when there are N observations. Such problems arise in any setting where the underlying factors satisfy a many to many relationship (instead of a nested one) and in electronic commerce applications, the N can be quite large. Methods that do not account for the correlation structure can greatly underestimate uncertainty. We propose a method of moments approach that takes account of the correlation structure and that can be computed at O(N) cost. The method of moments is very amenable to parallel computation and it does not require parametric distributional assumptions, tuning parameters or convergence diagnostics. For the regression coefficients, we give conditions for consistency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Inference · Statistical Methods in Clinical Trials
