The Effective Sample Size in Bayesian Information Criterion for Level-Specific Fixed and Random Effects Selection in a Two-Level Nested Model
Sun-Joo Cho, Hao Wu, Matthew Naveiras

TL;DR
This paper derives a new version of BIC, called BIC_E, for multilevel models that accurately accounts for level-specific fixed and random effects, resolving discrepancies in existing formulas and improving model selection.
Contribution
The study introduces BIC_E, a revised BIC penalty term for multilevel models that decomposes the sample size component and addresses issues with redundant random effects.
Findings
BIC_E accurately reflects empirical model selection behavior.
The penalty term decomposes into parts based on sample size and parameters.
BIC_E performs well in examples and simulations.
Abstract
Popular statistical software provides Bayesian information criterion (BIC) for multilevel models or linear mixed models. However, it has been observed that the combination of statistical literature and software documentation has led to discrepancies in the formulas of the BIC and uncertainties of the proper use of the BIC in selecting a multilevel model with respect to level-specific fixed and random effects. These discrepancies and uncertainties result from different specifications of sample size in the BIC's penalty term for multilevel models. In this study, we derive the BIC's penalty term for level-specific fixed and random effect selection in a two-level nested design. In this new version of BIC, called BIC_E, this penalty term is decomposed into two parts if the random effect variance-covariance matrix has full rank: (a) a term with the log of average sample size per cluster whose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimal Experimental Design Methods · Statistical Methods and Bayesian Inference
