SEMMS with Random Effects: A Mixed-Model Extension for Variable Selection in Clustered and Longitudinal Data
Haim Bar, Martin T. Wells

TL;DR
This paper extends the SEMMS variable selection method to mixed models with random effects, improving accuracy in clustered and longitudinal data by accounting for within-group correlations, and demonstrates superior performance in simulations and real data.
Contribution
The authors develop a mixed-model extension of SEMMS that incorporates random effects via an iterative algorithm, enhancing variable selection in correlated data scenarios.
Findings
Mixed-model SEMMS recovers true predictors in over 90% of Gaussian simulations.
The extended method outperforms plain SEMMS in non-Gaussian settings with high random-effect variance.
Simulation studies show improved variable selection accuracy across various data regimes.
Abstract
SEMMS (Scalable Empirical-Bayes Model for Marker Selection) is a variable-selection procedure for generalized linear models that uses a three-component normal mixture prior on regression coefficients. In its original form, SEMMS assumes that all observations are independent. Many real-world datasets, however, arise from repeated-measures or clustered designs in which observations within the same subject are correlated. Ignoring this correlation inflates the apparent residual variance and can severely degrade variable-selection performance. We extend SEMMS to accommodate random intercepts, random slopes, or both, via an alternating coordinate-ascent algorithm. After each round of fixed-effect variable selection, the subject-level best linear unbiased predictors (BLUPs) are updated with \texttt{lmer} (Gaussian) or \texttt{glmer} (non-Gaussian); the fixed-effect step then operates on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Statistical Methods and Inference
