Subset selection for linear mixed models

Daniel R. Kowal

arXiv:2107.12890·stat.ME·April 20, 2022·1 cites

Subset selection for linear mixed models

Daniel R. Kowal

PDF

Open Access

TL;DR

This paper presents a Bayesian decision analysis approach for subset selection in linear mixed models, effectively handling structured dependence and providing stable, near-optimal variable subsets with uncertainty quantification.

Contribution

It introduces a novel Bayesian decision framework for subset selection in LMMs that emphasizes multiple near-optimal subsets over a single best subset.

Findings

01

Demonstrates superior prediction and estimation on simulated data.

02

Provides scalable algorithms for subset search.

03

Shows effective variable importance metrics.

Abstract

Linear mixed models (LMMs) are instrumental for regression analysis with structured dependence, such as grouped, clustered, or multilevel data. However, selection among the covariates--while accounting for this structured dependence--remains a challenge. We introduce a Bayesian decision analysis for subset selection with LMMs. Using a Mahalanobis loss function that incorporates the structured dependence, we derive optimal linear coefficients for (i) any given subset of variables and (ii) all subsets of variables that satisfy a cardinality constraint. Crucially, these estimates inherit shrinkage or regularization and uncertainty quantification from the underlying Bayesian model, and apply for any well-specified Bayesian LMM. More broadly, our decision analysis strategy deemphasizes the role of a single "best" subset, which is often unstable and limited in its information content, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Bayesian Modeling and Causal Inference