A distribution-free mixed-integer optimization approach to hierarchical modelling of clustered and longitudinal data
Madhav Sankaranarayanan, Intekhab Hossain, Tom Chen

TL;DR
This paper introduces a novel distribution-free mixed-integer optimization method for hierarchical modeling of clustered and longitudinal data, improving feature selection and predictive accuracy in linear mixed effects models.
Contribution
It extends MIO-based subset selection to cluster-aware regression, enabling efficient, robust, and sparse solutions without distributional assumptions.
Findings
Method solves problems within minutes on synthetic and real data.
Outperforms traditional LMMs in sparsity and predictive power.
Enhances robustness by evaluating cluster effects for new data.
Abstract
Recent advancements in Mixed Integer Optimization (MIO) algorithms, paired with hardware enhancements, have led to significant speedups in resolving MIO problems. These strategies have been utilized for optimal subset selection, specifically for choosing features out of in linear regression given observations. In this paper, we broaden this method to facilitate cluster-aware regression, where selection aims to choose out of clusters in a linear mixed effects (LMM) model with observations for each cluster. Through comprehensive testing on a multitude of synthetic and real datasets, we exhibit that our method efficiently solves problems within minutes. Through numerical experiments, we also show that the MIO approach outperforms both Gaussian- and Laplace-distributed LMMs in terms of generating sparse solutions with high predictive power. Traditional LMMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Face and Expression Recognition · Statistical Methods and Inference
MethodsLinear Regression
