A non-asymptotic approach for model selection via penalization in high-dimensional mixture of experts models
TrungTin Nguyen, Hien Duy Nguyen, Faicel Chamroukhi, Florence, Forbes

TL;DR
This paper introduces a non-asymptotic penalization approach for selecting high-dimensional mixture of experts models, specifically GLoME and BLoME, with theoretical risk bounds and empirical validation.
Contribution
It develops a non-asymptotic model selection framework with risk bounds for complex MoE models, addressing both computational and theoretical challenges.
Findings
Established non-asymptotic risk bounds for model selection
Demonstrated good empirical performance on synthetic and real data
Provided theoretical guarantees under penalty lower bounds
Abstract
Mixture of experts (MoE) are a popular class of statistical and machine learning models that have gained attention over the years due to their flexibility and efficiency. In this work, we consider Gaussian-gated localized MoE (GLoME) and block-diagonal covariance localized MoE (BLoME) regression models to present nonlinear relationships in heterogeneous data with potential hidden graph-structured interactions between high-dimensional predictors. These models pose difficult statistical estimation and model selection questions, both from a computational and theoretical perspective. This paper is devoted to the study of the problem of model selection among a collection of GLoME or BLoME models characterized by the number of mixture components, the complexity of Gaussian mean experts, and the hidden block-diagonal structures of the covariance matrices, in a penalized maximum likelihood…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
