Non-asymptotic oracle inequalities for the Lasso in high-dimensional mixture of experts
TrungTin Nguyen, Hien D Nguyen, Faicel Chamroukhi, Geoffrey J, McLachlan

TL;DR
This paper establishes non-asymptotic oracle inequalities for the Lasso estimator in high-dimensional Gaussian mixture of experts models, providing theoretical guarantees and empirical validation for their estimation properties.
Contribution
It introduces the first non-asymptotic analysis of Lasso regularization in softmax-gated Gaussian MoE models under mild assumptions.
Findings
Provided a lower bound on the Lasso regularization parameter.
Achieved non-asymptotic control of the Kullback-Leibler loss.
Validated theoretical results through simulations.
Abstract
We investigate the estimation properties of the mixture of experts (MoE) model in a high-dimensional setting, where the number of predictors is much larger than the sample size, and for which the literature is particularly lacking in theoretical results. We consider the class of softmax-gated Gaussian MoE (SGMoE) models, defined as MoE models with softmax gating functions and Gaussian experts, and focus on the theoretical properties of their -regularized estimation via the Lasso. To the best of our knowledge, we are the first to investigate the -regularization properties of SGMoE models from a non-asymptotic perspective, under the mildest assumptions, namely the boundedness of the parameter space. We provide a lower bound on the regularization parameter of the Lasso penalty that ensures non-asymptotic theoretical control of the Kullback--Leibler loss of the Lasso estimator for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference
MethodsFeature Selection
