A non-asymptotic approach for model selection via penalization in   high-dimensional mixture of experts models

TrungTin Nguyen; Hien Duy Nguyen; Faicel Chamroukhi; Florence; Forbes

arXiv:2104.02640·math.ST·September 29, 2022·1 cites

A non-asymptotic approach for model selection via penalization in high-dimensional mixture of experts models

TrungTin Nguyen, Hien Duy Nguyen, Faicel Chamroukhi, Florence, Forbes

PDF

Open Access 1 Repo

TL;DR

This paper introduces a non-asymptotic penalization approach for selecting high-dimensional mixture of experts models, specifically GLoME and BLoME, with theoretical risk bounds and empirical validation.

Contribution

It develops a non-asymptotic model selection framework with risk bounds for complex MoE models, addressing both computational and theoretical challenges.

Findings

01

Established non-asymptotic risk bounds for model selection

02

Demonstrated good empirical performance on synthetic and real data

03

Provided theoretical guarantees under penalty lower bounds

Abstract

Mixture of experts (MoE) are a popular class of statistical and machine learning models that have gained attention over the years due to their flexibility and efficiency. In this work, we consider Gaussian-gated localized MoE (GLoME) and block-diagonal covariance localized MoE (BLoME) regression models to present nonlinear relationships in heterogeneous data with potential hidden graph-structured interactions between high-dimensional predictors. These models pose difficult statistical estimation and model selection questions, both from a computational and theoretical perspective. This paper is devoted to the study of the problem of model selection among a collection of GLoME or BLoME models characterized by the number of mixture components, the complexity of Gaussian mean experts, and the hidden block-diagonal structures of the covariance matrices, in a penalized maximum likelihood…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Trung-TinNGUYEN/NamsGLoME-Simulation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Statistical Methods and Bayesian Inference