Non-asymptotic model selection in block-diagonal mixture of polynomial experts models
TrungTin Nguyen, Faicel Chamroukhi, Hien Duy Nguyen, Florence Forbes

TL;DR
This paper develops a non-asymptotic model selection criterion for a complex mixture of polynomial experts model with block-diagonal covariance structures, providing finite-sample guarantees for selecting model complexity and structure.
Contribution
It introduces a penalized maximum likelihood criterion for BLoMPE models that handles model complexity, hidden structures, and provides theoretical finite-sample guarantees.
Findings
Proposed a non-asymptotic model selection criterion with oracle inequality.
Handled complex model structures including hidden graph interactions.
Provided theoretical guarantees for model selection performance.
Abstract
Model selection, via penalized likelihood type criteria, is a standard task in many statistical inference and machine learning problems. Progress has led to deriving criteria with asymptotic consistency results and an increasing emphasis on introducing non-asymptotic criteria. We focus on the problem of modeling non-linear relationships in regression data with potential hidden graph-structured interactions between the high-dimensional predictors, within the mixture of experts modeling framework. In order to deal with such a complex situation, we investigate a block-diagonal localized mixture of polynomial experts (BLoMPE) regression model, which is constructed upon an inverse regression and block-diagonal structures of the Gaussian expert covariance matrices. We introduce a penalized maximum likelihood selection criterion to estimate the unknown conditional density of the regression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
