Estimating the Number of Components in Finite Mixture Models via the Group-Sort-Fuse Procedure
Tudor Manole, Abbas Khalili

TL;DR
The paper introduces the Group-Sort-Fuse (GSF) procedure, a penalized likelihood method for accurately estimating the number of components in finite mixture models, with proven consistency and practical implementation.
Contribution
It proposes a novel GSF approach that directly penalizes model parameters, improving mixture order estimation and parameter convergence in multidimensional models.
Findings
GSF is consistent in estimating the true mixture order.
Achieves near parametric convergence rates for parameter estimation.
Demonstrates strong finite sample performance in simulations and real data.
Abstract
Estimation of the number of components (or order) of a finite mixture model is a long standing and challenging problem in statistics. We propose the Group-Sort-Fuse (GSF) procedure -- a new penalized likelihood approach for simultaneous estimation of the order and mixing measure in multidimensional finite mixture models. Unlike methods which fit and compare mixtures with varying orders using criteria involving model complexity, our approach directly penalizes a continuous function of the model parameters. More specifically, given a conservative upper bound on the order, the GSF groups and sorts mixture component parameters to fuse those which are redundant. For a wide range of finite mixture models, we show that the GSF is consistent in estimating the true mixture order and achieves the convergence rate for parameter estimation up to polylogarithmic factors. The GSF is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Distribution Estimation and Applications · Statistical Methods and Bayesian Inference
