Unsupervised learning of regression mixture models with unknown number of components
Faicel Chamroukhi

TL;DR
This paper introduces an unsupervised learning algorithm for regression mixture models that automatically determines the number of components and is robust to initialization, improving curve clustering accuracy.
Contribution
It proposes a fully unsupervised penalized maximum likelihood approach with a robust EM algorithm that infers both model parameters and the number of components simultaneously.
Findings
Performs well on simulated data, accurately retrieving the number of clusters.
Demonstrates robustness to initialization issues.
Effective in real-world functional data clustering applications.
Abstract
Regression mixture models are widely studied in statistics, machine learning and data analysis. Fitting regression mixtures is challenging and is usually performed by maximum likelihood by using the expectation-maximization (EM) algorithm. However, it is well-known that the initialization is crucial for EM. If the initialization is inappropriately performed, the EM algorithm may lead to unsatisfactory results. The EM algorithm also requires the number of clusters to be given a priori; the problem of selecting the number of mixture components requires using model selection criteria to choose one from a set of pre-estimated candidate models. We propose a new fully unsupervised algorithm to learn regression mixture models with unknown number of components. The developed unsupervised learning approach consists in a penalized maximum likelihood estimation carried out by a robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Face and Expression Recognition
