Unsupervised learning of regression mixture models with unknown number   of components

Faicel Chamroukhi

arXiv:1409.6981·stat.ME·September 25, 2014

Unsupervised learning of regression mixture models with unknown number of components

Faicel Chamroukhi

PDF

Open Access

TL;DR

This paper introduces an unsupervised learning algorithm for regression mixture models that automatically determines the number of components and is robust to initialization, improving curve clustering accuracy.

Contribution

It proposes a fully unsupervised penalized maximum likelihood approach with a robust EM algorithm that infers both model parameters and the number of components simultaneously.

Findings

01

Performs well on simulated data, accurately retrieving the number of clusters.

02

Demonstrates robustness to initialization issues.

03

Effective in real-world functional data clustering applications.

Abstract

Regression mixture models are widely studied in statistics, machine learning and data analysis. Fitting regression mixtures is challenging and is usually performed by maximum likelihood by using the expectation-maximization (EM) algorithm. However, it is well-known that the initialization is crucial for EM. If the initialization is inappropriately performed, the EM algorithm may lead to unsatisfactory results. The EM algorithm also requires the number of clusters to be given a priori; the problem of selecting the number of mixture components requires using model selection criteria to choose one from a set of pre-estimated candidate models. We propose a new fully unsupervised algorithm to learn regression mixture models with unknown number of components. The developed unsupervised learning approach consists in a penalized maximum likelihood estimation carried out by a robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Face and Expression Recognition