Variable selection for clustering with Gaussian mixture models: state of   the art

Abdelghafour Talibi; Boujem\^aa Achchab; Rafik Lasri

arXiv:1701.08946·stat.ML·February 1, 2017·2 cites

Variable selection for clustering with Gaussian mixture models: state of the art

Abdelghafour Talibi, Boujem\^aa Achchab, Rafik Lasri

PDF

Open Access

TL;DR

This paper reviews current methods for variable selection in Gaussian mixture model-based clustering, highlighting challenges and potential improvements for handling large modern datasets.

Contribution

It provides a comprehensive overview of existing variable selection techniques in model-based clustering and discusses avenues for enhancing these methods.

Findings

01

Identifies limitations of current variable selection methods in large datasets.

02

Discusses potential improvements for model-based clustering variable selection.

03

Highlights the importance of relevant variable selection for clustering accuracy.

Abstract

The mixture models have become widely used in clustering, given its probabilistic framework in which its based, however, for modern databases that are characterized by their large size, these models behave disappointingly in setting out the model, making essential the selection of relevant variables for this type of clustering. After recalling the basics of clustering based on a model, this article will examine the variable selection methods for model-based clustering, as well as presenting opportunities for improvement of these methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Data Management and Algorithms