Variable selection for clustering with Gaussian mixture models: state of the art
Abdelghafour Talibi, Boujem\^aa Achchab, Rafik Lasri

TL;DR
This paper reviews current methods for variable selection in Gaussian mixture model-based clustering, highlighting challenges and potential improvements for handling large modern datasets.
Contribution
It provides a comprehensive overview of existing variable selection techniques in model-based clustering and discusses avenues for enhancing these methods.
Findings
Identifies limitations of current variable selection methods in large datasets.
Discusses potential improvements for model-based clustering variable selection.
Highlights the importance of relevant variable selection for clustering accuracy.
Abstract
The mixture models have become widely used in clustering, given its probabilistic framework in which its based, however, for modern databases that are characterized by their large size, these models behave disappointingly in setting out the model, making essential the selection of relevant variables for this type of clustering. After recalling the basics of clustering based on a model, this article will examine the variable selection methods for model-based clustering, as well as presenting opportunities for improvement of these methods.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Data Management and Algorithms
