A Tutorial on Discriminative Clustering and Mutual Information
Louis Ohl, Pierre-Alexandre Mattei, Fr\'ed\'eric Precioso

TL;DR
This paper provides a comprehensive overview of discriminative clustering methods, emphasizing the role of mutual information, its evolution, limitations, and the challenges in cluster number selection, supported by a Python package implementation.
Contribution
It offers a historical perspective on discriminative clustering, highlighting the evolution of assumptions and the significance of mutual information, along with practical tools for implementation.
Findings
Mutual information has been central to discriminative clustering development.
Discriminative models evolved from decision boundaries to invariance critics.
Challenges remain in selecting the optimal number of clusters.
Abstract
To cluster data is to separate samples into distinctive groups that should ideally have some cohesive properties. Today, numerous clustering algorithms exist, and their differences lie essentially in what can be perceived as ``cohesive properties''. Therefore, hypotheses on the nature of clusters must be set: they can be either generative or discriminative. As the last decade witnessed the impressive growth of deep clustering methods that involve neural networks to handle high-dimensional data often in a discriminative manner; we concentrate mainly on the discriminative hypotheses. In this paper, our aim is to provide an accessible historical perspective on the evolution of discriminative clustering methods and notably how the nature of assumptions of the discriminative models changed over time: from decision boundaries to invariance critics. We notably highlight how mutual information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Bayesian Methods and Mixture Models · Data Analysis with R
