Generalised Mutual Information: a Framework for Discriminative Clustering
Louis Ohl, Pierre-Alexandre Mattei, Charles Bouveyron, Warith, Harchaoui, Micka\"el Leclercq, Arnaud Droit, Fr\'ed\'eric Precioso

TL;DR
This paper introduces Generalised Mutual Information (GEMINI), a new framework that improves clustering by addressing limitations of traditional MI, enabling automatic cluster number selection and reducing the need for regularisations.
Contribution
The paper proposes GEMINI, a generalized mutual information framework that is geometry-aware, eliminates the need for regularisations, and can automatically determine the number of clusters.
Findings
GEMINI addresses MI limitations in clustering.
Some GEMINIs do not require regularisations.
GEMINI can automatically select the number of clusters.
Abstract
In the last decade, recent successes in deep clustering majorly involved the Mutual Information (MI) as an unsupervised objective for training neural networks with increasing regularisations. While the quality of the regularisations have been largely discussed for improvements, little attention has been dedicated to the relevance of MI as a clustering objective. In this paper, we first highlight how the maximisation of MI does not lead to satisfying clusters. We identified the Kullback-Leibler divergence as the main reason of this behaviour. Hence, we generalise the mutual information by changing its core distance, introducing the Generalised Mutual Information (GEMINI): a set of metrics for unsupervised neural network training. Unlike MI, some GEMINIs do not require regularisations when training as they are geometry-aware thanks to distances or kernels in the data space. Finally, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Neural Networks and Applications · Remote-Sensing Image Classification
