Meta-learning representations for clustering with infinite Gaussian   mixture models

Tomoharu Iwata

arXiv:2103.00694·stat.ML·March 2, 2021

Meta-learning representations for clustering with infinite Gaussian mixture models

Tomoharu Iwata

PDF

TL;DR

This paper introduces a meta-learning approach that trains neural networks to produce representations optimized for clustering with infinite Gaussian mixture models, improving clustering performance on unseen data.

Contribution

It presents a novel meta-learning framework that directly optimizes representations for clustering via a differentiable approximation of the ARI and VB inference, enabling better generalization.

Findings

01

Higher adjusted Rand index than existing methods

02

Effective on both text and image datasets

03

Meta-learned representations improve clustering performance

Abstract

For better clustering performance, appropriate representations are critical. Although many neural network-based metric learning methods have been proposed, they do not directly train neural networks to improve clustering performance. We propose a meta-learning method that train neural networks for obtaining representations such that clustering performance improves when the representations are clustered by the variational Bayesian (VB) inference with an infinite Gaussian mixture model. The proposed method can cluster unseen unlabeled data using knowledge meta-learned with labeled data that are different from the unlabeled data. For the objective function, we propose a continuous approximation of the adjusted Rand index (ARI), by which we can evaluate the clustering performance from soft clustering assignments. Since the approximated ARI and the VB inference procedure are differentiable,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.