Learning Neural Models for End-to-End Clustering

Benjamin Bruno Meier; Ismail Elezi; Mohammadreza Amirian; Oliver Durr; and Thilo Stadelmann

arXiv:1807.04001·cs.LG·July 12, 2018

Learning Neural Models for End-to-End Clustering

Benjamin Bruno Meier, Ismail Elezi, Mohammadreza Amirian, Oliver Durr, and Thilo Stadelmann

PDF

1 Repo

TL;DR

This paper introduces a neural network architecture capable of performing end-to-end probabilistic clustering on various data types, trained in a supervised manner to generalize across different datasets and grouping criteria.

Contribution

It presents a novel end-to-end neural clustering model that estimates distributions over cluster counts and assignments, differing from existing methods by enabling learnable, fully end-to-end clustering.

Findings

01

Promising results on image and speech datasets

02

Able to generalize to different data and grouping criteria

03

Distinct from deep metric learning and semi-supervised clustering

Abstract

We propose a novel end-to-end neural network architecture that, once trained, directly outputs a probabilistic clustering of a batch of input examples in one pass. It estimates a distribution over the number of clusters $k$ , and for each $1 \leq k \leq k_{max}$ , a distribution over the individual cluster assignment for each data point. The network is trained in advance in a supervised fashion on separate data to learn grouping by any perceptual similarity criterion based on pairwise labels (same/different group). It can then be applied to different data containing different groups. We demonstrate promising performance on high-dimensional data like images (COIL-100) and speech (TIMIT). We call this ``learning to cluster'' and show its conceptual difference to deep metric learning, semi-supervise clustering and other related approaches while having the advantage of performing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kutoga/learning2cluster
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.