TL;DR
This paper introduces an invariant information clustering method that learns to classify images without labels, achieving state-of-the-art results in unsupervised image classification and segmentation across multiple benchmarks.
Contribution
The paper proposes a novel mutual information-based clustering objective that directly outputs semantic labels, outperforming existing methods in unsupervised and semi-supervised image classification.
Findings
Achieved state-of-the-art accuracy on STL10 and CIFAR10 benchmarks.
Outperformed competitors by 6.6 and 9.5 percentage points respectively.
Effective in semi-supervised settings with limited labels.
Abstract
We present a novel clustering objective that learns a neural network classifier from scratch, given only unlabelled data samples. The model discovers clusters that accurately match semantic classes, achieving state-of-the-art results in eight unsupervised clustering benchmarks spanning image classification and segmentation. These include STL10, an unsupervised variant of ImageNet, and CIFAR10, where we significantly beat the accuracy of our closest competitors by 6.6 and 9.5 absolute percentage points respectively. The method is not specialised to computer vision and operates on any paired dataset samples; in our experiments we use random transforms to obtain a pair from each image. The trained network directly outputs semantic labels, rather than high dimensional representations that need external processing to be usable for semantic clustering. The objective is simply to maximise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
