Unsupervised Learning by Predicting Noise

Piotr Bojanowski; Armand Joulin

arXiv:1704.05310·stat.ML·April 19, 2017·130 cites

Unsupervised Learning by Predicting Noise

Piotr Bojanowski, Armand Joulin

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel unsupervised learning framework for deep neural networks that aligns features to fixed noise targets, avoiding collapse and scaling efficiently to large datasets.

Contribution

It introduces Noise As Targets (NAT), a domain-agnostic method that trains deep networks without supervision, outperforming existing unsupervised techniques on standard benchmarks.

Findings

01

Achieves competitive results on ImageNet and Pascal VOC

02

Scales efficiently to millions of images

03

Avoids trivial solutions and feature collapse

Abstract

Convolutional neural networks provide visual features that perform remarkably well in many computer vision applications. However, training these networks requires significant amounts of supervision. This paper introduces a generic framework to train deep networks, end-to-end, with no supervision. We propose to fix a set of target representations, called Noise As Targets (NAT), and to constrain the deep features to align to them. This domain agnostic approach avoids the standard unsupervised learning issues of trivial solutions and collapsing of features. Thanks to a stochastic batch reassignment strategy and a separable square loss function, it scales to millions of images. The proposed approach produces representations that perform on par with state-of-the-art unsupervised methods on ImageNet and Pascal VOC.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/noise-as-targets
torch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification