Building high-level features using large scale unsupervised learning

Quoc V. Le; Marc'Aurelio Ranzato; Rajat Monga; Matthieu Devin; Kai; Chen; Greg S. Corrado; Jeff Dean; Andrew Y. Ng

arXiv:1112.6209·cs.LG·April 17, 2017·179 cites

Building high-level features using large scale unsupervised learning

Quoc V. Le, Marc'Aurelio Ranzato, Rajat Monga, Matthieu Devin, Kai, Chen, Greg S. Corrado, Jeff Dean, Andrew Y. Ng

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that high-level, class-specific features like face detectors can be learned from unlabeled data using large-scale unsupervised learning, significantly improving object recognition performance.

Contribution

It introduces a large-scale unsupervised learning approach with a deep autoencoder trained on internet images, enabling the discovery of high-level features without labels.

Findings

01

Successfully learned face and object detectors from unlabeled data.

02

Features are robust to translation, scaling, and rotation.

03

Achieved 15.8% accuracy on 20,000 ImageNet categories, a 70% improvement.

Abstract

We consider the problem of building high-level, class-specific feature detectors from only unlabeled data. For example, is it possible to learn a face detector using only unlabeled images? To answer this, we train a 9-layered locally connected sparse autoencoder with pooling and local contrast normalization on a large dataset of images (the model has 1 billion connections, the dataset has 10 million 200x200 pixel images downloaded from the Internet). We train this network using model parallelism and asynchronous SGD on a cluster with 1,000 machines (16,000 cores) for three days. Contrary to what appears to be a widely-held intuition, our experimental results reveal that it is possible to train a face detector without having to label images as containing a face or not. Control experiments show that this feature detector is robust not only to translation but also to scaling and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

greenelab/tybalt
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques

MethodsSparse Autoencoder · Solana Customer Service Number +1-833-534-1729