Early Visual Concept Learning with Unsupervised Deep Learning

Irina Higgins; Loic Matthey; Xavier Glorot; Arka Pal; Benigno Uria,; Charles Blundell; Shakir Mohamed; Alexander Lerchner

arXiv:1606.05579·stat.ML·September 21, 2016·127 cites

Early Visual Concept Learning with Unsupervised Deep Learning

Irina Higgins, Loic Matthey, Xavier Glorot, Arka Pal, Benigno Uria,, Charles Blundell, Shakir Mohamed, Alexander Lerchner

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents an unsupervised deep learning method inspired by neuroscience to learn disentangled visual concepts from raw images, enabling zero-shot inference and understanding of objectness.

Contribution

It introduces a variational autoencoder framework that enforces redundancy reduction and statistical independence to learn disentangled factors without supervision.

Findings

01

Effective across various datasets

02

Enables zero-shot inference

03

Develops an intuitive understanding of objectness

Abstract

Automated discovery of early visual concepts from raw image data is a major open challenge in AI research. Addressing this problem, we propose an unsupervised approach for learning disentangled representations of the underlying factors of variation. We draw inspiration from neuroscience, and show how this can be achieved in an unsupervised generative model by applying the same learning pressures as have been suggested to act in the ventral visual stream in the brain. By enforcing redundancy reduction, encouraging statistical independence, and exposure to data with transform continuities analogous to those to which human infants are exposed, we obtain a variational autoencoder (VAE) framework capable of learning disentangled factors. Our approach makes few assumptions and works well across a wide variety of datasets. Furthermore, our solution has useful emergent properties, such as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

takuseno/beta-vae
tf

Videos

Variational Autoencoders· youtube

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning