AVT: Unsupervised Learning of Transformation Equivariant Representations   by Autoencoding Variational Transformations

Guo-Jun Qi; Liheng Zhang; Chang Wen Chen; Qi Tian

arXiv:1903.10863·cs.CV·July 24, 2019·5 cites

AVT: Unsupervised Learning of Transformation Equivariant Representations by Autoencoding Variational Transformations

Guo-Jun Qi, Liheng Zhang, Chang Wen Chen, Qi Tian

PDF

Open Access 1 Repo

TL;DR

This paper introduces AVT, an unsupervised method for learning transformation-equivariant representations that capture intrinsic visual structures under various transformations, using a variational autoencoding approach.

Contribution

It proposes a novel unsupervised framework, AVT, that learns generalized nonlinear transformation-equivariant representations via mutual information maximization.

Findings

01

Sets new state-of-the-art performance on unsupervised tasks

02

Closes the gap between unsupervised and supervised models

03

Demonstrates effectiveness across various transformations

Abstract

The learning of Transformation-Equivariant Representations (TERs), which is introduced by Hinton et al. \cite{hinton2011transforming}, has been considered as a principle to reveal visual structures under various transformations. It contains the celebrated Convolutional Neural Networks (CNNs) as a special case that only equivary to the translations. In contrast, we seek to train TERs for a generic class of transformations and train them in an {\em unsupervised} fashion. To this end, we present a novel principled method by Autoencoding Variational Transformations (AVT), compared with the conventional approach to autoencoding data. Formally, given transformed images, the AVT seeks to train the networks by maximizing the mutual information between the transformations and representations. This ensures the resultant TERs of individual images contain the {\em intrinsic} information about their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maple-research-lab/avt-pytorch
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Digital Imaging for Blood Diseases