AVT: Unsupervised Learning of Transformation Equivariant Representations by Autoencoding Variational Transformations
Guo-Jun Qi, Liheng Zhang, Chang Wen Chen, Qi Tian

TL;DR
This paper introduces AVT, an unsupervised method for learning transformation-equivariant representations that capture intrinsic visual structures under various transformations, using a variational autoencoding approach.
Contribution
It proposes a novel unsupervised framework, AVT, that learns generalized nonlinear transformation-equivariant representations via mutual information maximization.
Findings
Sets new state-of-the-art performance on unsupervised tasks
Closes the gap between unsupervised and supervised models
Demonstrates effectiveness across various transformations
Abstract
The learning of Transformation-Equivariant Representations (TERs), which is introduced by Hinton et al. \cite{hinton2011transforming}, has been considered as a principle to reveal visual structures under various transformations. It contains the celebrated Convolutional Neural Networks (CNNs) as a special case that only equivary to the translations. In contrast, we seek to train TERs for a generic class of transformations and train them in an {\em unsupervised} fashion. To this end, we present a novel principled method by Autoencoding Variational Transformations (AVT), compared with the conventional approach to autoencoding data. Formally, given transformed images, the AVT seeks to train the networks by maximizing the mutual information between the transformations and representations. This ensures the resultant TERs of individual images contain the {\em intrinsic} information about their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Digital Imaging for Blood Diseases
