AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transformations rather than Data
Liheng Zhang, Guo-Jun Qi, Liqiang Wang, Jiebo Luo

TL;DR
This paper introduces a new unsupervised learning paradigm called Auto-Encoding Transformation (AET), which predicts transformations from encoded features, leading to improved representations that outperform existing methods on standard datasets.
Contribution
The paper proposes AET, a novel unsupervised learning approach that encodes visual structures to predict transformations, expanding the scope of transformations beyond traditional data auto-encoding.
Findings
AET achieves state-of-the-art results on CIFAR-10, ImageNet, and Places datasets.
AET outperforms existing unsupervised methods significantly.
AET approaches the performance of fully supervised models.
Abstract
The success of deep neural networks often relies on a large amount of labeled examples, which can be difficult to obtain in many real scenarios. To address this challenge, unsupervised methods are strongly preferred for training neural networks without using any labeled data. In this paper, we present a novel paradigm of unsupervised representation learning by Auto-Encoding Transformation (AET) in contrast to the conventional Auto-Encoding Data (AED) approach. Given a randomly sampled transformation, AET seeks to predict it merely from the encoded features as accurately as possible at the output end. The idea is the following: as long as the unsupervised features successfully encode the essential information about the visual structures of original and transformed images, the transformation can be well predicted. We will show that this AET paradigm allows us to instantiate a large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
