Learning Invariant Representations with Local Transformations
Kihyuk Sohn (University of Michigan), Honglak Lee (University of, Michigan)

TL;DR
This paper introduces a new framework for learning invariant features by integrating linear transformations into existing algorithms, improving robustness and performance across various image and audio classification tasks.
Contribution
The paper proposes a transformation-invariant feature learning framework applicable to multiple unsupervised methods, including RBMs, autoencoders, and sparse coding, demonstrating broad applicability.
Findings
Achieves state-of-the-art results on TIMIT phone classification.
Shows competitive performance on image classification benchmarks.
Extends invariant learning to various unsupervised models.
Abstract
Learning invariant representations is an important problem in machine learning and pattern recognition. In this paper, we present a novel framework of transformation-invariant feature learning by incorporating linear transformations into the feature learning algorithms. For example, we present the transformation-invariant restricted Boltzmann machine that compactly represents data by its weights and their transformations, which achieves invariance of the feature representation via probabilistic max pooling. In addition, we show that our transformation-invariant feature learning framework can also be extended to other unsupervised learning methods, such as autoencoders or sparse coding. We evaluate our method on several image classification benchmark datasets, such as MNIST variations, CIFAR-10, and STL-10, and show competitive or superior classification performance when compared to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Music and Audio Processing
