Learning Invariant Representations with Local Transformations

Kihyuk Sohn (University of Michigan); Honglak Lee (University of; Michigan)

arXiv:1206.6418·cs.LG·July 3, 2012·ICML·101 cites

Learning Invariant Representations with Local Transformations

Kihyuk Sohn (University of Michigan), Honglak Lee (University of, Michigan)

PDF

Open Access

TL;DR

This paper introduces a new framework for learning invariant features by integrating linear transformations into existing algorithms, improving robustness and performance across various image and audio classification tasks.

Contribution

The paper proposes a transformation-invariant feature learning framework applicable to multiple unsupervised methods, including RBMs, autoencoders, and sparse coding, demonstrating broad applicability.

Findings

01

Achieves state-of-the-art results on TIMIT phone classification.

02

Shows competitive performance on image classification benchmarks.

03

Extends invariant learning to various unsupervised models.

Abstract

Learning invariant representations is an important problem in machine learning and pattern recognition. In this paper, we present a novel framework of transformation-invariant feature learning by incorporating linear transformations into the feature learning algorithms. For example, we present the transformation-invariant restricted Boltzmann machine that compactly represents data by its weights and their transformations, which achieves invariance of the feature representation via probabilistic max pooling. In addition, we show that our transformation-invariant feature learning framework can also be extended to other unsupervised learning methods, such as autoencoders or sparse coding. We evaluate our method on several image classification benchmark datasets, such as MNIST variations, CIFAR-10, and STL-10, and show competitive or superior classification performance when compared to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Music and Audio Processing