Unsupervised Learning of Invariant Representations in Hierarchical   Architectures

Fabio Anselmi; Joel Z. Leibo; Lorenzo Rosasco; Jim Mutch; Andrea; Tacchetti; Tomaso Poggio

arXiv:1311.4158·cs.CV·March 12, 2014·65 cites

Unsupervised Learning of Invariant Representations in Hierarchical Architectures

Fabio Anselmi, Joel Z. Leibo, Lorenzo Rosasco, Jim Mutch, Andrea, Tacchetti, Tomaso Poggio

PDF

Open Access

TL;DR

This paper proposes an unsupervised hierarchical approach to learn invariant, stable, and discriminative visual representations, reducing sample complexity and aligning with biological visual processing, applicable to various domains.

Contribution

It introduces a theory for unsupervised learning of invariant image signatures using templates and hierarchical modules, extending deep learning architectures and inspired by visual cortex functions.

Findings

01

Invariant signatures computed via empirical distributions of dot-products.

02

Hierarchical modules inherit invariance, stability, and discriminability.

03

Theory applies to visual object recognition and other domains.

Abstract

The present phase of Machine Learning is characterized by supervised learning algorithms relying on large sets of labeled examples ( $n \to \infty$ ). The next phase is likely to focus on algorithms capable of learning from very few labeled examples ( $n \to 1$ ), like humans seem able to do. We propose an approach to this problem and describe the underlying theory, based on the unsupervised, automatic learning of a ``good'' representation for supervised learning, characterized by small sample complexity ( $n$ ). We consider the case of visual object recognition though the theory applies to other domains. The starting point is the conjecture, proved in specific cases, that image representations which are invariant to translations, scaling and other transformations can considerably reduce the sample complexity of learning. We prove that an invariant and unique (discriminative) signature can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Face Recognition and Perception · Visual perception and processing mechanisms