Unsupervised Learning of Invariant Representations in Hierarchical Architectures
Fabio Anselmi, Joel Z. Leibo, Lorenzo Rosasco, Jim Mutch, Andrea, Tacchetti, Tomaso Poggio

TL;DR
This paper proposes an unsupervised hierarchical approach to learn invariant, stable, and discriminative visual representations, reducing sample complexity and aligning with biological visual processing, applicable to various domains.
Contribution
It introduces a theory for unsupervised learning of invariant image signatures using templates and hierarchical modules, extending deep learning architectures and inspired by visual cortex functions.
Findings
Invariant signatures computed via empirical distributions of dot-products.
Hierarchical modules inherit invariance, stability, and discriminability.
Theory applies to visual object recognition and other domains.
Abstract
The present phase of Machine Learning is characterized by supervised learning algorithms relying on large sets of labeled examples (). The next phase is likely to focus on algorithms capable of learning from very few labeled examples (), like humans seem able to do. We propose an approach to this problem and describe the underlying theory, based on the unsupervised, automatic learning of a ``good'' representation for supervised learning, characterized by small sample complexity (). We consider the case of visual object recognition though the theory applies to other domains. The starting point is the conjecture, proved in specific cases, that image representations which are invariant to translations, scaling and other transformations can considerably reduce the sample complexity of learning. We prove that an invariant and unique (discriminative) signature can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Face Recognition and Perception · Visual perception and processing mechanisms
