Transformational Sparse Coding
Dimitrios C. Gklezakos, Rajesh P. N. Rao

TL;DR
This paper introduces an unsupervised sparse coding model that learns object features along with their affine transformations, promoting equivariance and reducing model complexity compared to traditional methods.
Contribution
It presents a novel sparse coding approach that jointly learns features and transformations directly from images, enhancing invariance without exploding model size.
Findings
Matches traditional sparse coding in reconstruction quality
Learns transformations directly from natural images
Uses fewer degrees of freedom than existing methods
Abstract
A fundamental problem faced by object recognition systems is that objects and their features can appear in different locations, scales and orientations. Current deep learning methods attempt to achieve invariance to local translations via pooling, discarding the locations of features in the process. Other approaches explicitly learn transformed versions of the same feature, leading to representations that quickly explode in size. Instead of discarding the rich and useful information about feature transformations to achieve invariance, we argue that models should learn object features conjointly with their transformations to achieve equivariance. We propose a new model of unsupervised learning based on sparse coding that can learn object features jointly with their affine transformations directly from images. Results based on learning from natural images indicate that our approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural dynamics and brain function · Face Recognition and Perception · Sparse and Compressive Sensing Techniques
