Learning Equivariant Representations
Carlos Esteves

TL;DR
This paper develops models that incorporate known symmetries like rotation and scaling into deep learning architectures, enhancing their efficiency and generalization, especially in data-limited or rotation-perturbed scenarios.
Contribution
It introduces several equivariant neural network models for different symmetry groups, extending the concept of shift-equivariance in CNNs to rotations, scales, and 3D transformations.
Findings
Achieves equivariance to various transformation groups.
Improves performance on tasks with limited data or input perturbations.
Reduces sample complexity and enhances generalization.
Abstract
State-of-the-art deep learning systems often require large amounts of data and computation. For this reason, leveraging known or unknown structure of the data is paramount. Convolutional neural networks (CNNs) are successful examples of this principle, their defining characteristic being the shift-equivariance. By sliding a filter over the input, when the input shifts, the response shifts by the same amount, exploiting the structure of natural images where semantic content is independent of absolute pixel positions. This property is essential to the success of CNNs in audio, image and video recognition tasks. In this thesis, we extend equivariance to other kinds of transformations, such as rotation and scaling. We propose equivariant models for different transformations defined by groups of symmetries. The main contributions are (i) polar transformer networks, achieving equivariance to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Image Retrieval and Classification Techniques · Image Processing and 3D Reconstruction
