Equivariant Transformer Networks
Kai Sheng Tai, Peter Bailis, Gregory Valiant

TL;DR
Equivariant Transformers (ETs) are a new neural network architecture that incorporate transformation invariances directly into their structure, enhancing robustness and sample efficiency in image classification tasks.
Contribution
The paper introduces Equivariant Transformers, a novel differentiable mapping framework that embeds transformation equivariance into neural networks using canonical coordinate systems.
Findings
ETs improve model robustness to complex transformations.
ETs increase sample efficiency in limited data scenarios.
ETs achieve up to 15% error rate reduction with minimal parameter increase.
Abstract
How can prior knowledge on the transformation invariances of a domain be incorporated into the architecture of a neural network? We propose Equivariant Transformers (ETs), a family of differentiable image-to-image mappings that improve the robustness of models towards pre-defined continuous transformation groups. Through the use of specially-derived canonical coordinate systems, ETs incorporate functions that are equivariant by construction with respect to these transformations. We show empirically that ETs can be flexibly composed to improve model robustness towards more complicated transformation groups in several parameters. On a real-world image classification task, ETs improve the sample efficiency of ResNet classifiers, achieving relative improvements in error rate of up to 15% in the limited data regime while increasing model parameter count by less than 1%.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection
