Implicit Equivariance in Convolutional Networks
Naman Khetan, Tushar Arora, Samee Ur Rehman, Deepak K. Gupta

TL;DR
This paper introduces Implicitly Equivariant Networks (IEN), a novel approach that induces transformation equivariance in standard CNNs through a multi-objective loss, outperforming steerable CNNs in accuracy and efficiency across various tasks.
Contribution
The paper proposes IEN, a simple, flexible method to embed equivariance in CNNs without complex basis functions, enabling heterogeneous feature groups and improved performance.
Findings
IEN outperforms steerable CNNs on multiple datasets.
IEN reduces channel count by over 30% while maintaining accuracy.
IEN achieves superior results in visual object tracking.
Abstract
Convolutional Neural Networks(CNN) are inherently equivariant under translations, however, they do not have an equivalent embedded mechanism to handle other transformations such as rotations and change in scale. Several approaches exist that make CNNs equivariant under other transformation groups by design. Among these, steerable CNNs have been especially effective. However, these approaches require redesigning standard networks with filters mapped from combinations of predefined basis involving complex analytical functions. We experimentally demonstrate that these restrictions in the choice of basis can lead to model weights that are sub-optimal for the primary deep learning task (e.g. classification). Moreover, such hard-baked explicit formulations make it difficult to design composite networks comprising heterogeneous feature groups. To circumvent such issues, we propose Implicitly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Advanced Image and Video Retrieval Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Residual Connection · Residual Block · Batch Normalization · Bottleneck Residual Block · Average Pooling · Global Average Pooling · Kaiming Initialization · Softmax
