From CNNs to Shift-Invariant Twin Models Based on Complex Wavelets
Hubert Leterme, K\'evin Polisano, Val\'erie Perrier, Karteek Alahari

TL;DR
This paper introduces a shift-invariant neural network architecture using complex wavelets, improving accuracy and efficiency by replacing traditional max pooling with complex convolutions constrained to Gabor-like kernels.
Contribution
The authors propose a novel complex wavelet-based method replacing max pooling in CNNs, enhancing shift invariance and accuracy while reducing computational costs.
Findings
Achieves superior accuracy on ImageNet and CIFAR-10
Maintains high-frequency details for better information preservation
Lower computational cost and memory footprint
Abstract
We propose a novel method to increase shift invariance and prediction accuracy in convolutional neural networks. Specifically, we replace the first-layer combination "real-valued convolutions + max pooling" (RMax) by "complex-valued convolutions + modulus" (CMod), which is stable to translations, or shifts. To justify our approach, we claim that CMod and RMax produce comparable outputs when the convolution kernel is band-pass and oriented (Gabor-like filter). In this context, CMod can therefore be considered as a stable alternative to RMax. To enforce this property, we constrain the convolution kernels to adopt such a Gabor-like structure. The corresponding architecture is called mathematical twin, because it employs a well-defined mathematical operator to mimic the behavior of the original, freely-trained model. Our approach achieves superior accuracy on ImageNet and CIFAR-10…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification
Methods*Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Average Pooling · Residual Connection · Bottleneck Residual Block · Batch Normalization · Kaiming Initialization · Max Pooling · Residual Block · Convolution
