TL;DR
WaveCNet integrates wavelet transforms into CNNs to effectively suppress aliasing effects, enhancing noise robustness and accuracy in image classification and object detection tasks.
Contribution
The paper introduces a novel wavelet-based down-sampling method for CNNs, improving noise and adversarial robustness while maintaining high accuracy.
Findings
WaveCNet achieves higher ImageNet accuracy than vanilla CNNs.
WaveCNet demonstrates improved robustness on noisy and adversarially attacked images.
Object detectors using WaveCNet backbones perform better on COCO dataset.
Abstract
Though widely used in image classification, convolutional neural networks (CNNs) are prone to noise interruptions, i.e. the CNN output can be drastically changed by small image noise. To improve the noise robustness, we try to integrate CNNs with wavelet by replacing the common down-sampling (max-pooling, strided-convolution, and average pooling) with discrete wavelet transform (DWT). We firstly propose general DWT and inverse DWT (IDWT) layers applicable to various orthogonal and biorthogonal discrete wavelets like Haar, Daubechies, and Cohen, etc., and then design wavelet integrated CNNs (WaveCNets) by integrating DWT into the commonly used CNNs (VGG, ResNets, and DenseNet). During the down-sampling, WaveCNets apply DWT to decompose the feature maps into the low-frequency and high-frequency components. Containing the main information including the basic object structures, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRegion Proposal Network · Softmax · RoIPool · Convolution · Faster R-CNN
