Improving Generalization of Batch Whitening by Convolutional Unit Optimization
Yooshin Cho, Hanbyel Cho, Youngsoo Kim, Junmo Kim

TL;DR
This paper introduces a new convolutional unit aligned with theory to enhance Batch Whitening's effectiveness, demonstrating improved stability and accuracy across multiple image classification datasets.
Contribution
A novel convolutional unit design that improves Batch Whitening performance and stability, validated with off-the-shelf whitening modules like IterNorm.
Findings
Significant accuracy improvements on five datasets.
Enhanced training stability with larger learning rates.
Better performance with increased whitening iterations.
Abstract
Batch Whitening is a technique that accelerates and stabilizes training by transforming input features to have a zero mean (Centering) and a unit variance (Scaling), and by removing linear correlation between channels (Decorrelation). In commonly used structures, which are empirically optimized with Batch Normalization, the normalization layer appears between convolution and activation function. Following Batch Whitening studies have employed the same structure without further analysis; even Batch Whitening was analyzed on the premise that the input of a linear layer is whitened. To bridge the gap, we propose a new Convolutional Unit that is in line with the theory, and our method generally improves the performance of Batch Whitening. Moreover, we show the inefficacy of the original Convolutional Unit by investigating rank and correlation of features. As our method is employable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
MethodsConvolution · Batch Normalization · Linear Layer
