Weights Augmentation: it has never ever ever ever let her model down
Junbin Zhuang, Guiguang Din, Yunyi Yan

TL;DR
This paper introduces Weight Augmentation Strategy (WAS), a novel approach that applies random transformations to network weights during training, significantly improving accuracy and efficiency across various CNN architectures on CIFAR datasets.
Contribution
It proposes the concept of weight augmentation with dual modes, enhancing model robustness and accuracy without significant computational costs.
Findings
Models improved accuracy by up to 18.93% on CIFAR datasets.
DOM mode reduces FLOPs by up to 36.33%.
Applicable to multiple CNN architectures with minimal overhead.
Abstract
Weight play an essential role in deep learning network models. Unlike network structure design, this article proposes the concept of weight augmentation, focusing on weight exploration. The core of Weight Augmentation Strategy (WAS) is to adopt random transformed weight coefficients training and transformed coefficients, named Shadow Weight(SW), for networks that can be used to calculate loss function to affect parameter updates. However, stochastic gradient descent is applied to Plain Weight(PW), which is referred to as the original weight of the network before the random transformation. During training, numerous SW collectively form high-dimensional space, while PW is directly learned from the distribution of SW instead of the data. The weight of the accuracy-oriented mode(AOM) relies on PW, which guarantees the network is highly robust and accurate. The desire-oriented mode(DOM)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDiet and metabolism studies
MethodsVGG-16
