Weights Augmentation: it has never ever ever ever let her model down

Junbin Zhuang; Guiguang Din; Yunyi Yan

arXiv:2405.19590·cs.LG·May 31, 2024

Weights Augmentation: it has never ever ever ever let her model down

Junbin Zhuang, Guiguang Din, Yunyi Yan

PDF

Open Access 1 Repo

TL;DR

This paper introduces Weight Augmentation Strategy (WAS), a novel approach that applies random transformations to network weights during training, significantly improving accuracy and efficiency across various CNN architectures on CIFAR datasets.

Contribution

It proposes the concept of weight augmentation with dual modes, enhancing model robustness and accuracy without significant computational costs.

Findings

01

Models improved accuracy by up to 18.93% on CIFAR datasets.

02

DOM mode reduces FLOPs by up to 36.33%.

03

Applicable to multiple CNN architectures with minimal overhead.

Abstract

Weight play an essential role in deep learning network models. Unlike network structure design, this article proposes the concept of weight augmentation, focusing on weight exploration. The core of Weight Augmentation Strategy (WAS) is to adopt random transformed weight coefficients training and transformed coefficients, named Shadow Weight(SW), for networks that can be used to calculate loss function to affect parameter updates. However, stochastic gradient descent is applied to Plain Weight(PW), which is referred to as the original weight of the network before the random transformation. During training, numerous SW collectively form high-dimensional space, while PW is directly learned from the distribution of SW instead of the data. The weight of the accuracy-oriented mode(AOM) relies on PW, which guarantees the network is highly robust and accurate. The desire-oriented mode(DOM)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zlearh/weight-augmentation-technology
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDiet and metabolism studies

MethodsVGG-16