Adversarial Training: embedding adversarial perturbations into the parameter space of a neural network to build a robust system
Shixian Wen, Laurent Itti

TL;DR
This paper proposes a novel adversarial training method that embeds dynamic adversarial perturbations into the neural network's parameters, reducing costs and improving robustness and diversity of adversarial defenses.
Contribution
Introducing parameter space perturbations for adversarial training, enabling automatic creation of adversarial examples with minimal additional cost.
Findings
Achieves adversarial robustness with negligible extra computational cost.
Alleviates accuracy trade-offs between clean and adversarial examples.
Increases diversity of adversarial perturbations.
Abstract
Adversarial training, in which a network is trained on both adversarial and clean examples, is one of the most trusted defense methods against adversarial attacks. However, there are three major practical difficulties in implementing and deploying this method - expensive in terms of extra memory and computation costs; accuracy trade-off between clean and adversarial examples; and lack of diversity of adversarial perturbations. Classical adversarial training uses fixed, precomputed perturbations in adversarial examples (input space). In contrast, we introduce dynamic adversarial perturbations into the parameter space of the network, by adding perturbation biases to the fully connected layers of deep convolutional neural network. During training, using only clean images, the perturbation biases are updated in the Fast Gradient Sign Direction to automatically create and store adversarial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
