Efficient Two-Step Adversarial Defense for Deep Neural Networks
Ting-Jui Chang, Yukun He, Peng Li

TL;DR
This paper introduces a two-step adversarial training method that achieves robustness comparable to multi-step attacks but with significantly reduced computational costs, enhancing the practicality of defending deep neural networks.
Contribution
The authors propose a novel two-step adversarial example generation technique that balances robustness and computational efficiency, improving upon existing adversarial training methods.
Findings
Effective against various attack methods
Achieves robustness similar to multi-step adversarial training
Reduces computational cost to approximately two FGSM runs
Abstract
In recent years, deep neural networks have demonstrated outstanding performance in many machine learning tasks. However, researchers have discovered that these state-of-the-art models are vulnerable to adversarial examples: legitimate examples added by small perturbations which are unnoticeable to human eyes. Adversarial training, which augments the training data with adversarial examples during the training process, is a well known defense to improve the robustness of the model against adversarial attacks. However, this robustness is only effective to the same attack method used for adversarial training. Madry et al.(2017) suggest that effectiveness of iterative multi-step adversarial attacks and particularly that projected gradient descent (PGD) may be considered the universal first order adversary and applying the adversarial training with PGD implies resistance against many other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · High-Velocity Impact and Material Behavior
