TL;DR
This paper introduces Adversarial Model Cascades (AMC), a training method that enhances neural network robustness against multiple adversarial attacks across complex datasets, while maintaining accuracy on normal inputs.
Contribution
The paper proposes AMC, a sequential training approach that creates a single model resilient to various attacks, addressing limitations of existing methods that target specific attacks or simple datasets.
Findings
AMC improves robustness by 6.225% on MNIST
AMC increases robustness by 5.075% on SVHN
AMC enhances robustness by 2.65% on CIFAR-10
Abstract
Deep neural networks (DNNs) are vulnerable to malicious inputs crafted by an adversary to produce erroneous outputs. Works on securing neural networks against adversarial examples achieve high empirical robustness on simple datasets such as MNIST. However, these techniques are inadequate when empirically tested on complex data sets such as CIFAR-10 and SVHN. Further, existing techniques are designed to target specific attacks and fail to generalize across attacks. We propose the Adversarial Model Cascades (AMC) as a way to tackle the above inadequacies. Our approach trains a cascade of models sequentially where each model is optimized to be robust towards a mixture of multiple attacks. Ultimately, it yields a single model which is secure against a wide range of attacks; namely FGSM, Elastic, Virtual Adversarial Perturbations and Madry. On an average, AMC increases the model's empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
