Jacobian Ensembles Improve Robustness Trade-offs to Adversarial Attacks
Kenneth T. Co, David Martinez-Rego, Zhongyuan Hau, Emil C. Lupu

TL;DR
This paper introduces Jacobian Ensembles, a novel method combining Jacobian regularization and model ensembles, to enhance neural network robustness against universal adversarial perturbations while maintaining high accuracy.
Contribution
The paper proposes Jacobian Ensembles, a new approach that significantly improves robustness to UAPs without sacrificing accuracy, outperforming existing methods.
Findings
Achieves higher robustness against UAPs.
Maintains or improves model accuracy.
Outperforms previous robustness methods.
Abstract
Deep neural networks have become an integral part of our software infrastructure and are being deployed in many widely-used and safety-critical applications. However, their integration into many systems also brings with it the vulnerability to test time attacks in the form of Universal Adversarial Perturbations (UAPs). UAPs are a class of perturbations that when applied to any input causes model misclassification. Although there is an ongoing effort to defend models against these adversarial attacks, it is often difficult to reconcile the trade-offs in model accuracy and robustness to adversarial attacks. Jacobian regularization has been shown to improve the robustness of models against UAPs, whilst model ensembles have been widely adopted to improve both predictive performance and model robustness. In this work, we propose a novel approach, Jacobian Ensembles-a combination of Jacobian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
