TL;DR
This paper introduces a Jacobian regularization method to improve deep neural networks' robustness against adversarial attacks, achieving enhanced security with minimal impact on accuracy.
Contribution
The paper proposes a novel Jacobian regularization technique based on the Frobenius norm to enhance DNN robustness post-training.
Findings
Increased adversarial robustness demonstrated empirically.
Minimal loss in original network accuracy.
Effective as a post-processing step.
Abstract
Deep neural networks have lately shown tremendous performance in various applications including vision and speech processing tasks. However, alongside their ability to perform these tasks with such high accuracy, it has been shown that they are highly susceptible to adversarial attacks: a small change in the input would cause the network to err with high confidence. This phenomenon exposes an inherent fault in these networks and their ability to generalize well. For this reason, providing robustness to adversarial attacks is an important challenge in networks training, which has led to extensive research. In this work, we suggest a theoretically inspired novel approach to improve the networks' robustness. Our method applies regularization using the Frobenius norm of the Jacobian of the network, which is applied as post-processing, after regular training has finished. We demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
