Defense against Universal Adversarial Perturbations
Naveed Akhtar, Jian Liu, Ajmal Mian

TL;DR
This paper introduces a novel defense framework against universal adversarial perturbations by using a Perturbation Rectifying Network and a detection mechanism, significantly improving robustness without modifying the original classifier.
Contribution
It proposes the first dedicated framework employing a PRN and a perturbation detector to defend against universal adversarial perturbations effectively.
Findings
Achieves up to 97.5% success rate in defending against unseen perturbations.
PRN generalizes across different network architectures.
Effective detection and rectification of universal adversarial perturbations.
Abstract
Recent advances in Deep Learning show the existence of image-agnostic quasi-imperceptible perturbations that when applied to `any' image can fool a state-of-the-art network classifier to change its prediction about the image label. These `Universal Adversarial Perturbations' pose a serious threat to the success of Deep Learning in practice. We present the first dedicated framework to effectively defend the networks against such perturbations. Our approach learns a Perturbation Rectifying Network (PRN) as `pre-input' layers to a targeted model, such that the targeted model needs no modification. The PRN is learned from real and synthetic image-agnostic perturbations, where an efficient method to compute the latter is also proposed. A perturbation detector is separately trained on the Discrete Cosine Transform of the input-output difference of the PRN. A query image is first passed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Anomaly Detection Techniques and Applications
MethodsDiscrete Cosine Transform
