Jujutsu: A Two-stage Defense against Adversarial Patch Attacks on Deep Neural Networks
Zitao Chen, Pritam Dash, Karthik Pattabiraman

TL;DR
Jujutsu is a two-stage defense method that detects and mitigates adversarial patch attacks on deep neural networks by identifying localized features and reconstructing clean inputs using GANs, significantly improving robustness.
Contribution
The paper introduces Jujutsu, a novel two-stage defense combining attack detection and localized input reconstruction to counter robust adversarial patches.
Findings
Jujutsu outperforms existing defenses across multiple datasets and models.
It effectively detects adversarial patches with low false positives.
Jujutsu maintains robustness against physical-world and adaptive attacks.
Abstract
Adversarial patch attacks create adversarial examples by injecting arbitrary distortions within a bounded region of the input to fool deep neural networks (DNNs). These attacks are robust (i.e., physically-realizable) and universally malicious, and hence represent a severe security threat to real-world DNN-based systems. We propose Jujutsu, a two-stage technique to detect and mitigate robust and universal adversarial patch attacks. We first observe that adversarial patches are crafted as localized features that yield large influence on the prediction output, and continue to dominate the prediction on any input. Jujutsu leverages this observation for accurate attack detection with low false positives. Patch attacks corrupt only a localized region of the input, while the majority of the input remains unperturbed. Therefore, Jujutsu leverages generative adversarial networks (GAN) to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
MethodsInpainting
