Defending Adversarial Patches via Joint Region Localizing and Inpainting
Junwen Chen, Xingxing Wei

TL;DR
This paper introduces a joint region localizing and inpainting framework to defend against adversarial patch attacks on deep neural networks, improving robustness in traffic sign recognition tasks.
Contribution
The paper proposes a novel unified framework combining localizing and inpainting modules, trained iteratively, to effectively detect and recover from adversarial patches.
Findings
Effective detection of adversarial patches in images.
Improved robustness in traffic sign classification and detection.
Joint training enhances the interaction between localization and inpainting.
Abstract
Deep neural networks are successfully used in various applications, but show their vulnerability to adversarial examples. With the development of adversarial patches, the feasibility of attacks in physical scenes increases, and the defenses against patch attacks are urgently needed. However, defending such adversarial patch attacks is still an unsolved problem. In this paper, we analyse the properties of adversarial patches, and find that: on the one hand, adversarial patches will lead to the appearance or contextual inconsistency in the target objects; on the other hand, the patch region will show abnormal changes on the high-level feature maps of the objects extracted by a backbone network. Considering the above two points, we propose a novel defense method based on a ``localizing and inpainting" mechanism to pre-process the input examples. Specifically, we design an unified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
