TL;DR
This paper introduces Local Gradients Smoothing (LGS), a novel defense method that effectively mitigates localized adversarial attacks by smoothing high-frequency noise in gradient regions, outperforming existing defenses especially against BPDA attacks.
Contribution
The paper proposes LGS, a new gradient-based smoothing technique that enhances robustness of DNNs against localized adversarial attacks like LaVAN and patches, with minimal impact on salient features.
Findings
LGS outperforms other defenses on ImageNet against localized attacks.
LGS shows high resistance to BPDA attack.
LGS maintains classification accuracy on clean images.
Abstract
Deep neural networks (DNNs) have shown vulnerability to adversarial attacks, i.e., carefully perturbed inputs designed to mislead the network at inference time. Recently introduced localized attacks, Localized and Visible Adversarial Noise (LaVAN) and Adversarial patch, pose a new challenge to deep learning security by adding adversarial noise only within a specific region without affecting the salient objects in an image. Driven by the observation that such attacks introduce concentrated high-frequency changes at a particular image location, we have developed an effective method to estimate noise location in gradient domain and transform those high activation regions caused by adversarial noise in image domain while having minimal effect on the salient object that is important for correct classification. Our proposed Local Gradients Smoothing (LGS) scheme achieves this by regularizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
