$L_p$-norm Distortion-Efficient Adversarial Attack
Chao Zhou, Yuan-Gen Wang, Zi-jia Wang, Xiangui Kang

TL;DR
This paper introduces a novel $L_p$-norm adversarial attack that minimizes both $L_2$ and $L_0$ distortions, producing more imperceptible adversarial examples with improved robustness.
Contribution
It proposes a new optimization scheme combining $L_2$-norm minimization with a dimension unimportance matrix and threshold to reduce $L_0$-norm distortion in adversarial attacks.
Findings
Lower $L_0$ and $L_2$ distortions than state-of-the-art methods.
Achieves significant reduction in $L_2$-norm distortion on MNIST.
Maintains high imperceptibility with 47% pixels unattacked on MNIST.
Abstract
Adversarial examples have shown a powerful ability to make a well-trained model misclassified. Current mainstream adversarial attack methods only consider one of the distortions among -norm, -norm, and -norm. -norm based methods cause large modification on a single pixel, resulting in naked-eye visible detection, while -norm and -norm based methods suffer from weak robustness against adversarial defense since they always diffuse tiny perturbations to all pixels. A more realistic adversarial perturbation should be sparse and imperceptible. In this paper, we propose a novel -norm distortion-efficient adversarial attack, which not only owns the least -norm loss but also significantly reduces the -norm distortion. To this aim, we design a new optimization scheme, which first optimizes an initial adversarial perturbation under -norm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Melanoma and MAPK Pathways · Smart Grid Security and Resilience
MethodsSparse Evolutionary Training
