TL;DR
This paper introduces FDA, a new adversarial attack that disrupts deep neural network features at all layers, outperforming existing methods and affecting both classification and feature-based tasks without needing task-specific access.
Contribution
The paper proposes FDA, a novel feature-disruptive adversarial attack, along with new metrics OLNR and NLOR to evaluate attack damage, addressing limitations of existing methods.
Findings
FDA generates stronger adversaries than state-of-the-art methods.
FDA effectively disrupts feature representations across network layers.
FDA impacts feature-based tasks without network access.
Abstract
Though Deep Neural Networks (DNN) show excellent performance across various computer vision tasks, several works show their vulnerability to adversarial samples, i.e., image samples with imperceptible noise engineered to manipulate the network's prediction. Adversarial sample generation methods range from simple to complex optimization techniques. Majority of these methods generate adversaries through optimization objectives that are tied to the pre-softmax or softmax output of the network. In this work we, (i) show the drawbacks of such attacks, (ii) propose two new evaluation metrics: Old Label New Rank (OLNR) and New Label Old Rank (NLOR) in order to quantify the extent of damage made by an attack, and (iii) propose a new adversarial attack FDA: Feature Disruptive Attack, to address the drawbacks of existing attacks. FDA works by generating image perturbation that disrupt features at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax
