Adversarial Parameter Attack on Deep Neural Networks

Lijia Yu; Yihan Wang; Xiao-Shan Gao

arXiv:2203.10502·cs.LG·March 22, 2022·1 cites

Adversarial Parameter Attack on Deep Neural Networks

Lijia Yu, Yihan Wang, Xiao-Shan Gao

PDF

Open Access

TL;DR

This paper introduces a novel adversarial parameter attack on deep neural networks that subtly alters parameters to significantly reduce robustness without decreasing accuracy, making attacks harder to detect.

Contribution

It proposes a new, more effective parameter perturbation attack method and provides an algorithm to generate adversarial parameters that lower robustness while maintaining accuracy.

Findings

01

Adversarial parameters can drastically reduce DNN robustness.

02

The attack is more difficult to detect than previous methods.

03

Effective training algorithms for adversarial parameters are demonstrated.

Abstract

In this paper, a new parameter perturbation attack on DNNs, called adversarial parameter attack, is proposed, in which small perturbations to the parameters of the DNN are made such that the accuracy of the attacked DNN does not decrease much, but its robustness becomes much lower. The adversarial parameter attack is stronger than previous parameter perturbation attacks in that the attack is more difficult to be recognized by users and the attacked DNN gives a wrong label for any modified sample input with high probability. The existence of adversarial parameters is proved. For a DNN $F_{Θ}$ with the parameter set $Θ$ satisfying certain conditions, it is shown that if the depth of the DNN is sufficiently large, then there exists an adversarial parameter set $Θ_{a}$ for $Θ$ such that the accuracy of $F_{Θ_{a}}$ is equal to that of $F_{Θ}$ , but the robustness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Physical Unclonable Functions (PUFs) and Hardware Security