Adversarial Parameter Attack on Deep Neural Networks
Lijia Yu, Yihan Wang, Xiao-Shan Gao

TL;DR
This paper introduces a novel adversarial parameter attack on deep neural networks that subtly alters parameters to significantly reduce robustness without decreasing accuracy, making attacks harder to detect.
Contribution
It proposes a new, more effective parameter perturbation attack method and provides an algorithm to generate adversarial parameters that lower robustness while maintaining accuracy.
Findings
Adversarial parameters can drastically reduce DNN robustness.
The attack is more difficult to detect than previous methods.
Effective training algorithms for adversarial parameters are demonstrated.
Abstract
In this paper, a new parameter perturbation attack on DNNs, called adversarial parameter attack, is proposed, in which small perturbations to the parameters of the DNN are made such that the accuracy of the attacked DNN does not decrease much, but its robustness becomes much lower. The adversarial parameter attack is stronger than previous parameter perturbation attacks in that the attack is more difficult to be recognized by users and the attacked DNN gives a wrong label for any modified sample input with high probability. The existence of adversarial parameters is proved. For a DNN with the parameter set satisfying certain conditions, it is shown that if the depth of the DNN is sufficiently large, then there exists an adversarial parameter set for such that the accuracy of is equal to that of , but the robustness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Physical Unclonable Functions (PUFs) and Hardware Security
