T-BFA: Targeted Bit-Flip Adversarial Weight Attack

Adnan Siraj Rakin; Zhezhi He; Jingtao Li; Fan Yao; Chaitali; Chakrabarti; Deliang Fan

arXiv:2007.12336·cs.LG·January 11, 2021·30 cites

T-BFA: Targeted Bit-Flip Adversarial Weight Attack

Adnan Siraj Rakin, Zhezhi He, Jingtao Li, Fan Yao, Chaitali, Chakrabarti, Deliang Fan

PDF

Open Access 2 Repos

TL;DR

This paper introduces T-BFA, a novel targeted bit-flip adversarial attack on DNN weights, capable of misleading specific inputs to desired outputs with minimal bit modifications, demonstrated on multiple architectures and real hardware.

Contribution

It presents the first targeted weight attack method (T-BFA) that selectively flips weight bits to mislead specific inputs, advancing security analysis of DNNs against weight perturbations.

Findings

01

Achieves 100% attack success rate on ResNet-18 for targeted misclassification.

02

Flips only 27 out of 88 million weight bits to succeed.

03

Demonstrates attack feasibility on real hardware system.

Abstract

Traditional Deep Neural Network (DNN) security is mostly related to the well-known adversarial input example attack. Recently, another dimension of adversarial attack, namely, attack on DNN weight parameters, has been shown to be very powerful. As a representative one, the Bit-Flip-based adversarial weight Attack (BFA) injects an extremely small amount of faults into weight parameters to hijack the executing DNN function. Prior works of BFA focus on un-targeted attack that can hack all inputs into a random output class by flipping a very small number of weight bits stored in computer memory. This paper proposes the first work of targeted BFA based (T-BFA) adversarial weight attack on DNNs, which can intentionally mislead selected inputs to a target output class. The objective is achieved by identifying the weight bits that are highly associated with classification of a targeted output…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications