A Semi Black-Box Adversarial Bit-Flip Attack with Limited DNN Model   Information

Behnam Ghavami; Mani Sadati; Mohammad Shahidzadeh; Lesley Shannon,; Steve Wilton

arXiv:2412.09450·cs.CR·December 13, 2024

A Semi Black-Box Adversarial Bit-Flip Attack with Limited DNN Model Information

Behnam Ghavami, Mani Sadati, Mohammad Shahidzadeh, Lesley Shannon,, Steve Wilton

PDF

Open Access

TL;DR

This paper introduces B3FA, a semi-black-box adversarial attack method that efficiently causes significant accuracy drops in DNNs with limited model knowledge and minimal bit-flips.

Contribution

The paper presents a novel semi-black-box bit-flip attack method that operates with limited model information, contrasting with full-access models, and demonstrates its effectiveness on real-world DNNs.

Findings

01

B3FA can reduce MobileNetV2 accuracy from 69.84% to 9% with 20 bit-flips.

02

The method effectively identifies vulnerable bits using a magnitude-based ranking and statistical reconstruction.

03

B3FA outperforms existing attacks under limited knowledge scenarios.

Abstract

Despite the rising prevalence of deep neural networks (DNNs) in cyber-physical systems, their vulnerability to adversarial bit-flip attacks (BFAs) is a noteworthy concern. This paper proposes B3FA, a semi-black-box BFA-based parameter attack on DNNs, assuming the adversary has limited knowledge about the model. We consider practical scenarios often feature a more restricted threat model for real-world systems, contrasting with the typical BFA models that presuppose the adversary's full access to a network's inputs and parameters. The introduced bit-flip approach utilizes a magnitude-based ranking method and a statistical re-construction technique to identify the vulnerable bits. We demonstrate the effectiveness of B3FA on several DNN models in a semi-black-box setting. For example, B3FA could drop the accuracy of a MobileNetV2 from 69.84% to 9% with only 20 bit-flips in a real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Bacillus and Francisella bacterial research