RADAR: Run-time Adversarial Weight Attack Detection and Accuracy   Recovery

Jingtao Li; Adnan Siraj Rakin; Zhezhi He; Deliang Fan; Chaitali; Chakrabarti

arXiv:2101.08254·cs.CR·March 10, 2022

RADAR: Run-time Adversarial Weight Attack Detection and Accuracy Recovery

Jingtao Li, Adnan Siraj Rakin, Zhezhi He, Deliang Fan, Chaitali, Chakrabarti

PDF

1 Repo

TL;DR

RADAR is a run-time detection and recovery scheme for neural network weights that identifies malicious bit-flips using checksum signatures and restores accuracy with minimal overhead.

Contribution

This work introduces a checksum-based method for real-time detection and mitigation of adversarial weight attacks in neural networks during inference.

Findings

01

Detects 96% of bit-flips on average in ResNet-18

02

Restores accuracy from below 1% to above 69% after attacks

03

Adds less than 1% inference time overhead

Abstract

Adversarial attacks on Neural Network weights, such as the progressive bit-flip attack (PBFA), can cause a catastrophic degradation in accuracy by flipping a very small number of bits. Furthermore, PBFA can be conducted at run time on the weights stored in DRAM main memory. In this work, we propose RADAR, a Run-time adversarial weight Attack Detection and Accuracy Recovery scheme to protect DNN weights against PBFA. We organize weights that are interspersed in a layer into groups and employ a checksum-based algorithm on weights to derive a 2-bit signature for each group. At run time, the 2-bit signature is computed and compared with the securely stored golden signature to detect the bit-flip attacks in a group. After successful detection, we zero out all the weights in a group to mitigate the accuracy drop caused by malicious bit-flips. The proposed scheme is embedded in the inference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zlijingtao/radar_check
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.