Nearest is Not Dearest: Towards Practical Defense against   Quantization-conditioned Backdoor Attacks

Boheng Li; Yishuo Cai; Haowei Li; Feng Xue; Zhifeng Li; Yiming Li

arXiv:2405.12725·cs.CR·May 22, 2024·1 cites

Nearest is Not Dearest: Towards Practical Defense against Quantization-conditioned Backdoor Attacks

Boheng Li, Yishuo Cai, Haowei Li, Feng Xue, Zhifeng Li, Yiming Li

PDF

Open Access 1 Repo

TL;DR

This paper analyzes quantization-conditioned backdoor attacks in neural networks, revealing their reliance on nearest rounding, and proposes a novel defense method called EFRAP that effectively mitigates these backdoors while preserving model accuracy.

Contribution

The paper provides the first in-depth analysis of QCBs, linking their activation to rounding operations, and introduces EFRAP, a practical defense leveraging non-nearest rounding guided by error norms.

Findings

01

EFRAP effectively defeats state-of-the-art QCB attacks.

02

QCB activation is mainly due to nearest rounding and neuron-wise error norms.

03

EFRAP maintains high accuracy while reducing backdoor threats.

Abstract

Model quantization is widely used to compress and accelerate deep neural networks. However, recent studies have revealed the feasibility of weaponizing model quantization via implanting quantization-conditioned backdoors (QCBs). These special backdoors stay dormant on released full-precision models but will come into effect after standard quantization. Due to the peculiarity of QCBs, existing defenses have minor effects on reducing their threats or are even infeasible. In this paper, we conduct the first in-depth analysis of QCBs. We reveal that the activation of existing QCBs primarily stems from the nearest rounding operation and is closely related to the norms of neuron-wise truncation errors (i.e., the difference between the continuous full-precision weights and its quantized version). Motivated by these insights, we propose Error-guided Flipped Rounding with Activation Preservation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

antigonerandy/quantbackdoor_efrap
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Brain Metastases and Treatment · Security and Verification in Computing