Quantization Blindspots: How Model Compression Breaks Backdoor Defenses

Rohan Pandey; Eric Ye

arXiv:2512.06243·cs.LG·December 9, 2025

Quantization Blindspots: How Model Compression Breaks Backdoor Defenses

Rohan Pandey, Eric Ye

PDF

Open Access

TL;DR

This paper reveals that standard quantization practices in neural network deployment significantly undermine the effectiveness of existing backdoor defenses, exposing a critical gap in current evaluation methods.

Contribution

It systematically studies the impact of quantization on backdoor defenses, highlighting the need to consider quantization robustness in defense evaluation and design.

Findings

01

INT8 quantization reduces detection rates to 0%

02

Backdoors remain effective with high attack success rates after quantization

03

Defense effectiveness varies with dataset and quantization level

Abstract

Backdoor attacks embed input-dependent malicious behavior into neural networks while preserving high clean accuracy, making them a persistent threat for deployed ML systems. At the same time, real-world deployments almost never serve full-precision models: post-training quantization to INT8 or lower precision is now standard practice for reducing memory and latency. This work asks a simple question: how do existing backdoor defenses behave under standard quantization pipelines? We conduct a systematic empirical study of five representative defenses across three precision settings (FP32, INT8 dynamic, INT4 simulated) and two standard vision benchmarks using a canonical BadNet attack. We observe that INT8 quantization reduces the detection rate of all evaluated defenses to 0% while leaving attack success rates above 99%. For INT4, we find a pronounced dataset dependence: Neural Cleanse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Advanced Neural Network Applications