Widening the Gap: Exploiting LLM Quantization via Outlier Injection
Xiaohua Zhan, Kazuki Egashira, Robin Staab, Mark Vero, Martin Vechev

TL;DR
This paper introduces a novel attack exploiting outliers in large language model quantization, demonstrating high success rates across advanced schemes and revealing broad security vulnerabilities.
Contribution
It presents the first quantization-conditioned attack effective against multiple sophisticated quantization methods, exposing new security risks.
Findings
High success rates against advanced quantization schemes
Effective attack across multiple large language models
Reveals broad security vulnerabilities in model quantization
Abstract
LLM quantization has become essential for memory-efficient deployment. Recent work has shown that quantization schemes can pose critical security risks: an adversary may release a model that appears benign in full precision but exhibits malicious behavior once quantized by users. However, existing quantization-conditioned attacks have been limited to relatively simple quantization methods, where the attacker can estimate weight regions that remain invariant under the target quantization. Notably, prior attacks have consistently failed to compromise more popular and sophisticated schemes, limiting their practical impact. In this work, we introduce the first quantization-conditioned attack that consistently induces malicious behavior that can be triggered by a broad range of advanced quantization techniques, including AWQ, GPTQ, and GGUF I-quants. Our attack exploits a simple property…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
