Widening the Gap: Exploiting LLM Quantization via Outlier Injection

Xiaohua Zhan; Kazuki Egashira; Robin Staab; Mark Vero; Martin Vechev

arXiv:2605.15152·cs.LG·May 15, 2026

Widening the Gap: Exploiting LLM Quantization via Outlier Injection

Xiaohua Zhan, Kazuki Egashira, Robin Staab, Mark Vero, Martin Vechev

PDF

TL;DR

This paper introduces a novel attack exploiting outliers in large language model quantization, demonstrating high success rates across advanced schemes and revealing broad security vulnerabilities.

Contribution

It presents the first quantization-conditioned attack effective against multiple sophisticated quantization methods, exposing new security risks.

Findings

01

High success rates against advanced quantization schemes

02

Effective attack across multiple large language models

03

Reveals broad security vulnerabilities in model quantization

Abstract

LLM quantization has become essential for memory-efficient deployment. Recent work has shown that quantization schemes can pose critical security risks: an adversary may release a model that appears benign in full precision but exhibits malicious behavior once quantized by users. However, existing quantization-conditioned attacks have been limited to relatively simple quantization methods, where the attacker can estimate weight regions that remain invariant under the target quantization. Notably, prior attacks have consistently failed to compromise more popular and sophisticated schemes, limiting their practical impact. In this work, we introduce the first quantization-conditioned attack that consistently induces malicious behavior that can be triggered by a broad range of advanced quantization techniques, including AWQ, GPTQ, and GGUF I-quants. Our attack exploits a simple property…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.