Smaller = Weaker? Benchmarking Robustness of Quantized LLMs in Code Generation

Sen Fang; Weiyuan Ding; Antonio Mastropaolo; Bowen Xu

arXiv:2506.22776·cs.SE·July 1, 2025

Smaller = Weaker? Benchmarking Robustness of Quantized LLMs in Code Generation

Sen Fang, Weiyuan Ding, Antonio Mastropaolo, Bowen Xu

PDF

Open Access

TL;DR

This paper systematically investigates how quantization impacts the robustness of large language models in code generation, revealing that quantized models often outperform full-precision ones in adversarial and noise resilience.

Contribution

First comprehensive study on the robustness effects of quantization on LLMs in code generation, challenging assumptions about potential robustness degradation.

Findings

01

Quantized LLMs show higher adversarial robustness (51.59%) compared to full-precision models (42.86%).

02

Quantization generally improves LLMs' resilience to weight perturbations.

03

Results suggest quantization can enhance both efficiency and robustness of LLMs.

Abstract

Quantization has emerged as a mainstream method for compressing Large Language Models (LLMs), reducing memory requirements and accelerating inference without architectural modifications. While existing research primarily focuses on evaluating the effectiveness of quantized LLMs compared to their original counterparts, the impact on robustness remains largely unexplored.In this paper, we present the first systematic investigation of how quantization affects the robustness of LLMs in code generation tasks. Through extensive experiments across four prominent LLM families (LLaMA, DeepSeek, CodeGen, and StarCoder) with parameter scales ranging from 350M to 33B, we evaluate robustness from dual perspectives: adversarial attacks on input prompts and noise perturbations on model architecture. Our findings challenge conventional wisdom by demonstrating that quantized LLMs often exhibit superior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning in Materials Science · Topic Modeling