Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning
Zhen Li, Yupeng Su, Runming Yang, Congkai Xie, Zheng Wang, Zhongwei, Xie, Ngai Wong, Hongxia Yang

TL;DR
This paper investigates the effects of low-bit quantization on large language models' mathematical reasoning abilities, proposing evaluation and recovery methods to mitigate accuracy loss.
Contribution
It systematically evaluates quantization impacts on reasoning tasks and introduces targeted fine-tuning and error diagnosis techniques to restore model performance.
Findings
Aggressive quantization causes up to 32.39% accuracy degradation.
Fine-tuning on limited task-specific data recovers near full-precision performance.
An error diagnosis pipeline achieves 98.9% accuracy in identifying quantization errors.
Abstract
Large language models have achieved significant advancements in complex mathematical reasoning benchmarks, such as MATH. However, their substantial computational requirements present challenges for practical deployment. Model quantization has emerged as an effective strategy to reduce memory usage and computational costs by employing lower precision and bit-width representations. In this study, we systematically evaluate the impact of quantization on mathematical reasoning tasks. Our results demonstrate that aggressive quantization methods like AWQ and GPTQ introduce up to 32.39% accuracy degradation (average 11.31%) on Llama-3 models, particularly in numerical computation and reasoning planning. To address this, we introduce a multidimensional evaluation framework combining qualitative capability analysis and quantitative error assessment. We further develop targeted recovery…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvancements in Semiconductor Devices and Circuit Design · Semiconductor materials and devices · Analog and Mixed-Signal Circuit Design
