Evaluating Quantized Large Language Models for Code Generation on Low-Resource Language Benchmarks
Enkhbold Nyamsuren

TL;DR
This study evaluates the effectiveness of quantizing large language models to enable their deployment on consumer devices, focusing on Lua code generation tasks and analyzing different bit precisions for optimal performance and accessibility.
Contribution
It demonstrates that 4-bit quantization of 7B parameter models offers a practical balance, significantly improving accessibility without severely compromising performance.
Findings
4-bit quantized models outperform lower-bit models in Lua code generation
Quantized models can run efficiently on average laptops without GPUs
Performance drops notably at 2-bit precision and with lower parameter models
Abstract
Democratization of AI is an important topic within the broader topic of the digital divide. This issue is relevant to LLMs, which are becoming popular as AI co-pilots but suffer from a lack of accessibility due to high computational demand. In this study, we evaluate whether quantization is a viable approach toward enabling LLMs on generic consumer devices. The study assesses the performance of five quantized code LLMs in Lua code generation tasks. To evaluate the impact of quantization, the models with 7B parameters were tested on a consumer laptop at 2-, 4-, and 8-bit integer precisions and compared to non-quantized code LLMs with 1.3, 2, and 3 billion parameters. Lua is chosen as a low-level resource language to avoid models' biases related to high-resource languages. The results suggest that the models quantized at the 4-bit integer precision offer the best trade-off between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Speech and dialogue systems · Topic Modeling
MethodsLLaMA
