Evaluating Quantized Large Language Models for Code Generation on   Low-Resource Language Benchmarks

Enkhbold Nyamsuren

arXiv:2410.14766·cs.SE·October 22, 2024

Evaluating Quantized Large Language Models for Code Generation on Low-Resource Language Benchmarks

Enkhbold Nyamsuren

PDF

Open Access 1 Repo

TL;DR

This study evaluates the effectiveness of quantizing large language models to enable their deployment on consumer devices, focusing on Lua code generation tasks and analyzing different bit precisions for optimal performance and accessibility.

Contribution

It demonstrates that 4-bit quantization of 7B parameter models offers a practical balance, significantly improving accessibility without severely compromising performance.

Findings

01

4-bit quantized models outperform lower-bit models in Lua code generation

02

Quantized models can run efficiently on average laptops without GPUs

03

Performance drops notably at 2-bit precision and with lower parameter models

Abstract

Democratization of AI is an important topic within the broader topic of the digital divide. This issue is relevant to LLMs, which are becoming popular as AI co-pilots but suffer from a lack of accessibility due to high computational demand. In this study, we evaluate whether quantization is a viable approach toward enabling LLMs on generic consumer devices. The study assesses the performance of five quantized code LLMs in Lua code generation tasks. To evaluate the impact of quantization, the models with 7B parameters were tested on a consumer laptop at 2-, 4-, and 8-bit integer precisions and compared to non-quantized code LLMs with 1.3, 2, and 3 billion parameters. Lua is chosen as a low-level resource language to avoid models' biases related to high-resource languages. The results suggest that the models quantized at the 4-bit integer precision offer the best trade-off between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

E-Nyamsuren/qLMM-Lua-Eval-Pipeline
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech and dialogue systems · Topic Modeling

MethodsLLaMA