Adapting Large Language Models to a Low-Resource Agglutinative Language: A Comparative Study of LoRA and QLoRA for Bashkir
Mullosharaf K. Arabov, Svetlana S. Khaybullina

TL;DR
This study compares PEFT methods like LoRA and QLoRA for adapting large language models to Bashkir, a low-resource agglutinative language, highlighting the trade-offs between model size, quality, and computational efficiency.
Contribution
It provides a comprehensive evaluation of PEFT techniques on various models for Bashkir, demonstrating QLoRA's effectiveness on 7B-scale models with reduced training costs.
Findings
QLoRA on Mistral-7B and Phi-2 achieves comparable perplexity to full fine-tuning.
Full fine-tuning GPT-2 medium yields the lowest perplexity (3.34).
PEFT can cause significant quality degradation depending on the model architecture.
Abstract
This paper presents a comparative study of parameter-efficient fine-tuning (PEFT) methods, including LoRA and QLoRA, applied to the task of adapting large language models to the Bashkir language, a low-resource agglutinative language of the Turkic family. Experimental evaluation is conducted on a Bashkir text corpus of 71k documents (46.9M tokens) using models of various architectures: DistilGPT2, GPT-2 (base, medium), Phi-2, Qwen2.5-7B, DeepSeek-7B, and Mistral-7B. To improve the reliability of results, each configuration was trained with three different random seeds. The lowest perplexity on the test set was obtained for GPT-2 medium with full fine-tuning (3.34). Meanwhile, QLoRA applied to Mistral-7B (3.79) and Phi-2 (3.81) achieved comparable quality with over 40 times fewer trainable parameters. However, we also observed cases of significant quality degradation when using PEFT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
