RoQLlama: A Lightweight Romanian Adapted Language Model

George-Andrei Dima; Andrei-Marius Avram; Cristian-George Cr\u{a}ciun; and Dumitru-Clementin Cercel

arXiv:2410.04269·cs.CL·October 8, 2024

RoQLlama: A Lightweight Romanian Adapted Language Model

George-Andrei Dima, Andrei-Marius Avram, Cristian-George Cr\u{a}ciun, and Dumitru-Clementin Cercel

PDF

Open Access 1 Models 1 Video

TL;DR

This paper introduces RoQLlama, a lightweight Romanian language model based on Llama2, trained with QLoRA, achieving competitive results on Romanian tasks and introducing a new Romanian medical question dataset.

Contribution

It presents RoQLlama-7b, a quantized Romanian LLM that performs well on downstream tasks and introduces the RoMedQA dataset for medical question answering.

Findings

01

RoQLlama-7b matches or exceeds full-sized model performance.

02

Higher average scores in few-shot prompts.

03

Effective Romanian language adaptation with reduced resources.

Abstract

The remarkable achievements obtained by open-source large language models (LLMs) in recent years have predominantly been concentrated on tasks involving the English language. In this paper, we aim to advance the performance of Llama2 models on Romanian tasks. We tackle the problem of reduced computing resources by using QLoRA for training. We release RoQLlama-7b, a quantized LLM, which shows equal or improved results compared to its full-sized counterpart when tested on seven Romanian downstream tasks in the zero-shot setup. Also, it consistently achieves higher average scores across all few-shot prompts. Additionally, we introduce a novel Romanian dataset, namely RoMedQA, which contains single-choice medical questions in Romanian.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
andreidima/Llama-2-7b-Romanian-qlora
model· 15 dl
15 dl

Videos

RoQLlama: A Lightweight Romanian Adapted Language Model· underline

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification