Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and   Multi-Domain Testing

Prakash Aryan

arXiv:2412.17548·cs.CL·December 24, 2024

Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing

Prakash Aryan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a resource-efficient method for fine-tuning a large Arabic language model using Quantized Low-Rank Adaptation on limited hardware, improving performance across multiple Arabic NLP tasks.

Contribution

It presents a novel adaptation process for Arabic LLMs using QLoRA with low hardware requirements, addressing linguistic challenges and demonstrating effective multi-domain performance.

Findings

01

Significant reduction in GPU memory usage during training

02

Improved accuracy on Arabic NLP tasks

03

Robustness to input perturbations

Abstract

This paper presents a novel approach to fine-tuning the Qwen2-1.5B model for Arabic language processing using Quantized Low-Rank Adaptation (QLoRA) on a system with only 4GB VRAM. We detail the process of adapting this large language model to the Arabic domain, using diverse datasets including Bactrian, OpenAssistant, and Wikipedia Arabic corpora. Our methodology involves custom data preprocessing, model configuration, and training optimization techniques such as gradient accumulation and mixed-precision training. We address specific challenges in Arabic NLP, including morphological complexity, dialectal variations, and diacritical mark handling. Experimental results over 10,000 training steps show significant performance improvements, with the final loss converging to 0.1083. We provide comprehensive analysis of GPU memory usage, training dynamics, and model evaluation across various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

prakash-aryan/qwen-arabic-project
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsService-Oriented Architecture and Web Services · Semantic Web and Ontologies · Natural Language Processing Techniques