Empowering Persian LLMs for Instruction Following: A Novel Dataset and Training Approach
Hojjat Mokhtarabadi, Ziba Zamani, Abbas Maazallahi, Mohammad Hossein, Manshaei

TL;DR
This paper introduces FarsInstruct, a comprehensive dataset and a novel training framework called Co-CoLA, to improve instruction-following capabilities of large language models in Persian, a low-resource language.
Contribution
It presents the first extensive Persian instruction dataset and a multi-task training framework to enhance LLM performance in low-resource language settings.
Findings
FarsInstruct contains 197 templates across 21 datasets.
The combined approach improves Persian LLM instruction-following performance.
Framework enhances multi-task adaptability of LoRA-tuned models.
Abstract
Instruction-tuned large language models have demonstrated remarkable capabilities in following human instructions across various domains. However, their proficiency remains notably deficient in many low-resource languages. To address this challenge, we begin by introducing FarsInstruct a comprehensive instruction dataset designed to enhance the instruction following ability of large language models specifically for the Persian language a significant yet underrepresented language globally. FarsInstruct encompasses a wide range of task types and datasets, each containing a mix of straightforward to complex manual written instructions, as well as translations from the Public Pool of Prompts, ensuring a rich linguistic and cultural representation. Furthermore, we introduce Co-CoLA, a framework designed to enhance the multi-task adaptability of LoRA-tuned models. Through extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification · Natural Language Processing Techniques · Topic Modeling
