AL-QASIDA: Analyzing LLM Quality and Accuracy Systematically in   Dialectal Arabic

Nathaniel R. Robinson; Shahd Abdelmoneim; Kelly Marchisio; Sebastian; Ruder

arXiv:2412.04193·cs.CL·January 7, 2025

AL-QASIDA: Analyzing LLM Quality and Accuracy Systematically in Dialectal Arabic

Nathaniel R. Robinson, Shahd Abdelmoneim, Kelly Marchisio, Sebastian, Ruder

PDF

Open Access 1 Video

TL;DR

This paper introduces a comprehensive framework for evaluating large language models' performance in dialectal Arabic, revealing biases and challenges in generating and understanding this underrepresented language variety.

Contribution

It provides an operationalized assessment framework for LLMs in dialectal Arabic and offers practical insights into their strengths and limitations.

Findings

01

LLMs understand dialectal Arabic better than they generate it.

02

Post-training can introduce bias against dialectal Arabic.

03

Few-shot examples help improve LLM performance in dialectal Arabic.

Abstract

Dialectal Arabic (DA) varieties are under-served by language technologies, particularly large language models (LLMs). This trend threatens to exacerbate existing social inequalities and limits LLM applications, yet the research community lacks operationalized performance measurements in DA. We present a framework that comprehensively assesses LLMs' DA modeling capabilities across four dimensions: fidelity, understanding, quality, and diglossia. We evaluate nine LLMs in eight DA varieties and provide practical recommendations. Our evaluation suggests that LLMs do not produce DA as well as they understand it, not because their DA fluency is poor, but because they are reluctant to generate DA. Further analysis suggests that current post-training can contribute to bias against DA, that few-shot examples can overcome this deficiency, and that otherwise no measurable features of input text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

AL-QASIDA: Analyzing LLM Quality and Accuracy Systematically in Dialectal Arabic· underline

Taxonomy

TopicsLanguage, Linguistics, Cultural Analysis · Natural Language Processing Techniques · Historical and Linguistic Studies