dziribot: rag based intelligent conversational agent for algerian arabic dialect
El Batoul Bechiri, Dihia Lanasri

TL;DR
DziriBOT is a novel hybrid conversational agent tailored for Algerian Darja, integrating RAG and specialized NLU to handle linguistic complexities, outperforming traditional models in low-resource dialect scenarios.
Contribution
This paper introduces DziriBOT, a dialect-aware conversational agent using a multi-layered RAG architecture and fine-tuned transformer models for Algerian Arabic dialect.
Findings
Fine-tuned DziriBERT achieves state-of-the-art performance.
Outperforms traditional baselines in handling orthographic noise.
Provides a scalable solution for dialect-aware automation.
Abstract
The rapid digitalization of customer service has intensified the demand for conversational agents capable of providing accurate and natural interactions. In the Algerian context, this is complicated by the linguistic complexity of Darja, a dialect characterized by non-standardized orthography, extensive code-switching with French, and the simultaneous use of Arabic and Latin (Arabizi) scripts. This paper introduces DziriBOT, a hybrid intelligent conversational agent specifically engineered to overcome these challenges. We propose a multi-layered architecture that integrates specialized Natural Language Understanding (NLU) with Retrieval-Augmented Generation (RAG), allowing for both structured service flows and dynamic, knowledge-intensive responses grounded in curated enterprise documentation. To address the low-resource nature of Darja, we systematically evaluate three distinct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · AI in Service Interactions · Language and cultural evolution
