Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts
Emad A. Alghamdi, Jezia Zakraoui, Fares A. Abanmy

TL;DR
This paper investigates domain-specific adaptation for Arabic machine translation in the financial sector, demonstrating that fine-tuning large language models like ChatGPT-3.5 Turbo improves translation quality with minimal data.
Contribution
It introduces a new Arabic-English financial domain dataset and shows that fine-tuning ChatGPT-3.5 Turbo enhances translation performance, a novel application in Arabic domain adaptation.
Findings
ChatGPT-3.5 Turbo outperforms other models in translation quality.
Few in-domain segments are sufficient for effective fine-tuning.
First application of ChatGPT fine-tuning for Arabic financial translation.
Abstract
Neural machine translation (NMT) has shown impressive performance when trained on large-scale corpora. However, generic NMT systems have demonstrated poor performance on out-of-domain translation. To mitigate this issue, several domain adaptation methods have recently been proposed which often lead to better translation quality than genetic NMT systems. While there has been some continuous progress in NMT for English and other European languages, domain adaption in Arabic has received little attention in the literature. The current study, therefore, aims to explore the effectiveness of domain-specific adaptation for Arabic MT (AMT), in yet unexplored domain, financial news articles. To this end, we developed carefully a parallel corpus for Arabic-English (AR- EN) translation in the financial domain for benchmarking different domain adaptation methods. We then fine-tuned several…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
