Machine Translation for Ge'ez Language
Aman Kassahun Wassie

TL;DR
This paper investigates various methods to improve machine translation for the low-resource and ancient Ge'ez language, including transfer learning, model finetuning, and large language models, highlighting their strengths and limitations.
Contribution
It introduces a multilingual neural machine translation model based on related languages and evaluates the use of GPT-3.5 for few-shot translation in Ge'ez.
Findings
MNMT improves BLEU by 4 points over bilingual models
Finetuning NLLB-200 performs poorly with limited data
GPT-3.5 achieves BLEU of 9.2 without prior Ge'ez knowledge
Abstract
Machine translation (MT) for low-resource languages such as Ge'ez, an ancient language that is no longer the native language of any community, faces challenges such as out-of-vocabulary words, domain mismatches, and lack of sufficient labeled training data. In this work, we explore various methods to improve Ge'ez MT, including transfer-learning from related languages, optimizing shared vocabulary and token segmentation approaches, finetuning large pre-trained models, and using large language models (LLMs) for few-shot translation with fuzzy matches. We develop a multilingual neural machine translation (MNMT) model based on languages relatedness, which brings an average performance improvement of about 4 BLEU compared to standard bilingual models. We also attempt to finetune the NLLB-200 model, one of the most advanced translation models available today, but find that it performs poorly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
Methods{Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Adam · Weight Decay · Cosine Annealing · Byte Pair Encoding · Dense Connections
