The Effect of Normalization for Bi-directional Amharic-English Neural Machine Translation
Tadesse Destaw Belay, Atnafu Lambebo Tonja, Olga Kolesnikova, Seid, Muhie Yimam, Abinew Ali Ayele, Silesh Bogale Haile, Grigori Sidorov,, Alexander Gelbukh

TL;DR
This paper introduces a large-scale Amharic-English dataset and demonstrates that Amharic homophone normalization improves neural machine translation performance in both translation directions.
Contribution
It provides the first large-scale Amharic-English dataset and shows that normalization of Amharic homophones enhances translation accuracy.
Findings
Normalization increases BLEU scores in both translation directions
Achieved BLEU scores of 37.79 (Amharic-English) and 32.74 (English-Amharic)
First large-scale Amharic-English parallel dataset
Abstract
Machine translation (MT) is one of the main tasks in natural language processing whose objective is to translate texts automatically from one natural language to another. Nowadays, using deep neural networks for MT tasks has received great attention. These networks require lots of data to learn abstract representations of the input and store it in continuous vectors. This paper presents the first relatively large-scale Amharic-English parallel sentence dataset. Using these compiled data, we build bi-directional Amharic-English translation models by fine-tuning the existing Facebook M2M100 pre-trained model achieving a BLEU score of 37.79 in Amharic-English 32.74 in English-Amharic translation. Additionally, we explore the effects of Amharic homophone normalization on the machine translation task. The results show that the normalization of Amharic homophone characters increases the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
