SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models
Abdul Rafae Khan, Hrishikesh Kanade, Girish Amar Budhrani, Preet, Jhanglani, Jia Xu

TL;DR
This paper presents a multilingual NMT system for code-mixed translation tasks, leveraging large pre-trained models, in-domain data, back-translation, and ensemble methods, achieving top rankings in WMT 2022 shared tasks.
Contribution
It introduces a high-performing translation system for Hinglish, utilizing giant pre-trained models and advanced techniques, setting new benchmarks in the shared task.
Findings
Achieved 1st place in subtask 2 (Hinglish to English) across multiple metrics.
Achieved 1st place in subtask 1 (Hindi/English to Hinglish) according to WER and human evaluation.
Secured 3rd place in subtask 1 based on ROUGE-L.
Abstract
This paper describes the Stevens Institute of Technology's submission for the WMT 2022 Shared Task: Code-mixed Machine Translation (MixMT). The task consisted of two subtasks, subtask Hindi/English to Hinglish and subtask Hinglish to English translation. Our findings lie in the improvements made through the use of large pre-trained multilingual NMT models and in-domain datasets, as well as back-translation and ensemble techniques. The translation output is automatically evaluated against the reference translations using ROUGE-L and WER. Our system achieves the position on subtask according to ROUGE-L, WER, and human evaluation, position on subtask according to WER and human evaluation, and position on subtask with respect to ROUGE-L metric.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
