The Effect of Alignment Objectives on Code-Switching Translation

Mohamed Anwar

arXiv:2309.05044·cs.CL·September 12, 2023

The Effect of Alignment Objectives on Code-Switching Translation

Mohamed Anwar

PDF

Open Access

TL;DR

This paper introduces a unified machine translation model capable of translating both monolingual and code-switched sentences, leveraging synthetic data and alignment loss to improve performance on code-switching content.

Contribution

It proposes a novel training approach with synthetic data and alignment loss, enabling a single model to handle monolingual and code-switched translation tasks effectively.

Findings

01

Outperforms bidirectional baselines on code-switched translation

02

Maintains high quality on monolingual translation

03

Uses synthetic data and alignment loss for improved performance

Abstract

One of the things that need to change when it comes to machine translation is the models' ability to translate code-switching content, especially with the rise of social media and user-generated content. In this paper, we are proposing a way of training a single machine translation model that is able to translate monolingual sentences from one language to another, along with translating code-switched sentences to either language. This model can be considered a bilingual model in the human sense. For better use of parallel data, we generated synthetic code-switched (CSW) data along with an alignment loss on the encoder to align representations across languages. Using the WMT14 English-French (En-Fr) dataset, the trained model strongly outperforms bidirectional baselines on code-switched translation while maintaining quality for non-code-switched (monolingual) data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification · Hate Speech and Cyberbullying Detection

MethodsALIGN