Learning an Artificial Language for Knowledge-Sharing in Multilingual   Translation

Danni Liu; Jan Niehues

arXiv:2211.01292·cs.CL·November 21, 2022

Learning an Artificial Language for Knowledge-Sharing in Multilingual Translation

Danni Liu, Jan Niehues

PDF

Open Access 1 Repo

TL;DR

This paper introduces a discretized artificial language for multilingual neural translation, improving interpretability and robustness, and enabling analysis of language sharing effects.

Contribution

It proposes a novel discretization of encoder outputs into an artificial language, enhancing interpretability and robustness in multilingual translation models.

Findings

01

Discretization improves model robustness in unseen conditions.

02

Artificial language enables better analysis of model behavior.

03

Similar bridge languages enhance knowledge sharing.

Abstract

The cornerstone of multilingual neural translation is shared representations across languages. Given the theoretically infinite representation power of neural networks, semantically identical sentences are likely represented differently. While representing sentences in the continuous latent space ensures expressiveness, it introduces the risk of capturing of irrelevant features which hinders the learning of a common representation. In this work, we discretize the encoder output latent space of multilingual models by assigning encoder states to entries in a codebook, which in effect represents source sentences in a new artificial language. This discretization process not only offers a new way to interpret the otherwise black-box model representations, but, more importantly, gives potential for increasing robustness in unseen testing conditions. We validate our approach on large-scale…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dannigt/fairseq/tree/master/examples/quant
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Neural Networks and Applications