Importance-based Neuron Allocation for Multilingual Neural Machine   Translation

Wanying Xie; Yang Feng; Shuhao Gu; Dong Yu

arXiv:2107.06569·cs.CL·July 15, 2021

Importance-based Neuron Allocation for Multilingual Neural Machine Translation

Wanying Xie, Yang Feng, Shuhao Gu, Dong Yu

PDF

Open Access 1 Repo

TL;DR

This paper introduces an importance-based neuron allocation method for multilingual neural machine translation, effectively balancing general and language-specific knowledge without parameter explosion.

Contribution

It proposes dividing neurons into general and language-specific parts based on importance, avoiding manual design and parameter issues in multilingual models.

Findings

01

Improves translation quality across multiple language pairs.

02

Demonstrates effectiveness on IWSLT and Europarl datasets.

03

Shows universality of the proposed neuron allocation method.

Abstract

Multilingual neural machine translation with a single model has drawn much attention due to its capability to deal with multiple languages. However, the current multilingual translation paradigm often makes the model tend to preserve the general knowledge, but ignore the language-specific knowledge. Some previous works try to solve this problem by adding various kinds of language-specific modules to the model, but they suffer from the parameter explosion problem and require specialized manual design. To solve these problems, we propose to divide the model neurons into general and language-specific parts based on their importance across languages. The general part is responsible for preserving the general knowledge and participating in the translation of all the languages, while the language-specific part is responsible for preserving the language-specific knowledge and participating in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ictnlp/NA-MNMT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications