Importance-based Neuron Allocation for Multilingual Neural Machine Translation
Wanying Xie, Yang Feng, Shuhao Gu, Dong Yu

TL;DR
This paper introduces an importance-based neuron allocation method for multilingual neural machine translation, effectively balancing general and language-specific knowledge without parameter explosion.
Contribution
It proposes dividing neurons into general and language-specific parts based on importance, avoiding manual design and parameter issues in multilingual models.
Findings
Improves translation quality across multiple language pairs.
Demonstrates effectiveness on IWSLT and Europarl datasets.
Shows universality of the proposed neuron allocation method.
Abstract
Multilingual neural machine translation with a single model has drawn much attention due to its capability to deal with multiple languages. However, the current multilingual translation paradigm often makes the model tend to preserve the general knowledge, but ignore the language-specific knowledge. Some previous works try to solve this problem by adding various kinds of language-specific modules to the model, but they suffer from the parameter explosion problem and require specialized manual design. To solve these problems, we propose to divide the model neurons into general and language-specific parts based on their importance across languages. The general part is responsible for preserving the general knowledge and participating in the translation of all the languages, while the language-specific part is responsible for preserving the language-specific knowledge and participating in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
