Improving Multilingual Neural Machine Translation by Utilizing Semantic and Linguistic Features
Mengyu Bu, Shuhao Gu, Yang Feng

TL;DR
This paper proposes a novel approach to multilingual neural machine translation that disentangles semantic and linguistic features to improve zero-shot translation performance while maintaining supervised translation quality.
Contribution
It introduces a disentangling learning task and a linguistic encoder to leverage semantic and linguistic features, enhancing knowledge sharing across languages.
Findings
Significant improvement in zero-shot translation accuracy.
Maintains performance in supervised translation tasks.
Effective utilization of semantic and linguistic features.
Abstract
The many-to-many multilingual neural machine translation can be regarded as the process of integrating semantic features from the source sentences and linguistic features from the target sentences. To enhance zero-shot translation, models need to share knowledge across languages, which can be achieved through auxiliary tasks for learning a universal representation or cross-lingual mapping. To this end, we propose to exploit both semantic and linguistic features between multiple languages to enhance multilingual translation. On the encoder side, we introduce a disentangling learning task that aligns encoder representations by disentangling semantic and linguistic features, thus facilitating knowledge transfer while preserving complete information. On the decoder side, we leverage a linguistic encoder to integrate low-level linguistic features to assist in the target language generation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
