SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task
Zuchao Li, Hai Zhao, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro, Sumita

TL;DR
This paper describes SJTU-NICT's participation in WMT20, where they developed diverse neural machine translation systems for multiple language pairs and tracks, achieving top results in several directions.
Contribution
The paper introduces novel NMT techniques and data filtering strategies tailored for different language pairs and translation tracks, leading to state-of-the-art performance.
Findings
Won first place in English-Chinese, Polish-English, and German-Upper Sorbian translation tasks.
Implemented diverse NMT techniques including document-enhanced and pre-trained models.
Used TF-IDF filtering to improve domain relevance of training data.
Abstract
In this paper, we introduced our joint team SJTU-NICT 's participation in the WMT 2020 machine translation shared task. In this shared task, we participated in four translation directions of three language pairs: English-Chinese, English-Polish on supervised machine translation track, German-Upper Sorbian on low-resource and unsupervised machine translation tracks. Based on different conditions of language pairs, we have experimented with diverse neural machine translation (NMT) techniques: document-enhanced NMT, XLM pre-trained language model enhanced NMT, bidirectional translation as a pre-training, reference language based UNMT, data-dependent gaussian prior objective, and BT-BLEU collaborative filtering self-training. We also used the TF-IDF algorithm to filter the training set to obtain a domain more similar set with the test set for finetuning. In our submissions, the primary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsLinear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Residual Connection · Attention Is All You Need · Adam · Byte Pair Encoding · Softmax · Multi-Head Attention · Layer Normalization
