The LIG system for the English-Czech Text Translation Task of IWSLT 2019
Lo\"ic Vial, Benjamin Lecouteux, Didier Schwab, Hang Le, Laurent, Besacier

TL;DR
This paper explores how integrating pre-trained language models like BERT into a Transformer-based machine translation system can enhance English-Czech translation quality, especially with limited training data.
Contribution
It demonstrates the effectiveness of using external pre-trained language models in low-resource machine translation settings, comparing different configurations.
Findings
External BERT improves BLEU scores by up to 1.94 points.
Own language model benefits only with very small datasets.
Pre-trained models enhance translation performance in low-data scenarios.
Abstract
In this paper, we present our submission for the English to Czech Text Translation Task of IWSLT 2019. Our system aims to study how pre-trained language models, used as input embeddings, can improve a specialized machine translation system trained on few data. Therefore, we implemented a Transformer-based encoder-decoder neural system which is able to use the output of a pre-trained language model as input embeddings, and we compared its performance under three configurations: 1) without any pre-trained language model (constrained), 2) using a language model trained on the monolingual parts of the allowed English-Czech data (constrained), and 3) using a language model trained on a large quantity of external monolingual data (unconstrained). We used BERT as external pre-trained language model (configuration 3), and BERT architecture for training our own language model (configuration 2).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsTest · Linear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece
