The MeMAD Submission to the IWSLT 2018 Speech Translation Task
Umut Sulubacak, J\"org Tiedemann, Aku Rouhe, Stig-Arne Gr\"onroos,, Mikko Kurimo

TL;DR
This paper details the MeMAD team's pipeline approach to English-to-German speech translation for IWSLT 2018, combining ASR and NMT models trained on TED and OpenSubtitles data, achieving a BLEU score of 16.45.
Contribution
The paper introduces contrastive translation systems using diverse training data and highlights the impact of OpenSubtitles2018 on translation quality.
Findings
OpenSubtitles2018 data improves translation performance
Pre- and postprocessing routines had limited impact
Best system achieved BLEU score of 16.45
Abstract
This paper describes the MeMAD project entry to the IWSLT Speech Translation Shared Task, addressing the translation of English audio into German text. Between the pipeline and end-to-end model tracks, we participated only in the former, with three contrastive systems. We tried also the latter, but were not able to finish our end-to-end model in time. All of our systems start by transcribing the audio into text through an automatic speech recognition (ASR) model trained on the TED-LIUM English Speech Recognition Corpus (TED-LIUM). Afterwards, we feed the transcripts into English-German text-based neural machine translation (NMT) models. Our systems employ three different translation models trained on separate training sets compiled from the English-German part of the TED Speech Translation Corpus (TED-Trans) and the OpenSubtitles2018 section of the OPUS collection. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling
