The MeMAD Submission to the IWSLT 2018 Speech Translation Task

Umut Sulubacak; J\"org Tiedemann; Aku Rouhe; Stig-Arne Gr\"onroos,; Mikko Kurimo

arXiv:1810.10320·cs.CL·October 25, 2018·1 cites

The MeMAD Submission to the IWSLT 2018 Speech Translation Task

Umut Sulubacak, J\"org Tiedemann, Aku Rouhe, Stig-Arne Gr\"onroos,, Mikko Kurimo

PDF

Open Access

TL;DR

This paper details the MeMAD team's pipeline approach to English-to-German speech translation for IWSLT 2018, combining ASR and NMT models trained on TED and OpenSubtitles data, achieving a BLEU score of 16.45.

Contribution

The paper introduces contrastive translation systems using diverse training data and highlights the impact of OpenSubtitles2018 on translation quality.

Findings

01

OpenSubtitles2018 data improves translation performance

02

Pre- and postprocessing routines had limited impact

03

Best system achieved BLEU score of 16.45

Abstract

This paper describes the MeMAD project entry to the IWSLT Speech Translation Shared Task, addressing the translation of English audio into German text. Between the pipeline and end-to-end model tracks, we participated only in the former, with three contrastive systems. We tried also the latter, but were not able to finish our end-to-end model in time. All of our systems start by transcribing the audio into text through an automatic speech recognition (ASR) model trained on the TED-LIUM English Speech Recognition Corpus (TED-LIUM). Afterwards, we feed the transcripts into English-German text-based neural machine translation (NMT) models. Our systems employ three different translation models trained on separate training sets compiled from the English-German part of the TED Speech Translation Corpus (TED-Trans) and the OpenSubtitles2018 section of the OPUS collection. In this paper, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling