One-To-Many Multilingual End-to-end Speech Translation

Mattia Antonino Di Gangi; Matteo Negri; Marco Turchi

arXiv:1910.03320·cs.CL·October 9, 2019

TL;DR

This paper introduces a multilingual transfer learning approach for end-to-end speech translation, using target-language embeddings to improve translation quality across six languages, especially with limited data.

Contribution

The paper proposes a novel target-language embedding method for multilingual speech translation, addressing the limitations of target forcing in speech tasks.

Findings

1. Significant BLEU score improvements, especially for similar languages.

2. Enhanced translation performance with additional English ASR data.

3. Effective handling of low-resource language translation scenarios.

Abstract

Training end-to-end neural models for spoken language translation (SLT) still has to contend with extreme data scarcity. The existing SLT parallel corpora are orders of magnitude smaller than those available for the closely related tasks of automatic speech recognition (ASR) and machine translation (MT), which usually comprise tens of millions of instances. To cope with data paucity, in this paper we explore the effectiveness of transfer learning in end-to-end SLT by presenting a multilingual approach to the task. Multilingual solutions are widely studied in MT and usually rely on "target forcing", in which multilingual parallel data are combined to train a single model by prepending to the input sequences a language token that specifies the target language. However, when tested in speech translation, our experiments show that MT-like…
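The target forcing scheme described in the abstract can be illustrated for the text MT case with a minimal sketch. The token format `<2de>`, the helper names `add_target_token` and `build_multilingual_corpus`, and the toy sentence pairs are assumptions made for illustration; they are not taken from the paper, and the sketch does not represent the paper's speech approach.

```python
from typing import Dict, List, Tuple

SentencePair = Tuple[List[str], List[str]]


def add_target_token(source_tokens: List[str], target_lang: str) -> List[str]:
    """Prepend a language token naming the target language.

    The "<2xx>" spelling is an illustrative convention, not the paper's.
    """
    return [f"<2{target_lang}>"] + source_tokens


def build_multilingual_corpus(
    corpora: Dict[str, List[SentencePair]]
) -> List[SentencePair]:
    """Merge per-language parallel data into one training set.

    Each source sentence is tagged with its target language, so a single
    model can be trained on the combined data.
    """
    mixed: List[SentencePair] = []
    for target_lang, pairs in corpora.items():
        for source_tokens, target_tokens in pairs:
            mixed.append((add_target_token(source_tokens, target_lang), target_tokens))
    return mixed


# Toy data: the same English source paired with two target languages.
corpora = {
    "de": [(["good", "morning"], ["guten", "Morgen"])],
    "it": [(["good", "morning"], ["buongiorno"])],
}
mixed = build_multilingual_corpus(corpora)
for source_tokens, target_tokens in mixed:
    print(" ".join(source_tokens), "->", " ".join(target_tokens))
```

With text input, the language token is simply one more symbol in the source sequence. In speech translation the input is an audio signal rather than a token sequence, so the same trick has no direct equivalent, which is the setting the abstract goes on to examine.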
