Lost in Interpreting: Speech Translation from Source or Interpreter?

Dominik Mach\'a\v{c}ek; Mat\'u\v{s} \v{Z}ilinec; Ond\v{r}ej Bojar

arXiv:2106.09343·cs.CL·June 18, 2021

Lost in Interpreting: Speech Translation from Source or Interpreter?

Dominik Mach\'a\v{c}ek, Mat\'u\v{s} \v{Z}ilinec, Ond\v{r}ej Bojar

PDF

TL;DR

This paper compares automatic speech translation systems that follow either the original speaker or the interpreter, analyzing their translation quality, delay, and information retention using a new European Parliament corpus.

Contribution

It introduces the Europarl Simultaneous Interpreting Corpus (ESIC) and evaluates the impact of following the speaker versus the interpreter on translation quality and latency.

Findings

01

Interpreter-based systems have higher delay but better information retention.

02

Speaker-based systems are faster but may lose more information.

03

Human evaluation shows differences in information loss between approaches.

Abstract

Interpreters facilitate multi-lingual meetings but the affordable set of languages is often smaller than what is needed. Automatic simultaneous speech translation can extend the set of provided languages. We investigate if such an automatic system should rather follow the original speaker, or an interpreter to achieve better translation quality at the cost of increased delay. To answer the question, we release Europarl Simultaneous Interpreting Corpus (ESIC), 10 hours of recordings and transcripts of European Parliament speeches in English, with simultaneous interpreting into Czech and German. We evaluate quality and latency of speaker-based and interpreter-based spoken translation systems from English to Czech. We study the differences in implicit simplification and summarization of the human interpreter compared to a machine translation system trained to shorten the output to some…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.