ELITR Non-Native Speech Translation at IWSLT 2020

Dominik Mach\'a\v{c}ek; Jon\'a\v{s} Kratochv\'il; Sangeet Sagar,; Mat\'u\v{s} \v{Z}ilinec; Ond\v{r}ej Bojar; Thai-Son Nguyen; Felix Schneider,; Philip Williams; Yuekun Yao

arXiv:2006.03331·cs.CL·June 8, 2020·1 cites

ELITR Non-Native Speech Translation at IWSLT 2020

Dominik Mach\'a\v{c}ek, Jon\'a\v{s} Kratochv\'il, Sangeet Sagar,, Mat\'u\v{s} \v{Z}ilinec, Ond\v{r}ej Bojar, Thai-Son Nguyen, Felix Schneider,, Philip Williams, Yuekun Yao

PDF

Open Access

TL;DR

This paper presents ELITR's speech translation systems for IWSLT 2020, including offline and real-time ASR and SLT, emphasizing non-native speech handling and system selection strategies.

Contribution

It introduces new end-to-end and hybrid ASR systems trained on non-native speech and discusses system selection for non-native speech translation at IWSLT 2020.

Findings

01

Developed a new end-to-end general ASR system.

02

Trained a hybrid ASR on non-native speech data.

03

Compared multiple system candidates for non-native speech translation.

Abstract

This paper is an ELITR system submission for the non-native speech translation task at IWSLT 2020. We describe systems for offline ASR, real-time ASR, and our cascaded approach to offline SLT and real-time SLT. We select our primary candidates from a pool of pre-existing systems, develop a new end-to-end general ASR system, and a hybrid ASR trained on non-native speech. The provided small validation set prevents us from carrying out a complex validation, but we submit all the unselected candidates for contrastive evaluation on the test set.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis