Disentangling ASR and MT Errors in Speech Translation

Ngoc-Tien Le; Benjamin Lecouteux; Laurent Besacier

arXiv:1709.00678 · cs.CL · September 5, 2017 · 5 cites

Open Access

TL;DR

This paper proposes a method for automatically detecting and distinguishing errors originating from transcription and translation modules in speech translation systems, using a joint classifier and label extraction techniques.

Contribution

It introduces a novel approach to disentangle ASR and MT errors in speech translation, enabling more precise error analysis and quality assessment.

Findings

01

Effective joint classifier for 2-class and 3-class error detection

02

Successful label extraction methods for error source disentanglement

03

Qualitative analysis of error origins in speech translation output

Abstract

The main aim of this paper is to investigate automatic quality assessment for spoken language translation (SLT). More precisely, we investigate SLT errors that can be due to the transcription (ASR) or the translation (MT) module. We study automatic detection of SLT errors using a single classifier based on joint ASR and MT features, and we evaluate both 2-class (good/bad) and 3-class (good/badASR/badMT) labeling tasks. The 3-class problem requires disentangling ASR and MT errors in the speech translation output, and we propose two label extraction methods for this nontrivial step. As a by-product, this enables qualitative analysis of the SLT errors and their origin (are they due to the transcription or to the translation step?) on our large in-house corpus for French-to-English speech translation.
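The relationship between the two labeling tasks can be illustrated with a minimal Python sketch. Only the label names (good, badASR, badMT) come from the abstract. The helper names, the toy feature values, and the use of plain concatenation to form a joint feature vector are assumptions made for illustration; the abstract does not say how the ASR and MT features are combined, and this is not the authors' label extraction method.

```python
from collections import Counter

# Label names follow the abstract; everything else here is hypothetical.
THREE_CLASS = ("good", "badASR", "badMT")


def to_two_class(label):
    """Collapse a 3-class word label into the 2-class (good/bad) scheme."""
    if label not in THREE_CLASS:
        raise ValueError(f"unknown label: {label}")
    return "good" if label == "good" else "bad"


def joint_features(asr_feats, mt_feats):
    """Build one joint vector per word by concatenating ASR and MT features.

    Assumption: simple concatenation stands in for whatever feature
    combination the paper actually uses.
    """
    return list(asr_feats) + list(mt_feats)


def error_origin_counts(labels):
    """Count erroneous words by origin, ignoring words labeled 'good'."""
    return Counter(label for label in labels if label != "good")


# Toy 3-class labels for a four-word translation hypothesis (made-up data).
labels = ["good", "badASR", "badMT", "badMT"]
two_class = [to_two_class(label) for label in labels]
origins = error_origin_counts(labels)
```

Collapsing the 3-class labels this way shows why the 2-class task needs no disentangling step: both error origins map to the same "bad" label, while the 3-class task must decide which module each erroneous word is attributed to.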

Taxonomy

Topics: Natural Language Processing Techniques · Speech Recognition and Synthesis