Automatic Quality Assessment for Speech Translation Using Joint ASR and MT Features

Ngoc-Tien Le; Benjamin Lecouteux; Laurent Besacier

arXiv:1609.06049·cs.CL·October 2, 2016·1 cites

TL;DR

This paper introduces a novel approach for automatic quality assessment of speech translation by combining features from automatic speech recognition and machine translation, utilizing a new corpus and sequence labeling.

Contribution

It proposes word confidence estimators based on joint ASR and MT features for speech translation quality assessment, together with a new formalization of the task and a new corpus.

Findings

01. MT features are most influential for quality estimation
02. ASR features provide complementary information
03. Robust estimators can improve speech translation feedback

Abstract

This paper addresses automatic quality assessment of spoken language translation (SLT). This relatively new task is defined and formalized as a sequence labeling problem in which each word in the SLT hypothesis is tagged as good or bad according to a large feature set. We propose several word confidence estimators (WCE) based on our automatic evaluation of transcription (ASR) quality, translation (MT) quality, or both (combined ASR+MT). This research is possible because we built a dedicated corpus of 6.7k utterances, each with a quintuplet: ASR output, verbatim transcript, text translation, speech translation, and post-edition of the translation. Our experiments using joint ASR and MT features for WCE show that MT features remain the most influential, while ASR features can bring interesting complementary information. Our robust quality…
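The word-level formulation in the abstract can be illustrated with a small sketch. It is not the authors' pipeline: it assumes that reference good/bad tags can be approximated by aligning the SLT hypothesis with its post-edition using Python's `difflib`, and that per-word ASR and MT features are available as dictionaries. The function names `tag_words` and `join_features` and the feature keys `posterior` and `lm_score` are hypothetical.

```python
from difflib import SequenceMatcher


def tag_words(hypothesis, post_edition):
    """Tag each hypothesis word 'G' (good) if it is kept in the
    post-edition, else 'B' (bad).

    Assumption: a plain difflib alignment stands in for whatever
    alignment tool the authors actually used.
    """
    hyp = hypothesis.split()
    ref = post_edition.split()
    tags = ["B"] * len(hyp)
    matcher = SequenceMatcher(a=hyp, b=ref, autojunk=False)
    for block in matcher.get_matching_blocks():
        # Every hypothesis word inside a matching block is considered good.
        for i in range(block.a, block.a + block.size):
            tags[i] = "G"
    return list(zip(hyp, tags))


def join_features(asr_feats, mt_feats):
    """Merge per-word ASR and MT feature dicts into one joint dict,
    prefixing each key with its source (hypothetical feature names)."""
    joint = {f"asr_{k}": v for k, v in asr_feats.items()}
    joint.update({f"mt_{k}": v for k, v in mt_feats.items()})
    return joint


if __name__ == "__main__":
    for word, tag in tag_words("the cat sit on mat", "the cat sat on the mat"):
        print(word, tag)
    print(join_features({"posterior": 0.9}, {"lm_score": -2.1}))
```

A sequence labeler would then be trained on such joint feature dictionaries, one per hypothesis word, with the good/bad tags as targets. The ASR-only and MT-only estimators mentioned in the abstract correspond to using just one of the two feature sources.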

Code & Models

Repositories

besacier/WCE-LIG (Official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Speech and dialogue systems