Dynamic Masking for Improved Stability in Spoken Language Translation

Yuekun Yao; Barry Haddow

arXiv:2006.00249·cs.CL·June 2, 2021

TL;DR

This paper proposes a dynamic masking technique for spoken language translation that balances low latency and translation stability, reducing flicker without increasing delay.

Contribution

It introduces a novel dynamic masking approach that adaptively adjusts delay in MT output to improve online SLT performance.

Findings

1. Dynamic masking reduces flicker in translations.

2. Adaptive delay improves latency without quality loss.

3. Method outperforms fixed masking strategies.

Abstract

For spoken language translation (SLT) in live scenarios such as conferences, lectures and meetings, it is desirable to show the translation to the user as quickly as possible, avoiding an annoying lag between speaker and translated captions. In other words, we would like low-latency, online SLT. If we assume a pipeline of automatic speech recognition (ASR) and machine translation (MT), then a viable approach to online SLT is to pair an online ASR system with a retranslation strategy, where the MT system re-translates every update received from ASR. However, this can result in annoying "flicker" as the MT system updates its translation. A possible solution is to add a fixed delay, or "mask", to the output of the MT system, but a fixed global mask introduces undesirable latency to the output. We show how this mask can be set dynamically, improving the latency-flicker trade-off without…
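
To make the fixed-mask baseline described in the abstract concrete, here is a minimal sketch in Python. It assumes that a mask of size k simply hides the last k tokens of each retranslation, and that flicker can be approximated by counting displayed tokens that later have to be erased; both are illustrative assumptions, and the function names (apply_fixed_mask, erased_tokens, total_erasure) are hypothetical, not taken from the paper. The paper's dynamic mask is not reproduced here.

```python
from typing import List


def apply_fixed_mask(tokens: List[str], k: int) -> List[str]:
    """Hide the last k tokens of a translation (assumed form of a fixed mask)."""
    if k <= 0:
        return list(tokens)
    return tokens[:-k] if k < len(tokens) else []


def erased_tokens(previous: List[str], current: List[str]) -> int:
    """Count tokens of `previous` that must be erased to display `current`."""
    common = 0
    for old, new in zip(previous, current):
        if old != new:
            break
        common += 1
    return len(previous) - common


def total_erasure(translations: List[List[str]], k: int) -> int:
    """Sum erasures over successive retranslations shown with a fixed mask k."""
    shown: List[str] = []
    erased = 0
    for hypothesis in translations:
        visible = apply_fixed_mask(hypothesis, k)
        erased += erased_tokens(shown, visible)
        shown = visible
    return erased


# Hypothetical retranslations produced as the ASR prefix grows.
updates = [
    ["the"],
    ["the", "cat"],
    ["the", "dog", "sat"],
    ["the", "dog", "sat", "down"],
]
no_mask = total_erasure(updates, k=0)
mask_one = total_erasure(updates, k=1)
```

In this toy input, hiding one trailing token keeps the unstable word "cat" off the screen, so nothing is ever erased, but every word is displayed one update later. That is the latency cost of a fixed global mask that the abstract refers to.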

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications