Low-Latency Neural Speech Translation

Jan Niehues; Ngoc-Quan Pham; Thanh-Le Ha; Matthias Sperber; Alex; Waibel

arXiv:1808.00491·cs.CL·August 3, 2018·5 cites

Low-Latency Neural Speech Translation

Jan Niehues, Ngoc-Quan Pham, Thanh-Le Ha, Matthias Sperber, Alex, Waibel

PDF

Open Access

TL;DR

This paper presents a method to adapt neural machine translation systems for low-latency speech translation, effectively handling partial inputs and reducing correction steps without sacrificing translation quality.

Contribution

It introduces a novel adaptation approach using artificial data and multi-task learning to improve partial sentence translation in low-latency scenarios.

Findings

01

Reduced corrections by 45% during incremental translation

02

Maintained translation quality after adaptation

03

Effective adaptation without task-specific training data

Abstract

Through the development of neural machine translation, the quality of machine translation systems has been improved significantly. By exploiting advancements in deep learning, systems are now able to better approximate the complex mapping from source sentences to target sentences. But with this ability, new challenges also arise. An example is the translation of partial sentences in low-latency speech translation. Since the model has only seen complete sentences in training, it will always try to generate a complete sentence, though the input may only be a partial sentence. We show that NMT systems can be adapted to scenarios where no task-specific training data is available. Furthermore, this is possible without losing performance on the original training data. We achieve this by creating artificial data and by using multi-task learning. After adaptation, we are able to reduce the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications