Tackling data scarcity in speech translation using zero-shot   multilingual machine translation techniques

Tu Anh Dinh; Danni Liu; Jan Niehues

arXiv:2201.11172·cs.CL·May 17, 2022

Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques

Tu Anh Dinh, Danni Liu, Jan Niehues

PDF

1 Repo

TL;DR

This paper explores zero-shot multilingual techniques to improve speech translation performance under data scarcity by leveraging speech transcription and text translation data, with promising results in low-resource scenarios.

Contribution

It adapts zero-shot translation ideas from text to speech translation, demonstrating effective data augmentation and auxiliary loss functions for low-resource speech translation.

Findings

01

Achieved up to +12.9 BLEU points in low-resource settings.

02

Improved over direct end-to-end speech translation models.

03

Enhanced performance with auxiliary loss and data augmentation techniques.

Abstract

Recently, end-to-end speech translation (ST) has gained significant attention as it avoids error propagation. However, the approach suffers from data scarcity. It heavily depends on direct ST data and is less efficient in making use of speech transcription and text translation data, which is often more easily available. In the related field of multilingual text translation, several techniques have been proposed for zero-shot translation. A main idea is to increase the similarity of semantically similar sentences in different languages. We investigate whether these ideas can be applied to speech translation, by building ST models trained on speech transcription and text translation data. We investigate the effects of data augmentation and auxiliary loss function. The techniques were successfully applied to few-shot ST using limited ST data, with improvements of up to +12.9 BLEU points…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tuanh23/multimodalst
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.