Does Simultaneous Speech Translation need Simultaneous Models?

Sara Papi; Marco Gaido; Matteo Negri; Marco Turchi

arXiv:2204.03783·cs.CL·October 19, 2023·1 cites

Does Simultaneous Speech Translation need Simultaneous Models?

Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi

PDF

Open Access 1 Repo

TL;DR

This paper explores whether a single offline-trained speech translation model can effectively perform both offline and simultaneous translation tasks, potentially reducing computational costs and simplifying deployment.

Contribution

It demonstrates that an offline-trained model can match or outperform dedicated simultaneous models in translation quality without additional training.

Findings

01

Offline model achieves comparable or better quality than simultaneous models.

02

Using a single offline model simplifies deployment and reduces computational costs.

03

The approach is competitive with state-of-the-art SimulST methods.

Abstract

In simultaneous speech translation (SimulST), finding the best trade-off between high translation quality and low latency is a challenging task. To meet the latency constraints posed by the different application scenarios, multiple dedicated SimulST models are usually trained and maintained, generating high computational costs. In this paper, motivated by the increased social and environmental impact caused by these costs, we investigate whether a single model trained offline can serve not only the offline but also the simultaneous task without the need for any additional training or adaptation. Experiments on en->{de, es} indicate that, aside from facilitating the adoption of well-established offline techniques and architectures without affecting latency, the offline solution achieves similar or better translation quality compared to the same model trained in simultaneous settings, as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hlt-mt/fbk-fairseq
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis