Does Simultaneous Speech Translation need Simultaneous Models?
Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi

TL;DR
This paper explores whether a single offline-trained speech translation model can effectively perform both offline and simultaneous translation tasks, potentially reducing computational costs and simplifying deployment.
Contribution
It demonstrates that an offline-trained model can match or outperform dedicated simultaneous models in translation quality without additional training.
Findings
Offline model achieves comparable or better quality than simultaneous models.
Using a single offline model simplifies deployment and reduces computational costs.
The approach is competitive with state-of-the-art SimulST methods.
Abstract
In simultaneous speech translation (SimulST), finding the best trade-off between high translation quality and low latency is a challenging task. To meet the latency constraints posed by the different application scenarios, multiple dedicated SimulST models are usually trained and maintained, generating high computational costs. In this paper, motivated by the increased social and environmental impact caused by these costs, we investigate whether a single model trained offline can serve not only the offline but also the simultaneous task without the need for any additional training or adaptation. Experiments on en->{de, es} indicate that, aside from facilitating the adoption of well-established offline techniques and architectures without affecting latency, the offline solution achieves similar or better translation quality compared to the same model trained in simultaneous settings, as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
