SpeechT: Findings of the First Mentorship in Speech Translation

Yasmin Moslem; Juan Juli\'an Cea Mor\'an; Mariano Gonzalez-Gomez; Muhammad Hazim Al Farouq; Farah Abdou; Satarupa Deb

arXiv:2502.12050·cs.CL·June 3, 2025

SpeechT: Findings of the First Mentorship in Speech Translation

Yasmin Moslem, Juan Juli\'an Cea Mor\'an, Mariano Gonzalez-Gomez, Muhammad Hazim Al Farouq, Farah Abdou, Satarupa Deb

PDF

Open Access

TL;DR

This paper reports on the first mentorship in speech translation, highlighting activities like data augmentation and system comparison across multiple languages, aiming to advance speech translation research.

Contribution

It introduces the first mentorship program in speech translation, involving diverse languages and exploring data techniques and system architectures.

Findings

01

Explored data augmentation techniques for speech translation.

02

Compared end-to-end and cascaded systems across languages.

03

Provided insights into multilingual speech translation challenges.

Abstract

This work presents the details and findings of the first mentorship in speech translation (SpeechT), which took place in December 2024 and January 2025. To fulfil the mentorship requirements, the participants engaged in key activities, including data preparation, modelling, and advanced research. The participants explored data augmentation techniques and compared end-to-end and cascaded speech translation systems. The projects covered various languages other than English, including Arabic, Bengali, Galician, Indonesian, Japanese, and Spanish.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques