TalTech Systems for the Interspeech 2025 ML-SUPERB 2.0 Challenge

Tanel Alum\"ae; Artem Fedorchenko

arXiv:2506.01458·cs.CL·June 3, 2025

TalTech Systems for the Interspeech 2025 ML-SUPERB 2.0 Challenge

Tanel Alum\"ae, Artem Fedorchenko

PDF

Open Access

TL;DR

This paper presents a hybrid language identification and multilingual speech recognition system developed for the Interspeech 2025 ML-SUPERB 2.0 Challenge, achieving top overall performance.

Contribution

It introduces a novel hybrid system combining pretrained language embeddings with multiple speech recognition models tailored per language.

Findings

01

System achieved top overall score in the challenge.

02

Hybrid approach effectively combines language identification and speech recognition.

03

Multiple models optimized for different languages based on data availability.

Abstract

This paper describes the language identification and multilingual speech recognition system developed at Tallinn University of Technology for the Interspeech 2025 ML-SUPERB 2.0 Challenge. A hybrid language identification system is used, consisting of a pretrained language embedding model and a light-weight speech recognition model with a shared encoder across languages and language-specific bigram language models. For speech recognition, three models are used, where only a single model is applied for each language, depending on the training data availability and performance on held-out data. The model set consists of a finetuned version of SeamlessM4T, MMS-1B-all with custom language adapters and MMS-zeroshot. The system obtained the top overall score in the challenge.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed and Parallel Computing Systems · Scientific Computing and Data Management · Parallel Computing and Optimization Techniques

MethodsSparse Evolutionary Training