Adapting Language Balance in Code-Switching Speech

Enes Yavuz Ugan; Ngoc-Quan Pham; Alexander Waibel

arXiv:2510.18724·cs.CL·October 22, 2025

Adapting Language Balance in Code-Switching Speech

Enes Yavuz Ugan, Ngoc-Quan Pham, Alexander Waibel

PDF

Open Access

TL;DR

This paper introduces a method to improve large models' performance on code-switching speech by emphasizing learning at code-switching points, reducing errors in recognizing language switches, especially in Arabic and Chinese-English datasets.

Contribution

It proposes a differentiable surrogate that highlights code-switching points during training, enhancing model robustness against context bias in code-switching scenarios.

Findings

01

Reduced substitution errors at code-switching points

02

Improved prediction accuracy for language switches

03

Enhanced model robustness in multilingual speech recognition

Abstract

Despite achieving impressive results on standard benchmarks, large foundational models still struggle against code-switching test cases. When data scarcity cannot be used as the usual justification for poor performance, the reason may lie in the infrequent occurrence of code-switched moments, where the embedding of the second language appears subtly. Instead of expecting the models to learn this infrequency on their own, it might be beneficial to provide the training process with labels. Evaluating model performance on code-switching data requires careful localization of code-switching points where recognition errors are most consequential, so that the analysis emphasizes mistakes occurring at those moments. Building on this observation, we leverage the difference between the embedded and the main language to highlight those code-switching points and thereby emphasize learning at those…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Neurobiology of Language and Bilingualism