Code-Switching Detection with Data-Augmented Acoustic and Language Models
Emre Y{\i}lmaz, Henk van den Heuvel, David A. van Leeuwen

TL;DR
This paper enhances code-switching detection in speech recognition by data augmentation of acoustic and language models, focusing on Frisian-Dutch broadcasts, and reports improved detection accuracy and error analysis.
Contribution
It introduces data-augmented acoustic and language models trained on monolingual and generated CS text, significantly improving CS detection performance.
Findings
Improved CS detection accuracy over baseline models
Effective use of monolingual Dutch data for acoustic modeling
Enhanced language models with generated CS text
Abstract
In this paper, we investigate the code-switching detection performance of a code-switching (CS) automatic speech recognition (ASR) system with data-augmented acoustic and language models. We focus on the recognition of Frisian-Dutch radio broadcasts where one of the mixed languages, namely Frisian, is under-resourced. Recently, we have explored how the acoustic modeling (AM) can benefit from monolingual speech data belonging to the high-resourced mixed language. For this purpose, we have trained state-of-the-art AMs on a significantly increased amount of CS speech by applying automatic transcription and monolingual Dutch speech. Moreover, we have improved the language model (LM) by creating CS text in various ways including text generation using recurrent LMs trained on existing CS text. Motivated by the significantly improved CS ASR performance, we delve into the CS detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Speech and dialogue systems
