Code-Switching Detection with Data-Augmented Acoustic and Language Models

Emre Yılmaz; Henk van den Heuvel; David A. van Leeuwen

arXiv:1808.00521 · cs.CL · August 3, 2018 · 1 citation

TL;DR

This paper studies code-switching (CS) detection with an automatic speech recognition system whose acoustic and language models are trained on augmented data. It focuses on Frisian-Dutch radio broadcasts and reports improved detection accuracy along with an error analysis.

Contribution

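It applies data augmentation to both models of a CS ASR system: the acoustic models are trained on additional automatically transcribed CS speech and monolingual Dutch speech, and the language model on CS text that includes text generated by recurrent LMs. The resulting system improves CS detection performance.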

Findings

01 Improved CS detection accuracy over baseline models

02 Effective use of monolingual Dutch data for acoustic modeling

03 Enhanced language models with generated CS text

Abstract

In this paper, we investigate the code-switching detection performance of a code-switching (CS) automatic speech recognition (ASR) system with data-augmented acoustic and language models. We focus on the recognition of Frisian-Dutch radio broadcasts where one of the mixed languages, namely Frisian, is under-resourced. Recently, we have explored how the acoustic modeling (AM) can benefit from monolingual speech data belonging to the high-resourced mixed language. For this purpose, we have trained state-of-the-art AMs on a significantly increased amount of CS speech by applying automatic transcription and monolingual Dutch speech. Moreover, we have improved the language model (LM) by creating CS text in various ways including text generation using recurrent LMs trained on existing CS text. Motivated by the significantly improved CS ASR performance, we delve into the CS detection…
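To make the notion of CS detection concrete, the sketch below locates code-switch points in a recognized word sequence where each word carries a language tag. This is an illustrative assumption, not the authors' evaluation procedure: the function name `find_switch_points`, the tags `"fy"` (Frisian) and `"nl"` (Dutch), and the placeholder words are all hypothetical.

```python
from typing import List, Tuple


def find_switch_points(tagged_words: List[Tuple[str, str]]) -> List[int]:
    """Return each index i where word i has a different language tag than word i-1."""
    switches = []
    for i in range(1, len(tagged_words)):
        previous_lang = tagged_words[i - 1][1]
        current_lang = tagged_words[i][1]
        if current_lang != previous_lang:
            switches.append(i)
    return switches


# Hypothetical recognizer output: (word, language tag) pairs.
# "fy" and "nl" are assumed tag names; the words are placeholders.
hyp = [("w1", "fy"), ("w2", "fy"), ("w3", "nl"), ("w4", "nl"), ("w5", "fy")]
print(find_switch_points(hyp))  # [2, 4]
```

Comparing the switch points found in the recognizer output against those in a reference transcription is one simple way such detection could be scored.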

Taxonomy

Topics: Speech Recognition and Synthesis · Natural Language Processing Techniques · Speech and dialogue systems