Semi-supervised Learning for Code-Switching ASR with Large Language Model Filter
Yu Xi, Wen Ding, Kai Yu, Junjie Lai

TL;DR
This paper introduces a semi-supervised learning approach for code-switching automatic speech recognition, leveraging large language models to improve performance with limited code-switching data.
Contribution
It proposes a novel LLM-Filter within a noisy student training framework to enhance CS-ASR using monolingual data, outperforming existing methods.
Findings
Significant improvements over baseline models.
Achieves near upper-bound performance on CS English data.
Benefits from linguistically relevant monolingual data.
Abstract
Code-switching (CS) phenomenon occurs when words or phrases from different languages are alternated in a single sentence. Due to data scarcity, building an effective CS Automatic Speech Recognition (ASR) system remains challenging. In this paper, we propose to enhance CS-ASR systems by utilizing rich unsupervised monolingual speech data within a semi-supervised learning framework, particularly when access to CS data is limited. To achieve this, we establish a general paradigm for applying noisy student training (NST) to the CS-ASR task. Specifically, we introduce the LLM-Filter, which leverages well-designed prompt templates to activate the correction capability of large language models (LLMs) for monolingual data selection and pseudo-labels refinement during NST. Our experiments on the supervised ASRU-CS and unsupervised AISHELL-2 and LibriSpeech datasets show that our method not only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Text and Document Classification Technologies · Network Packet Processing and Optimization
MethodsStochastic Depth · RandAugment · Dropout · Noisy Student
