Improving speech recognition models with small samples for air traffic control systems
Yi Lin, Qin Li, Bo Yang, Zhen Yan, Huachun Tan, and Zhengmao Chen

TL;DR
This paper introduces a novel training approach combining unsupervised pretraining and transfer learning to improve speech recognition in air traffic control systems with limited labeled data, achieving significant error reduction.
Contribution
It proposes a new training framework that leverages unsupervised pretraining and transfer learning for domain-specific ASR with small datasets, enhancing performance.
Findings
Character error rate reduced to one-third of supervised training
Effective domain adaptation for ATC speech recognition
Validated on three real ATC datasets
Abstract
In the domain of air traffic control (ATC) systems, efforts to train a practical automatic speech recognition (ASR) model always faces the problem of small training samples since the collection and annotation of speech samples are expert- and domain-dependent task. In this work, a novel training approach based on pretraining and transfer learning is proposed to address this issue, and an improved end-to-end deep learning model is developed to address the specific challenges of ASR in the ATC domain. An unsupervised pretraining strategy is first proposed to learn speech representations from unlabeled samples for a certain dataset. Specifically, a masking strategy is applied to improve the diversity of the sample without losing their general patterns. Subsequently, transfer learning is applied to fine-tune a pretrained or other optimized baseline models to finally achieves the supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Natural Language Processing Techniques
