ATCSpeech: a multilingual pilot-controller speech corpus from real Air Traffic Control environment
Bo Yang, Xianlong Tan, Zhengmao Chen, Bing Wang, Dan Li, Zhongping, Yang, Xiping Wu, Yi Lin

TL;DR
This paper introduces ATCSpeech, a multilingual speech corpus from real Air Traffic Control environments, to support ASR research with real-world data and baseline model evaluations.
Contribution
It presents the first real, multilingual ATC speech corpus from actual systems, including Mandarin Chinese and English, with detailed data attributes and baseline ASR performance.
Findings
The corpus includes diverse speakers and speech qualities.
Baseline ASR models achieve promising results on the dataset.
The dataset supports non-commercial research and development in ATC ASR.
Abstract
Automatic Speech Recognition (ASR) is greatly developed in recent years, which expedites many applications on other fields. For the ASR research, speech corpus is always an essential foundation, especially for the vertical industry, such as Air Traffic Control (ATC). There are some speech corpora for common applications, public or paid. However, for the ATC, it is difficult to collect raw speeches from real systems due to safety issues. More importantly, for a supervised learning task like ASR, annotating the transcription is a more laborious work, which hugely restricts the prospect of ASR application. In this paper, a multilingual speech corpus (ATCSpeech) from real ATC systems, including accented Mandarin Chinese and English, is built and released to encourage the non-commercial ASR research in ATC domain. The corpus is detailly introduced from the perspective of data amount, speaker…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
