A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers
Juan Zuluaga-Gomez, Amrutha Prasad, Iuliia Nigmatulina, Petr Motlicek,, Matthias Kleinert

TL;DR
This paper introduces a comprehensive AI-based virtual simulation-pilot system for air traffic controller training, capable of understanding and generating realistic pilot communications using open-source tools.
Contribution
It presents the first fully open-source, modular virtual pilot system integrating speech recognition, understanding, and response generation for ATC training.
Findings
Achieved low word error rates of 5.5% and 15.9% on high and low-quality audio.
Enhanced call sign detection accuracy to over 96% with surveillance data.
Developed a robust, modular system adaptable with real-time data and training scenarios.
Abstract
In this paper we propose a novel virtual simulation-pilot engine for speeding up air traffic controller (ATCo) training by integrating different state-of-the-art artificial intelligence (AI) based tools. The virtual simulation-pilot engine receives spoken communications from ATCo trainees, and it performs automatic speech recognition and understanding. Thus, it goes beyond only transcribing the communication and can also understand its meaning. The output is subsequently sent to a response generator system, which resembles the spoken read back that pilots give to the ATCo trainees. The overall pipeline is composed of the following submodules: (i) automatic speech recognition (ASR) system that transforms audio into a sequence of words; (ii) high-level air traffic control (ATC) related entity parser that understands the transcribed voice communication; and (iii) a text-to-speech submodule…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and dialogue systems · Natural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Sigmoid Activation · Linear Layer · Bidirectional GRU · Highway Layer · Highway Network · Linear Warmup With Linear Decay · Max Pooling
