Unmanned Aerial Vehicle Control Through Domain-based Automatic Speech Recognition
Ruben Contreras, Angel Ayala, Francisco Cruz

TL;DR
This paper introduces a domain-based speech recognition system for controlling drones, improving accuracy over cloud-based methods, and supporting both English and Spanish in noisy environments.
Contribution
The work presents a novel domain-based speech recognition architecture that enhances drone control accuracy and supports bilingual commands in noisy conditions.
Findings
Phoneme matching significantly improves speech-to-action accuracy.
The system achieves 93.33% accuracy for English and 100% for Spanish commands.
Robustness to noise is demonstrated through experiments with distorted voice inputs.
Abstract
Currently, unmanned aerial vehicles, such as drones, are becoming a part of our lives and reaching out to many areas of society, including the industrialized world. A common alternative to control the movements and actions of the drone is through unwired tactile interfaces, for which different remote control devices can be found. However, control through such devices is not a natural, human-like communication interface, which sometimes is difficult to master for some users. In this work, we present a domain-based speech recognition architecture to effectively control an unmanned aerial vehicle such as a drone. The drone control is performed using a more natural, human-like way to communicate the instructions. Moreover, we implement an algorithm for command interpretation using both Spanish and English languages, as well as to control the movements of the drone in a simulated domestic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
