A Multi-tasking Model of Speaker-Keyword Classification for Keeping Human in the Loop of Drone-assisted Inspection
Yu Li, Anisha Parsan, Bill Wang, Penghao Dong, Shanshan Yao, Ruwen Qin

TL;DR
This paper introduces a multi-task deep learning model with a Share-Split-Collaborate architecture for speaker and keyword classification in drone-assisted inspections, enabling efficient adaptation to new inspectors and robust human-in-the-loop communication.
Contribution
The paper presents a novel multi-tasking model architecture that effectively handles heterogeneous inspectors and adapts with minimal data, improving human-robot interaction in inspection tasks.
Findings
Achieved a mean accuracy of 95.3% or higher in keyword classification for authorized inspectors.
Attained 99.2% accuracy in speaker identification.
Verified inspectors with a success rate of at least 93.9%.
Abstract
Audio commands are a preferred communication medium for keeping inspectors in the loop of civil infrastructure inspection performed by a semi-autonomous drone. To understand job-specific commands from a heterogeneous and dynamic group of inspectors, a model must be developed cost-effectively for the group and adapted easily when the group changes. This paper builds a multi-tasking deep learning model with a Share-Split-Collaborate architecture. The architecture lets the two classification tasks, speaker classification and keyword classification, share one feature extractor, and then uses feature projection and collaborative training to split apart the subject-specific and keyword-specific features that are intertwined in the extracted features. A base model for a group of five authorized subjects is trained and tested on the inspection keyword dataset collected by this study. The model achieved a 95.3% or higher mean accuracy in…
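To make the share-then-split idea in the abstract concrete, here is a minimal pure-Python sketch of a forward pass: one shared feature extractor feeds two task-specific projections, each followed by its own classifier. It is an illustration under stated assumptions, not the authors' model. The layer sizes, the random weights, the number of keywords, and all function names are hypothetical; only the count of five speakers comes from the abstract. The summed cross-entropy loss is a simple stand-in for joint training, since this summary does not describe the paper's feature projection or collaborative training in detail.

```python
import math
import random

random.seed(0)

def linear(x, w, b):
    """Dense layer: w holds one row of weights per output unit."""
    return [sum(wi * xi for wi, xi in zip(row, x)) + bi for row, bi in zip(w, b)]

def relu(v):
    return [max(0.0, a) for a in v]

def softmax(v):
    m = max(v)  # subtract the max for numerical stability
    exps = [math.exp(a - m) for a in v]
    total = sum(exps)
    return [e / total for e in exps]

def init(n_out, n_in):
    """Random weights and zero biases (illustrative, untrained)."""
    w = [[random.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    return w, [0.0] * n_out

# Hypothetical sizes; only N_SPEAKERS = 5 is taken from the abstract.
N_INPUT, N_SHARED, N_PROJ = 8, 6, 4
N_SPEAKERS, N_KEYWORDS = 5, 3

shared = init(N_SHARED, N_INPUT)        # Share: one extractor for both tasks
speaker_proj = init(N_PROJ, N_SHARED)   # Split: speaker-specific projection
keyword_proj = init(N_PROJ, N_SHARED)   # Split: keyword-specific projection
speaker_head = init(N_SPEAKERS, N_PROJ)
keyword_head = init(N_KEYWORDS, N_PROJ)

def forward(x):
    """Return (speaker probabilities, keyword probabilities) for one input."""
    h = relu(linear(x, *shared))
    s = relu(linear(h, *speaker_proj))
    k = relu(linear(h, *keyword_proj))
    return softmax(linear(s, *speaker_head)), softmax(linear(k, *keyword_head))

def joint_loss(x, speaker_id, keyword_id):
    """Sum of the two cross-entropy losses (assumed stand-in for joint training)."""
    p_s, p_k = forward(x)
    return -math.log(p_s[speaker_id]) - math.log(p_k[keyword_id])

x = [random.uniform(-1, 1) for _ in range(N_INPUT)]  # stand-in audio feature vector
p_speaker, p_keyword = forward(x)
loss = joint_loss(x, speaker_id=2, keyword_id=1)
```

Because both heads read from the same shared features, a gradient step on the summed loss would update the extractor with signal from both tasks, which is the general motivation for sharing it.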
Taxonomy
Topics: Infrastructure Maintenance and Monitoring · Occupational Health and Safety Research · Domain Adaptation and Few-Shot Learning
Methods: Balanced Selection
