Interpretable Multimodal Gesture Recognition for Drone and Mobile Robot Teleoperation via Log-Likelihood Ratio Fusion
Seungyeol Baek, Jaspreet Singh, Lala Shakti Swarup Ray, Hymalai Bello, Paul Lukowicz, and Sungho Suh

TL;DR
This paper presents a multimodal gesture recognition system combining inertial and capacitive data with a log-likelihood ratio fusion method, enabling robust, interpretable, and real-time teleoperation of robots and drones in hazardous environments.
Contribution
It introduces a novel multimodal fusion framework with interpretability for gesture recognition, validated on a new dataset, improving robustness and efficiency over vision-based methods.
Findings
Achieved recognition performance comparable to state-of-the-art vision methods.
Reduced computational cost, model size, and training time.
Demonstrated robustness in real-world teleoperation scenarios.
Abstract
Human operators are still frequently exposed to hazardous environments such as disaster zones and industrial facilities, where intuitive and reliable teleoperation of mobile robots and Unmanned Aerial Vehicles (UAVs) is essential. In this context, hands-free teleoperation enhances operator mobility and situational awareness, thereby improving safety in hazardous environments. While vision-based gesture recognition has been explored as one method for hands-free teleoperation, its performance often deteriorates under occlusions, lighting variations, and cluttered backgrounds, limiting its applicability in real-world operations. To overcome these limitations, we propose a multimodal gesture recognition framework that integrates inertial data (accelerometer, gyroscope, and orientation) from Apple Watches on both wrists with capacitive sensing signals from custom gloves. We design a late…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Teleoperation and Haptic Systems · Social Robot Interaction and HRI
