Multimodal Interaction with Multiple Co-located Drones in Search and Rescue Missions
Jonathan Cacace, Alberto Finzi, Vincenzo Lippiello

TL;DR
This paper introduces a multimodal interaction framework enabling a human rescuer to effectively communicate with multiple co-located drones during search and rescue missions, supporting natural mixed-initiative interaction in complex scenarios.
Contribution
The work presents a novel multimodal interaction framework tailored for human operators involved in search and rescue, accommodating sparse instructions and mixed-initiative communication with multiple drones.
Findings
Framework demonstrated in a simulated case study
Supports natural multimodal communication
Enhances human-robot collaboration in rescue scenarios
Abstract
We present a multimodal interaction framework suitable for a human rescuer that operates in proximity with a set of co-located drones during search missions. This work is framed in the context of the SHERPA project whose goal is to develop a mixed ground and aerial robotic platform to support search and rescue activities in a real-world alpine scenario. Differently from typical human-drone interaction settings, here the operator is not fully dedicated to the drones, but involved in search and rescue tasks, hence only able to provide sparse, incomplete, although high-value, instructions to the robots. This operative scenario requires a human-interaction framework that supports multimodal communication along with an effective and natural mixed-initiative interaction between the human and the robots. In this work, we illustrate the domain and the proposed multimodal interaction framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Robotics and Automated Systems · Interactive and Immersive Displays
