Pose2Drone: A Skeleton-Pose-based Framework for Human-Drone Interaction
Zdravko Marinov, Stanka Vasileva, Qing Wang, Constantin Seibold,, Jiaming Zhang, Rainer Stiefelhagen

TL;DR
This paper presents Pose2Drone, a gesture-based human-drone interaction framework utilizing skeleton pose estimation and monocular distance estimation, achieving high gesture recognition accuracy without additional sensors.
Contribution
Introducing a novel gesture-based HDI framework with monocular distance estimation, enabling safe and natural drone control using simple arm gestures.
Findings
93.5% gesture recognition accuracy
Effective monocular distance estimation method
Framework enables safe human-drone interaction
Abstract
Drones have become a common tool, which is utilized in many tasks such as aerial photography, surveillance, and delivery. However, operating a drone requires more and more interaction with the user. A natural and safe method for Human-Drone Interaction (HDI) is using gestures. In this paper, we introduce an HDI framework building upon skeleton-based pose estimation. Our framework provides the functionality to control the movement of the drone with simple arm gestures and to follow the user while keeping a safe distance. We also propose a monocular distance estimation method, which is entirely based on image features and does not require any additional depth sensors. To perform comprehensive experiments and quantitative analysis, we create a customized testing dataset. The experiments indicate that our HDI framework can achieve an average of 93.5\% accuracy in the recognition of 11…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Video Surveillance and Tracking Methods
