Leveraging Semantic and Geometric Information for Zero-Shot Robot-to-Human Handover
Jiangshan Liu, Wenlong Dong, Jiankun Wang, Max Q.-H. Meng

TL;DR
This paper introduces a zero-shot robot-to-human handover system that combines semantic and geometric data, improving grasp selection and interaction success in diverse object scenarios.
Contribution
It presents a novel zero-shot approach integrating vision-language models and customized prompts for optimal grasping in human-robot handovers.
Findings
Improved handover success rates in real-world tests
Enhanced user preference and interaction quality
Effective zero-shot grasp region identification
Abstract
Human-robot interaction (HRI) encompasses a wide range of collaborative tasks, with handover being one of the most fundamental. As robots become more integrated into human environments, the potential for service robots to assist in handing objects to humans is increasingly promising. In robot-to-human (R2H) handover, selecting the optimal grasp is crucial for success, as it requires avoiding interference with the humans preferred grasp region and minimizing intrusion into their workspace. Existing methods either inadequately consider geometric information or rely on data-driven approaches, which often struggle to generalize across diverse objects. To address these limitations, we propose a novel zero-shot system that combines semantic and geometric information to generate optimal handover grasps. Our method first identifies grasp regions using semantic knowledge from vision-language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
