Task-Oriented Robot-Human Handovers on Legged Manipulators
Andreea Tulbure, Carmen Scheidemann, Elias Steiner, Marco Hutter

TL;DR
This paper introduces AFT-Handover, a novel framework combining large language model reasoning and texture-based transfer to enable zero-shot, generalizable task-oriented robot-human handovers on legged manipulators, improving success rates and user preference.
Contribution
The paper presents a new framework that integrates LLM-driven affordance reasoning with texture transfer for zero-shot, generalizable handovers, extending capabilities to legged manipulators.
Findings
Improved handover success rates across diverse object-task pairs.
Significantly higher user preference over state-of-the-art methods.
Effective demonstration on legged manipulators for real-world applications.
Abstract
Task-oriented handovers (TOH) are fundamental to effective human-robot collaboration, requiring robots to present objects in a way that supports the human's intended post-handover use. Existing approaches are typically based on object- or task-specific affordances, but their ability to generalize to novel scenarios is limited. To address this gap, we present AFT-Handover, a framework that integrates large language model (LLM)-driven affordance reasoning with efficient texture-based affordance transfer to achieve zero-shot, generalizable TOH. Given a novel object-task pair, the method retrieves a proxy exemplar from a database, establishes part-level correspondences via LLM reasoning, and texturizes affordances for feature-based point cloud transfer. We evaluate AFT-Handover across diverse task-object pairs, showing improved handover success rates and stronger generalization compared to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Social Robot Interaction and HRI · Multimodal Machine Learning Applications
