Toward Human-Robot Teaming: Learning Handover Behaviors from 3D Scenes

Yuekun Wu; Yik Lung Pang; Andrea Cavallaro; Changjae Oh

arXiv:2508.09855·cs.RO·August 14, 2025

Toward Human-Robot Teaming: Learning Handover Behaviors from 3D Scenes

Yuekun Wu, Yik Lung Pang, Andrea Cavallaro, Changjae Oh

PDF

TL;DR

This paper presents a novel method for training human-robot handover policies using only RGB images and scene reconstruction, eliminating the need for real-robot data or simulation, thereby improving robustness and efficiency.

Contribution

The authors introduce a scene reconstruction-based approach that enables learning handover behaviors solely from RGB images without real-robot training or simulation, addressing domain gap issues.

Findings

01

Effective in both reconstructed scenes and real-world experiments

02

Enables stable object handovers with collision avoidance

03

Reduces reliance on large-scale real-robot data collection

Abstract

Human-robot teaming (HRT) systems often rely on large-scale datasets of human and robot interactions, especially for close-proximity collaboration tasks such as human-robot handovers. Learning robot manipulation policies from raw, real-world image data requires a large number of robot-action trials in the physical environment. Although simulation training offers a cost-effective alternative, the visual domain gap between simulation and robot workspace remains a major limitation. We introduce a method for training HRT policies, focusing on human-to-robot handovers, solely from RGB images without the need for real-robot training or real-robot data collection. The goal is to enable the robot to reliably receive objects from a human with stable grasping while avoiding collisions with the human hand. The proposed policy learner leverages sparse-view Gaussian Splatting reconstruction of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.